NLTK Book Chapter 2

as with Chapter 1, be sure to run the following in the python interpreter beforehand:

>>> import nltk
>>> nltk.download()

(download nltk_book corpus)

Accessing Text Corpora and Lexical Resources

Accessing Text Corpora

Gutenberg Corpus


In [1]:
%matplotlib inline
import matplotlib.pyplot as plt

import nltk
nltk.corpus.gutenberg.fileids()


Out[1]:
['austen-emma.txt',
 'austen-persuasion.txt',
 'austen-sense.txt',
 'bible-kjv.txt',
 'blake-poems.txt',
 'bryant-stories.txt',
 'burgess-busterbrown.txt',
 'carroll-alice.txt',
 'chesterton-ball.txt',
 'chesterton-brown.txt',
 'chesterton-thursday.txt',
 'edgeworth-parents.txt',
 'melville-moby_dick.txt',
 'milton-paradise.txt',
 'shakespeare-caesar.txt',
 'shakespeare-hamlet.txt',
 'shakespeare-macbeth.txt',
 'whitman-leaves.txt']

Pick Emma (Jane Austen) from corpus


In [2]:
emma = nltk.corpus.gutenberg.words('austen-emma.txt')
emma


Out[2]:
['[', 'Emma', 'by', 'Jane', 'Austen', '1816', ']', ...]

In [3]:
len(emma)


Out[3]:
192427

In [4]:
from nltk.corpus import gutenberg
gutenberg.fileids()


Out[4]:
['austen-emma.txt',
 'austen-persuasion.txt',
 'austen-sense.txt',
 'bible-kjv.txt',
 'blake-poems.txt',
 'bryant-stories.txt',
 'burgess-busterbrown.txt',
 'carroll-alice.txt',
 'chesterton-ball.txt',
 'chesterton-brown.txt',
 'chesterton-thursday.txt',
 'edgeworth-parents.txt',
 'melville-moby_dick.txt',
 'milton-paradise.txt',
 'shakespeare-caesar.txt',
 'shakespeare-hamlet.txt',
 'shakespeare-macbeth.txt',
 'whitman-leaves.txt']

In [5]:
emma = gutenberg.words('austen-emma.txt')
emma


Out[5]:
['[', 'Emma', 'by', 'Jane', 'Austen', '1816', ']', ...]

In [6]:
type(gutenberg)


Out[6]:
nltk.corpus.reader.plaintext.PlaintextCorpusReader

other information from the gutenberg object


In [7]:
print('avg word length |', 'avg sentence length |', '# times each vocab item appears on avg |', 'fileid')

for fileid in gutenberg.fileids():
    num_chars = len(gutenberg.raw(fileid))
    num_words = len(gutenberg.words(fileid))
    num_sents = len(gutenberg.sents(fileid))
    num_vocab = len(set(w.lower() for w in gutenberg.words(fileid)))
    print(round(num_chars / num_words), 
          round(num_words/num_sents), 
          round(num_words/num_vocab),
          fileid)


avg word length | avg sentence length | # times each vocab item appears on avg | fileid
5 25 26 austen-emma.txt
5 26 17 austen-persuasion.txt
5 28 22 austen-sense.txt
4 34 79 bible-kjv.txt
5 19 5 blake-poems.txt
4 19 14 bryant-stories.txt
4 18 12 burgess-busterbrown.txt
4 20 13 carroll-alice.txt
5 20 12 chesterton-ball.txt
5 23 11 chesterton-brown.txt
5 18 11 chesterton-thursday.txt
4 21 25 edgeworth-parents.txt
5 26 15 melville-moby_dick.txt
5 52 11 milton-paradise.txt
4 12 9 shakespeare-caesar.txt
4 12 8 shakespeare-hamlet.txt
4 12 7 shakespeare-macbeth.txt
5 36 12 whitman-leaves.txt

"average word length appears to be a general property of English, since it has a recurrent value of 4" -- num_chars counts spaces though...

Other values seem to be characteristics of particular authors.

Methods:

  • raw gives raw text as a string
  • words gives a list-like object of words in the text
  • sents gives a list-like object of sentences from the text (each sentence is a list of words)

In [8]:
macbeth_sentences = gutenberg.sents('shakespeare-macbeth.txt')
macbeth_sentences


Out[8]:
[['[', 'The', 'Tragedie', 'of', 'Macbeth', 'by', 'William', 'Shakespeare', '1603', ']'], ['Actus', 'Primus', '.'], ...]

In [9]:
macbeth_sentences[1116]


Out[9]:
['Double',
 ',',
 'double',
 ',',
 'toile',
 'and',
 'trouble',
 ';',
 'Fire',
 'burne',
 ',',
 'and',
 'Cauldron',
 'bubble']

In [10]:
longest_len = max(len(s) for s in macbeth_sentences)
[s for s in macbeth_sentences if len(s) == longest_len][:1]


Out[10]:
[['Doubtfull',
  'it',
  'stood',
  ',',
  'As',
  'two',
  'spent',
  'Swimmers',
  ',',
  'that',
  'doe',
  'cling',
  'together',
  ',',
  'And',
  'choake',
  'their',
  'Art',
  ':',
  'The',
  'mercilesse',
  'Macdonwald',
  '(',
  'Worthie',
  'to',
  'be',
  'a',
  'Rebell',
  ',',
  'for',
  'to',
  'that',
  'The',
  'multiplying',
  'Villanies',
  'of',
  'Nature',
  'Doe',
  'swarme',
  'vpon',
  'him',
  ')',
  'from',
  'the',
  'Westerne',
  'Isles',
  'Of',
  'Kernes',
  'and',
  'Gallowgrosses',
  'is',
  'supply',
  "'",
  'd',
  ',',
  'And',
  'Fortune',
  'on',
  'his',
  'damned',
  'Quarry',
  'smiling',
  ',',
  'Shew',
  "'",
  'd',
  'like',
  'a',
  'Rebells',
  'Whore',
  ':',
  'but',
  'all',
  "'",
  's',
  'too',
  'weake',
  ':',
  'For',
  'braue',
  'Macbeth',
  '(',
  'well',
  'hee',
  'deserues',
  'that',
  'Name',
  ')',
  'Disdayning',
  'Fortune',
  ',',
  'with',
  'his',
  'brandisht',
  'Steele',
  ',',
  'Which',
  'smoak',
  "'",
  'd',
  'with',
  'bloody',
  'execution',
  '(',
  'Like',
  'Valours',
  'Minion',
  ')',
  'caru',
  "'",
  'd',
  'out',
  'his',
  'passage',
  ',',
  'Till',
  'hee',
  'fac',
  "'",
  'd',
  'the',
  'Slaue',
  ':',
  'Which',
  'neu',
  "'",
  'r',
  'shooke',
  'hands',
  ',',
  'nor',
  'bad',
  'farwell',
  'to',
  'him',
  ',',
  'Till',
  'he',
  'vnseam',
  "'",
  'd',
  'him',
  'from',
  'the',
  'Naue',
  'toth',
  "'",
  'Chops',
  ',',
  'And',
  'fix',
  "'",
  'd',
  'his',
  'Head',
  'vpon',
  'our',
  'Battlements']]

Web and Chat Text

  • Firefox discussion forum
  • Conversations overheard in New York
  • movie script from Pirates of the Caribbean
  • personal ads
  • wine reviews

In [11]:
from nltk.corpus import webtext
for fileid in webtext.fileids():
    print(fileid, '\t', webtext.raw(fileid)[:65], '...', end='\n\n')


firefox.txt 	 Cookie Manager: "Don't allow sites that set removed cookies to se ...

grail.txt 	 SCENE 1: [wind] [clop clop clop] 
KING ARTHUR: Whoa there!  [clop ...

overheard.txt 	 White guy: So, do you have any plans for this evening?
Asian girl ...

pirates.txt 	 PIRATES OF THE CARRIBEAN: DEAD MAN'S CHEST, by Ted Elliott & Terr ...

singles.txt 	 25 SEXY MALE, seeks attrac older single lady, for discreet encoun ...

wine.txt 	 Lovely delicate, fragrant Rhone wine. Polished leather and strawb ...


Corpus of instant messaging chat sessions


In [12]:
from nltk.corpus import nps_chat
chatroom = nps_chat.posts('10-19-20s_706posts.xml')
chatroom[123]


Out[12]:
['i',
 'do',
 "n't",
 'want',
 'hot',
 'pics',
 'of',
 'a',
 'female',
 ',',
 'I',
 'can',
 'look',
 'in',
 'a',
 'mirror',
 '.']

Brown Corpus

Text sources categorized by genre (news, editorial, etc.)


In [13]:
from nltk.corpus import brown
brown.categories()


Out[13]:
['adventure',
 'belles_lettres',
 'editorial',
 'fiction',
 'government',
 'hobbies',
 'humor',
 'learned',
 'lore',
 'mystery',
 'news',
 'religion',
 'reviews',
 'romance',
 'science_fiction']

Selecting multiple categories...


In [14]:
brown.words(categories='news')


Out[14]:
['The', 'Fulton', 'County', 'Grand', 'Jury', 'said', ...]

In [15]:
brown.words(fileids=['cg22'])


Out[15]:
['Does', 'our', 'society', 'have', 'a', 'runaway', ',', ...]

In [16]:
brown.sents(categories=['news', 'editorial', 'reviews'])


Out[16]:
[['The', 'Fulton', 'County', 'Grand', 'Jury', 'said', 'Friday', 'an', 'investigation', 'of', "Atlanta's", 'recent', 'primary', 'election', 'produced', '``', 'no', 'evidence', "''", 'that', 'any', 'irregularities', 'took', 'place', '.'], ['The', 'jury', 'further', 'said', 'in', 'term-end', 'presentments', 'that', 'the', 'City', 'Executive', 'Committee', ',', 'which', 'had', 'over-all', 'charge', 'of', 'the', 'election', ',', '``', 'deserves', 'the', 'praise', 'and', 'thanks', 'of', 'the', 'City', 'of', 'Atlanta', "''", 'for', 'the', 'manner', 'in', 'which', 'the', 'election', 'was', 'conducted', '.'], ...]

comparing genres by their usage of modal verbs


In [17]:
from nltk.corpus import brown
news_text = brown.words(categories='news')
fdist = nltk.FreqDist(w.lower() for w in news_text)
modals = ['can', 'could', 'may', 'might', 'must', 'will']

for m in modals:
    print(m + ':', fdist[m], end=' ')


can: 94 could: 87 may: 93 might: 38 must: 53 will: 389 

obtaining counts for each genre of interest


In [18]:
cfd = nltk.ConditionalFreqDist((genre, word) 
                               for genre in brown.categories() 
                               for word in brown.words(categories=genre))
cfd


Out[18]:
defaultdict(nltk.probability.FreqDist,
            {'adventure': Counter({'Without': 1,
                      'lighted': 5,
                      'laden': 1,
                      'shimmering': 1,
                      'More': 1,
                      'assent': 1,
                      'festival': 1,
                      'speak': 6,
                      'located': 2,
                      'pandanus': 1,
                      'education': 4,
                      'house': 60,
                      'muscles': 5,
                      "She'd": 1,
                      'tube': 1,
                      'travelers': 1,
                      'Adams': 5,
                      'waters': 6,
                      'forearm': 1,
                      'dig': 2,
                      'hurdled': 1,
                      'suds': 3,
                      'use': 15,
                      'calloused': 1,
                      'killed': 15,
                      'experience': 1,
                      'sunbaked': 1,
                      'served': 3,
                      'octagonal': 1,
                      'bars': 5,
                      'many-times': 1,
                      'thoroughly': 1,
                      "Ayres'": 1,
                      'unaware': 1,
                      'singed': 1,
                      'abrupt': 1,
                      'certain': 7,
                      'heaved': 1,
                      'Tomorrow': 1,
                      'Nazi': 1,
                      'demanded': 4,
                      'shade': 3,
                      'others': 19,
                      'sunshine': 2,
                      'vicinity': 1,
                      'Wait': 4,
                      'Unconcerned': 1,
                      'strains': 1,
                      'mercy': 1,
                      'bucking': 1,
                      'taut': 1,
                      'catch': 10,
                      'ngandlu': 1,
                      'beautiful': 5,
                      'warrior': 3,
                      "stealin'": 1,
                      'supported': 2,
                      'brushing': 2,
                      'grotesquely': 1,
                      'McLish': 3,
                      'gesticulating': 1,
                      'Stevens': 8,
                      'Kneeling': 1,
                      'lapsing': 1,
                      'paradoxically': 1,
                      'chicken': 4,
                      'determine': 2,
                      'goaded': 1,
                      'correct': 2,
                      'cabins': 1,
                      'glance': 6,
                      'cups': 1,
                      'kitchen': 7,
                      'amount': 2,
                      'rabbits': 1,
                      'crack': 5,
                      'glances': 1,
                      'pressure': 6,
                      'overrun': 1,
                      'thet': 2,
                      'cheek': 11,
                      'perpetrator': 1,
                      'buddies': 2,
                      'dangle': 1,
                      'way': 65,
                      'gauche': 1,
                      'Ballet': 1,
                      'bawled': 1,
                      "wouldn't": 25,
                      'gulps': 1,
                      'cheated': 1,
                      'through': 78,
                      'rimmed': 1,
                      'Just': 22,
                      "we'll": 6,
                      'each': 26,
                      'Because': 3,
                      'Plus': 1,
                      'Ah': 8,
                      'interminable': 1,
                      'mustering': 1,
                      'pavement': 6,
                      'tons': 3,
                      'surreptitiously': 1,
                      'timid': 1,
                      'Seton': 1,
                      'decaying': 1,
                      'Walters': 1,
                      'forget': 3,
                      'apparently': 2,
                      'unselfishly': 1,
                      "yesterday's": 1,
                      'ye': 1,
                      'tossed': 5,
                      'shy': 2,
                      'raised': 7,
                      'fertility': 1,
                      'stray': 1,
                      'Easterners': 1,
                      'stream': 2,
                      'revealed': 2,
                      'sprung': 1,
                      'eternal': 1,
                      'fences': 1,
                      'At': 25,
                      ')': 22,
                      'wallowed': 1,
                      'focused': 1,
                      'patrolman': 3,
                      'clog': 1,
                      'tipsy': 1,
                      'Those': 4,
                      'surprisingly': 1,
                      'half-a-dozen': 1,
                      'sizzled': 1,
                      'lucky': 5,
                      'career': 1,
                      'idly': 2,
                      'meld': 1,
                      'husband': 13,
                      'friends': 6,
                      'bet': 3,
                      'Prime': 2,
                      'shouting': 7,
                      'worse': 2,
                      'Shillong': 1,
                      'slip': 1,
                      'unenunciated': 1,
                      'small': 35,
                      'bull': 1,
                      'doorways': 1,
                      'enemy': 15,
                      'wept': 2,
                      'neckline': 1,
                      'also': 19,
                      'knife': 11,
                      'recently': 2,
                      'reined': 2,
                      'distinct': 2,
                      'snapped': 7,
                      'next': 22,
                      'cipher': 1,
                      'propeller': 1,
                      'sly': 1,
                      'sweltering': 1,
                      'strike': 3,
                      'four': 16,
                      'lame': 1,
                      'coiling': 1,
                      'really': 18,
                      'broad': 4,
                      'Please': 3,
                      'comforting': 1,
                      'Pistol-whipping': 1,
                      'rushed': 3,
                      'effect': 3,
                      'planted': 1,
                      'exact': 1,
                      'clothed': 1,
                      'Anything': 3,
                      'cell': 3,
                      'King': 1,
                      'steeper': 2,
                      "bo'sun's": 1,
                      "Pettigrew's": 1,
                      'Remembering': 1,
                      'evening': 9,
                      'dripped': 1,
                      'pungent': 1,
                      'Houses': 1,
                      'Whaddya': 1,
                      'telegraph': 1,
                      'Joseph': 3,
                      'coward': 3,
                      'daylight': 5,
                      'These': 6,
                      'retrieved': 1,
                      'towering': 3,
                      'runaway': 1,
                      "Todman's": 1,
                      'ask': 8,
                      'found': 36,
                      'steadiness': 1,
                      "one's": 2,
                      'Laurel': 1,
                      'badge-toter': 1,
                      'pretty': 6,
                      'mother': 14,
                      "t'lah": 1,
                      'things': 18,
                      'Now': 37,
                      'thank': 4,
                      'balled': 1,
                      'negligence': 1,
                      "He'll": 2,
                      'Running': 1,
                      'permission': 2,
                      'suppose': 4,
                      'vertical': 1,
                      'bits': 2,
                      'goodbye': 1,
                      'onct': 2,
                      'cocked': 2,
                      'midst': 1,
                      'drawn': 6,
                      'sir': 2,
                      'bat': 2,
                      'cellophane': 1,
                      'gallop': 2,
                      'marrowbones': 1,
                      'Bear': 1,
                      'brethren': 1,
                      'hatred': 3,
                      'snack': 1,
                      'German': 11,
                      'neighbor': 1,
                      'scream': 4,
                      'signal': 5,
                      'grabbed': 10,
                      'musicians': 1,
                      'gushed': 2,
                      'shoot': 7,
                      'pierced': 2,
                      'shadow': 5,
                      'pinpoints': 1,
                      'stiffened': 1,
                      'radii': 1,
                      "wasn't": 17,
                      "Delphine's": 2,
                      'narrowing': 1,
                      'flights': 5,
                      'dying': 5,
                      'required': 1,
                      'remained': 11,
                      'Griggs': 3,
                      'Lord': 23,
                      'throng': 1,
                      'beasts': 1,
                      'strip': 6,
                      'hunting': 3,
                      'resource': 1,
                      'run': 11,
                      'discussed': 1,
                      'speedy': 1,
                      'stores': 1,
                      'veering': 1,
                      'cobra': 2,
                      'quavered': 1,
                      'web': 1,
                      'lilting': 1,
                      'nerves': 3,
                      'shark-infested': 1,
                      '30': 1,
                      'motioning': 2,
                      'themselves': 6,
                      'developing': 1,
                      'prettiness': 1,
                      'storefront': 1,
                      "wieners'": 1,
                      'Deputy': 2,
                      'experiences': 1,
                      'wicker': 1,
                      'However': 1,
                      'drahve': 1,
                      'stumps': 1,
                      'positive': 2,
                      'Dammit': 2,
                      'Cheyennes': 2,
                      'execution': 1,
                      'Howard': 1,
                      'speaks': 1,
                      'thorny': 1,
                      'clean': 6,
                      "mulatto's": 1,
                      'pulled': 19,
                      'doggone': 1,
                      'framed': 2,
                      'Dawn': 1,
                      'methods': 3,
                      'visited': 3,
                      'loaded': 1,
                      'Also': 3,
                      'radio': 8,
                      'coat': 11,
                      'Hustle': 1,
                      'trees': 21,
                      'anonymous': 5,
                      'Japanese': 4,
                      'Begging': 1,
                      'action-packed': 1,
                      'seeming': 2,
                      'devastating': 1,
                      'sixty': 2,
                      'broad-brimmed': 1,
                      'problem': 4,
                      'winter': 1,
                      'distance': 12,
                      'divining': 1,
                      'walking': 3,
                      'sleeve': 6,
                      'floorboards': 1,
                      'proffered': 2,
                      'enlightened': 1,
                      "money's": 1,
                      'looking': 19,
                      'goad': 1,
                      'Journal': 1,
                      'chamois': 1,
                      'crook': 1,
                      'splinters': 1,
                      'suggest': 1,
                      'British': 3,
                      'sheeted': 1,
                      'trading': 2,
                      'Time': 2,
                      'humor': 1,
                      'baggy': 1,
                      'fibers': 1,
                      'Smiling': 1,
                      "We're": 8,
                      'sentimental': 1,
                      'Big': 5,
                      'descending': 1,
                      'thrills': 1,
                      'garbed': 1,
                      'corkers': 1,
                      'inductees': 1,
                      'pointed': 7,
                      'wounded': 6,
                      'thumbing': 1,
                      'determination': 1,
                      'hey': 1,
                      'slowly': 14,
                      'place': 39,
                      'piece': 8,
                      'rode': 12,
                      'amusement': 1,
                      'bombers': 1,
                      'Piwen': 1,
                      'tasted': 1,
                      'atmosphere': 2,
                      'teetering': 1,
                      'Montero': 10,
                      'urgent': 1,
                      'land': 16,
                      'gravely': 2,
                      'blocking': 1,
                      'wires': 2,
                      'think': 44,
                      'could': 151,
                      'enjoying': 1,
                      'rusty': 4,
                      'stopped': 17,
                      'Wilson': 17,
                      'Dangerous': 2,
                      'marvelled': 1,
                      'related': 1,
                      'elaborate': 2,
                      'Along': 2,
                      'pride': 4,
                      'compulsion': 1,
                      'twisting': 5,
                      'darkened': 2,
                      'rattle': 1,
                      'reeking': 1,
                      'rites': 1,
                      'brute': 1,
                      'Juan': 2,
                      'Corporal': 2,
                      'ran': 33,
                      'wagons': 7,
                      'meant': 14,
                      'Oooo': 1,
                      'shield': 1,
                      'intent': 1,
                      'tiny': 4,
                      'seepage': 1,
                      'picture': 3,
                      'guarding': 2,
                      'A.M.': 2,
                      'sympathetic': 1,
                      'Chicago': 1,
                      'stairway': 1,
                      'colossal': 1,
                      'buildings': 10,
                      'between': 19,
                      "weeks'": 1,
                      'cost': 2,
                      'collapsing': 1,
                      'host': 1,
                      'courteously': 2,
                      'intimate': 2,
                      'lifeless': 1,
                      'braces': 1,
                      'eighth': 1,
                      'tombstone': 1,
                      'Senora': 1,
                      'recognize': 1,
                      'emerge': 1,
                      'Ramey': 28,
                      'wagon': 27,
                      'companions': 1,
                      'higher': 5,
                      'Dutch': 4,
                      'willow': 1,
                      'foliage': 1,
                      'turtlebacks': 1,
                      'nod': 3,
                      'decomposing': 1,
                      'firmly': 5,
                      'deplorable': 1,
                      'Soak': 1,
                      'backed': 6,
                      'gunner': 1,
                      'placid': 2,
                      'Emigrant': 2,
                      'dismissed': 2,
                      'nowhere': 6,
                      'told': 46,
                      'Alice': 1,
                      "Munich's": 1,
                      'The': 410,
                      'Philly': 1,
                      'heat': 7,
                      'someplace': 2,
                      'Opening': 1,
                      'mouldering': 1,
                      'riders': 1,
                      'Resisting': 1,
                      'of': 1322,
                      'feminist': 1,
                      "Morgan's": 1,
                      'withdrawn': 1,
                      'law': 6,
                      'spiritual': 1,
                      'Ponkob': 1,
                      'scouts': 1,
                      'Perdido': 3,
                      'nearing': 1,
                      'stained': 3,
                      'manservant': 2,
                      'police': 5,
                      'guess': 11,
                      'fates': 1,
                      'First': 4,
                      'rear': 19,
                      'crossbars': 1,
                      'Hallelujah': 1,
                      'doubloon': 1,
                      'metal': 5,
                      'corner': 19,
                      'sworn': 2,
                      'ducked': 2,
                      'bold': 5,
                      'lived': 10,
                      'protest': 1,
                      'Sierra': 1,
                      'self-confidence': 1,
                      'mingled': 1,
                      'dispersed': 1,
                      'maid': 1,
                      'Grace': 3,
                      'facts': 1,
                      "two-bits'": 1,
                      'Figger': 1,
                      'unusually': 4,
                      'cursed': 3,
                      'idiotically': 1,
                      'craved': 1,
                      'peculiar': 1,
                      'display': 2,
                      'Siberia': 1,
                      'frothing': 1,
                      'funny': 2,
                      'entry': 1,
                      'overhead': 2,
                      'outside': 10,
                      'hung': 14,
                      'book': 4,
                      'Methodists': 1,
                      'suggested': 1,
                      "coroner's": 4,
                      'harshly': 3,
                      'comport': 1,
                      'jacket': 6,
                      'Seeing': 2,
                      'bound': 4,
                      'intend': 1,
                      'precaution': 2,
                      'Okay': 1,
                      'poems': 4,
                      'endlessly': 1,
                      'purpose': 5,
                      'blood-chilling': 1,
                      'give': 27,
                      'engraving': 1,
                      'Bullets': 1,
                      'John-Henry': 1,
                      'wisps': 1,
                      'dishes': 5,
                      'tape': 1,
                      'unnamed': 1,
                      'odor': 2,
                      'stomping': 1,
                      'Warmly': 1,
                      'Stetson': 1,
                      'ammunition': 5,
                      'flatly': 1,
                      'tunnel': 1,
                      'blond': 5,
                      'waving': 1,
                      'crest': 2,
                      'applying': 3,
                      'allow': 4,
                      'dangerous': 2,
                      'Aj': 1,
                      'Embarcadero': 1,
                      'daylights': 1,
                      'missing': 1,
                      'actually': 6,
                      'Several': 1,
                      'connected': 1,
                      'softened': 2,
                      'Billy': 17,
                      'after': 53,
                      'questioning': 4,
                      'young': 24,
                      'anyway': 7,
                      'authority': 3,
                      'nester': 2,
                      'Perhaps': 6,
                      'grace': 1,
                      'confused': 2,
                      'Shall': 4,
                      'big': 50,
                      'ill-conceived': 1,
                      'fawn': 1,
                      'quantities': 1,
                      'arresting': 1,
                      'grenades': 1,
                      'riddled': 1,
                      'fills': 1,
                      'attempted': 4,
                      'completely': 7,
                      'pleasure': 5,
                      "'tain't": 1,
                      'unwaveringly': 1,
                      'opened': 19,
                      'left': 47,
                      'stain': 1,
                      'fortified': 1,
                      'Holy': 1,
                      'clip': 1,
                      'blast': 4,
                      'stood': 36,
                      'worst': 3,
                      'Coble': 1,
                      'deck': 7,
                      'yuh': 1,
                      'drinking': 6,
                      'toasted': 1,
                      'street': 23,
                      'liveried': 1,
                      'bow': 2,
                      'lawmen': 2,
                      'current': 3,
                      'streamed': 2,
                      'fill': 1,
                      'ludicrous': 1,
                      'easiest': 1,
                      'priest': 1,
                      'Leaning': 2,
                      "t'hi-im": 1,
                      'town': 28,
                      'now': 90,
                      'OK.': 1,
                      'Drink': 2,
                      'levi-clad': 1,
                      'Melanesian': 2,
                      'buxom': 1,
                      'Favor': 1,
                      'cab': 2,
                      'suddenly': 16,
                      'turtle': 2,
                      'lay': 15,
                      'cohnfidunt': 1,
                      'loss': 1,
                      'three': 35,
                      'shut': 7,
                      'bobbing': 1,
                      'demand': 2,
                      'batch': 2,
                      'stock': 5,
                      'Spring': 1,
                      'throughout': 1,
                      'planks': 3,
                      'San': 3,
                      'saddles': 1,
                      'wage': 1,
                      'industrial': 2,
                      'dozed': 1,
                      'pat': 1,
                      'pure': 3,
                      'extended': 6,
                      'sluices': 1,
                      'demoniac': 1,
                      'patiently': 1,
                      'twins': 1,
                      'Slow': 1,
                      'joy': 2,
                      'detective': 1,
                      'gag': 1,
                      'car': 36,
                      'murders': 1,
                      'clambered': 2,
                      'cap': 6,
                      'West': 7,
                      '$30': 1,
                      'A.': 1,
                      'discovered': 3,
                      'present': 5,
                      'galley': 2,
                      'be': 183,
                      'racing': 3,
                      'names': 4,
                      'ready': 14,
                      'corpses': 1,
                      'sodden': 1,
                      'gunslinger': 1,
                      'passiveness': 1,
                      'glistened': 2,
                      'litle': 1,
                      'Ten': 4,
                      'witches': 2,
                      'abundance': 1,
                      'sincerity': 1,
                      'heap': 3,
                      'sparring': 1,
                      'conclusion': 1,
                      'ant': 1,
                      'ceased': 2,
                      'auto': 1,
                      'minutes': 27,
                      'pinch': 1,
                      'Gas': 1,
                      'equipped': 1,
                      'intertwined': 1,
                      'joined': 6,
                      "He's": 15,
                      'hush': 1,
                      'leaping': 1,
                      'load': 5,
                      'convicts': 1,
                      'distributor': 1,
                      'sharpened': 1,
                      "Amelia's": 2,
                      'covering': 5,
                      'afternoon': 9,
                      'scored': 1,
                      'indistinguishable': 1,
                      'drained': 2,
                      'central': 1,
                      'shivered': 1,
                      'Poussin': 1,
                      'brandishing': 1,
                      'fail': 2,
                      'boarding': 1,
                      'burial': 2,
                      'weeds': 1,
                      'suitably': 1,
                      'force': 6,
                      'replaced': 2,
                      'Bonaventure': 2,
                      "Christ's": 1,
                      'prevented': 2,
                      'Raton': 1,
                      'absent-mindedly': 1,
                      'spasms': 1,
                      'cringing': 2,
                      'eastern': 1,
                      "Tilghman's": 4,
                      'plates': 3,
                      'wildly': 7,
                      'Neal': 3,
                      'loin': 1,
                      'Donald': 4,
                      'ate': 2,
                      'alone': 14,
                      'pardon': 2,
                      'trail-worn': 1,
                      'Venice': 1,
                      'beneath': 15,
                      'bullet': 12,
                      'offsaddled': 1,
                      "You'd": 3,
                      'slugs': 1,
                      'backbone': 1,
                      'tongue-tied': 1,
                      'camouflage': 1,
                      'into': 180,
                      'address': 3,
                      'keel': 1,
                      'else': 21,
                      'let': 38,
                      'panted': 1,
                      'Shu-tt': 1,
                      'becoming': 2,
                      'May': 2,
                      'artfully': 1,
                      'vanishing': 1,
                      'brother': 2,
                      'Somebody': 6,
                      'managed': 3,
                      'ruthlessly': 1,
                      'method': 2,
                      'murder': 9,
                      '2': 2,
                      'associates': 1,
                      '400': 1,
                      'blinked': 1,
                      'dispatched': 1,
                      'ferociously': 1,
                      'trigger-happy': 2,
                      'visible': 2,
                      'involved': 4,
                      'detonation': 1,
                      'pupils': 1,
                      'quirt': 8,
                      'straining': 1,
                      'style': 1,
                      'switch': 4,
                      "Cargill's": 1,
                      'electric': 1,
                      'reunion': 3,
                      'shots': 7,
                      "marshal's": 2,
                      'muttered': 3,
                      'honeymoon': 1,
                      "I'm": 50,
                      'forgive': 3,
                      'secrets': 1,
                      'congealed': 1,
                      'corners': 1,
                      'around': 71,
                      'gasped': 1,
                      'Donna': 4,
                      'engages': 3,
                      'edges': 1,
                      'missy': 1,
                      'Otherwise': 1,
                      "I'll": 35,
                      'sling': 1,
                      'slivery': 1,
                      'rockstrewn': 1,
                      'monotonous': 1,
                      'stilts': 1,
                      'naked': 5,
                      'clicks': 1,
                      'clothes': 11,
                      'gripped': 6,
                      'clapped': 2,
                      'gruff': 2,
                      'headwalls': 2,
                      'thankful': 1,
                      'released': 4,
                      'ends': 1,
                      'possibilities': 1,
                      'eromonga': 1,
                      'attracted': 2,
                      'controlled': 2,
                      'hide': 3,
                      'farming': 2,
                      'mid-section': 1,
                      'inadvertently': 1,
                      'self-dictate': 1,
                      'parked': 4,
                      'biggest': 1,
                      'from': 260,
                      'devilish': 1,
                      'Jim': 1,
                      'Hot': 3,
                      'ages': 2,
                      'Fat': 1,
                      'warning': 5,
                      'dejectedly': 1,
                      'Murder': 1,
                      'ball': 10,
                      'Mother': 1,
                      'spade': 1,
                      'catlike': 1,
                      'reality': 1,
                      'stoker': 1,
                      'studied': 7,
                      'twenty-five': 1,
                      "boat's": 1,
                      'appreciate': 2,
                      'prancing': 1,
                      'deeds': 1,
                      'years': 32,
                      'sticks': 3,
                      'probably': 18,
                      'listless': 1,
                      'frequently': 2,
                      'Splendid': 1,
                      'During': 1,
                      'well-brushed': 1,
                      'majuh': 1,
                      'Blackfeet': 2,
                      'Cold': 1,
                      'tore': 3,
                      'cashmere': 1,
                      'Ducking': 1,
                      'drunkenly': 1,
                      'victuals': 1,
                      'or': 115,
                      'madly': 2,
                      "Don't": 14,
                      "Carwood's": 1,
                      'fifteen': 3,
                      'shaky': 1,
                      'destruction': 1,
                      'dives': 3,
                      'blissfully': 1,
                      'ma': 1,
                      'city': 5,
                      'sex': 3,
                      'Looking': 2,
                      'muzzles': 1,
                      'stomach': 9,
                      'sensing': 1,
                      'holstered': 3,
                      'Miraculously': 3,
                      'hosts': 1,
                      'backing': 1,
                      'easier': 1,
                      'consciousness': 1,
                      'plans': 2,
                      'ropes': 3,
                      'pick-up': 2,
                      'sprawled': 2,
                      'Vs.': 1,
                      'heart': 11,
                      'primary': 1,
                      'national': 1,
                      'ah': 2,
                      'tomorrow': 3,
                      'Dealing': 2,
                      'suspicion': 2,
                      'Gloom': 1,
                      'bandits': 1,
                      'tacked': 1,
                      'boiler': 1,
                      'called': 22,
                      'Cabot': 1,
                      'year': 11,
                      'Straits': 1,
                      'exerted': 1,
                      'Too': 6,
                      'courage': 1,
                      'sighted': 2,
                      'painted': 1,
                      'agricultural': 1,
                      'slumped': 5,
                      'savvy': 1,
                      'Resignedly': 1,
                      'Ah-ah': 1,
                      'scraps': 1,
                      'courtliness': 1,
                      'conspicuously': 1,
                      'Hoe-Down': 1,
                      'organized': 2,
                      'wonder': 6,
                      'partial': 1,
                      'revelry': 1,
                      'gestured': 1,
                      'sharply': 4,
                      'makeshift': 1,
                      'chance': 20,
                      'squeaking': 1,
                      'munched': 1,
                      'stampeded': 1,
                      'somnolent': 1,
                      'sourly': 1,
                      'vine-crisscrossed': 1,
                      'fellow': 4,
                      'halfway': 3,
                      'mission': 4,
                      'Hair': 1,
                      'not': 179,
                      'blazed': 1,
                      'sons': 1,
                      'taught': 2,
                      'headquarters': 2,
                      'Brakes': 1,
                      'basket': 1,
                      'suffer': 2,
                      'soared': 3,
                      'bedroom': 6,
                      'growth': 1,
                      'Arm': 1,
                      'Eyes': 8,
                      'shattering': 4,
                      'college': 4,
                      'unoccupied': 1,
                      '15th': 1,
                      'placed': 7,
                      'moaned': 2,
                      'landmarks': 1,
                      'boat': 4,
                      'noble': 4,
                      'zigzagging': 3,
                      'set': 17,
                      'refuse-littered': 1,
                      'sliding': 2,
                      'glanced': 4,
                      'winked': 1,
                      'worried': 4,
                      'hated': 6,
                      'arrangement': 2,
                      'Meaning': 1,
                      'essential': 1,
                      'crowd': 4,
                      'five': 13,
                      'begrudge': 1,
                      'thrilled': 1,
                      'neatly': 1,
                      'less': 13,
                      "Breed's": 1,
                      'movement': 4,
                      'scraping': 3,
                      'dwindling': 1,
                      'united': 1,
                      'application': 2,
                      'smaller': 3,
                      'crimsoning': 1,
                      'surreptitious': 1,
                      'shall': 7,
                      'word': 8,
                      'innocence': 1,
                      'magnificent': 1,
                      'quarter-mile': 1,
                      'detectives': 1,
                      'quieted': 1,
                      'searching': 4,
                      "hadn't": 14,
                      'shells': 1,
                      'ground': 29,
                      "We'd": 2,
                      'Ferguson': 1,
                      'broil': 1,
                      'locate': 3,
                      'bowstring': 1,
                      'Walter': 7,
                      'stir': 2,
                      'liaisons': 1,
                      'mudguard': 1,
                      '7:30': 1,
                      'business': 8,
                      'strength': 5,
                      'claims': 1,
                      'wreck': 1,
                      'adults': 2,
                      'wallet': 1,
                      'oval': 1,
                      'sack': 2,
                      'Start': 3,
                      'unusual': 3,
                      'holster': 8,
                      'clasped': 1,
                      'ostensible': 1,
                      'obedience': 1,
                      'flow': 2,
                      'white': 26,
                      'dreamed': 2,
                      'thimble-sized': 1,
                      'hoof': 1,
                      'desperadoes': 1,
                      'lacked': 1,
                      "Dill's": 3,
                      'task': 1,
                      ...}),
             'belles_lettres': Counter({'Without': 12,
                      'unanswered': 1,
                      'taxing': 1,
                      'More': 8,
                      '23': 1,
                      'festival': 1,
                      'speak': 30,
                      'located': 3,
                      'economics': 3,
                      'Bang-Jensen': 25,
                      'intimidation': 1,
                      'house': 40,
                      'promises': 2,
                      'dispatch': 5,
                      'Corporations': 3,
                      'invulnerability': 1,
                      'building': 11,
                      'arithmetized': 1,
                      'waters': 4,
                      'crocketed': 1,
                      'Between': 2,
                      'restrain': 2,
                      'sieben': 1,
                      'use': 56,
                      'dissatisfactions': 1,
                      'killed': 7,
                      'experience': 92,
                      'endure': 1,
                      'horseman': 1,
                      'served': 20,
                      'Principal': 1,
                      'corroding': 1,
                      'population': 11,
                      'warrants': 1,
                      'titanic': 1,
                      'mana': 2,
                      'Viscount': 1,
                      'gibe': 1,
                      'Sex': 1,
                      'Dreiser': 1,
                      'surrendering': 1,
                      'pleading': 3,
                      "Whitman's": 1,
                      'Tessie': 7,
                      'H.M.': 2,
                      'wages': 1,
                      'zeal': 1,
                      'Continental': 4,
                      'shade': 6,
                      'sunshine': 1,
                      'mercy': 2,
                      'grew': 12,
                      'Convocation': 3,
                      "Peoples'": 1,
                      'chancellor': 2,
                      'beautiful': 15,
                      'Analogously': 1,
                      'marriages': 2,
                      'relay': 1,
                      'Karamazov': 2,
                      'denying': 1,
                      'Eternal': 1,
                      'rheum': 1,
                      'sorted': 1,
                      'Madeleine': 3,
                      'faires': 1,
                      'thrillers': 1,
                      'hardships': 3,
                      'constituents': 1,
                      'chicken': 3,
                      'barrage': 1,
                      'materially': 1,
                      'commentator': 1,
                      'reborn': 1,
                      'glance': 8,
                      'cups': 1,
                      'quiescent': 1,
                      'tangential': 1,
                      'kitchen': 2,
                      'amount': 10,
                      'pursuits': 1,
                      'crack': 1,
                      'Called': 1,
                      'pressure': 14,
                      'Short': 1,
                      'Creator': 3,
                      'legitimacy': 1,
                      'archaic': 3,
                      'evils': 1,
                      "wouldn't": 9,
                      'reinforcing': 1,
                      'through': 135,
                      '$300,000': 1,
                      "James'": 1,
                      'migrate': 1,
                      'Relativism': 1,
                      'renamed': 2,
                      'Just': 11,
                      'specific': 18,
                      'deemed': 2,
                      'Because': 12,
                      'Ah': 2,
                      'ascertain': 2,
                      'Quod': 1,
                      'hug': 2,
                      'Justice': 4,
                      'Sumter': 1,
                      'surreptitiously': 1,
                      'timid': 1,
                      'menaced': 1,
                      'grimmer': 1,
                      'entrepreneur': 6,
                      "yesterday's": 1,
                      'Democrats': 11,
                      'myne': 1,
                      'toil': 1,
                      'cottage': 1,
                      'raised': 11,
                      'stray': 1,
                      'flocking': 1,
                      'meanings': 1,
                      'stream': 9,
                      'revealed': 8,
                      'grave': 7,
                      'Rousseau': 2,
                      'suburbs': 1,
                      'eternal': 5,
                      'epitomized': 1,
                      'experimenter': 1,
                      'cheere': 1,
                      'At': 73,
                      ')': 251,
                      'college-educated': 1,
                      'Boheme': 1,
                      'non-Jew': 2,
                      'stimulated': 3,
                      'mastery': 3,
                      'Gullah': 1,
                      'quoted': 4,
                      'accumulates': 1,
                      'Those': 15,
                      'surprisingly': 4,
                      'enlist': 2,
                      'Nelson': 2,
                      'recurred': 2,
                      'inculcation': 1,
                      'lucky': 2,
                      'prophets': 2,
                      'hesitant': 1,
                      'bet': 1,
                      'English-Scottish-French': 1,
                      'subtly': 2,
                      'Peruvian': 1,
                      'Cruel': 1,
                      'sands': 1,
                      'slip': 2,
                      'negative': 17,
                      "patient's": 1,
                      'small': 48,
                      'doorways': 1,
                      'enemy': 25,
                      'Apologie': 1,
                      'understatement': 1,
                      'quo': 6,
                      'eventfully': 1,
                      'revolutions': 1,
                      'knife': 1,
                      'perceptual': 4,
                      'Verner': 1,
                      'enforced': 5,
                      'Sighting': 1,
                      'consultative': 1,
                      "lover's": 1,
                      'scornfully': 1,
                      'reared': 1,
                      'petitioned': 2,
                      'strike': 7,
                      'four': 30,
                      'Dorado': 1,
                      'e.g.': 10,
                      'really': 36,
                      'peoples': 13,
                      'lends': 1,
                      'deferents': 1,
                      'striving': 1,
                      'signals': 3,
                      'effect': 36,
                      'sports': 9,
                      'shawl': 1,
                      'Anything': 1,
                      'huckster': 1,
                      'four-thirty': 1,
                      'falcon': 1,
                      'King': 16,
                      'artists': 6,
                      'sanctimonious': 1,
                      'good-bye': 2,
                      'confronting': 2,
                      'evening': 22,
                      'prisoners': 7,
                      'Joseph': 12,
                      'relax': 2,
                      "B-52's": 1,
                      'wives': 2,
                      'recalcitrant': 2,
                      'runaway': 3,
                      'regretted': 5,
                      'Cam': 1,
                      'relief': 6,
                      'prettier': 1,
                      'stead': 2,
                      'ask': 12,
                      'gret': 1,
                      'America': 42,
                      'found': 81,
                      'Christie': 2,
                      'stewed': 1,
                      'resurrecting': 1,
                      'barefooted': 1,
                      'wonders': 2,
                      'ethos': 2,
                      'mother': 38,
                      'Garry': 4,
                      'things': 69,
                      'shaven': 1,
                      'severe': 2,
                      'alienated': 2,
                      'Abstract': 1,
                      'capacities': 2,
                      'independent': 12,
                      'nomination': 3,
                      'chow': 1,
                      'Hall': 5,
                      'singularly': 1,
                      'transported': 1,
                      "buildin'": 1,
                      'dextrous-fingered': 1,
                      'appreciative': 1,
                      'inaugural': 2,
                      'stops': 1,
                      "Coleridge's": 2,
                      'gallop': 1,
                      'Plummer': 2,
                      'unprofessional': 1,
                      'constituted': 4,
                      'Ratified': 1,
                      'sombre': 2,
                      'Welfare': 2,
                      'neighbor': 3,
                      'vehicles': 2,
                      '1959': 5,
                      'managing': 1,
                      'False': 1,
                      "Childhood's": 5,
                      'yielded': 1,
                      'Mannerhouse': 1,
                      'shadow': 2,
                      'concerto': 1,
                      'generalized': 2,
                      'fabric': 3,
                      'Hutton': 1,
                      'hogs': 1,
                      'adieu': 1,
                      'indecision': 1,
                      'Bright': 6,
                      'viability': 2,
                      'Tuscany': 1,
                      'outset': 2,
                      'dying': 1,
                      'inability': 6,
                      'allegro': 1,
                      'Merchant': 2,
                      'candid': 1,
                      'Gaieties': 3,
                      '1961': 4,
                      'breadth': 2,
                      '6th': 2,
                      'hunting': 1,
                      'run': 18,
                      'absorbing': 1,
                      'web': 1,
                      'demanded': 4,
                      'entitle': 3,
                      'grandchildren': 1,
                      'songbook': 1,
                      'uncomfortable': 3,
                      'denies': 3,
                      'knack': 1,
                      'themselves': 72,
                      'stronghold': 1,
                      'logically': 2,
                      'Whig': 6,
                      'interludes': 1,
                      'Santa': 5,
                      'Plymouth': 4,
                      'actuality': 3,
                      'Good': 3,
                      'Howard': 2,
                      'S-D': 1,
                      'somnolence': 1,
                      'speaks': 4,
                      'elegant': 2,
                      'lift': 3,
                      'pulled': 5,
                      'sprinkling': 1,
                      'domestic': 6,
                      'pursues': 1,
                      'radio': 5,
                      'university-educated': 1,
                      'coat': 4,
                      'Oscar': 1,
                      'Kitchin': 1,
                      'Frederick': 5,
                      'due': 8,
                      'fiction-writing': 1,
                      'attainment': 1,
                      'demander': 1,
                      'seeming': 2,
                      'semi-catatonic': 1,
                      'business-minded': 1,
                      'problem': 40,
                      'pulse': 2,
                      'infantile': 1,
                      '1930': 2,
                      'Kingsley': 1,
                      'percentage': 4,
                      'Apostles': 1,
                      'shawls': 2,
                      'dares': 1,
                      'Rhine': 6,
                      'looking': 21,
                      'dictate': 2,
                      'Journal': 1,
                      'nuns': 1,
                      'sixth': 1,
                      'Watch': 1,
                      'clever': 2,
                      'humor': 6,
                      'baggy': 1,
                      "Agamemnon's": 1,
                      'Pendleton': 2,
                      'delegation': 5,
                      'sentimental': 2,
                      'introductions': 1,
                      'supervened': 1,
                      "Bradbury's": 2,
                      'ranked': 2,
                      'exemplar': 2,
                      'slowly': 10,
                      'transcendence': 1,
                      'H.L.': 1,
                      'bombers': 4,
                      'filth': 1,
                      'atmosphere': 10,
                      '4,585': 1,
                      '1821': 1,
                      '1859': 1,
                      'announcements': 1,
                      'Eh': 1,
                      'equal': 9,
                      'instrument': 14,
                      'Especially': 1,
                      'Lawrence': 10,
                      'Beauchamps': 1,
                      'stripes': 1,
                      'rich': 12,
                      'Wilson': 4,
                      'Gatsby': 1,
                      'spurious': 1,
                      'pride': 6,
                      'Franklin': 9,
                      'compulsion': 3,
                      'brute': 1,
                      'Reflections': 1,
                      'Makers': 1,
                      'ran': 22,
                      'wagons': 1,
                      'optimism': 3,
                      'habitually': 1,
                      'morals': 3,
                      'tiny': 6,
                      'Reformation': 2,
                      'Tomonggong': 1,
                      'inventory': 2,
                      'socio-political': 1,
                      'Kooning': 1,
                      'A.M.': 2,
                      'besmirching': 1,
                      'adapted': 2,
                      'stairway': 1,
                      'cutthroat': 1,
                      'buildings': 9,
                      'between': 146,
                      'cost': 5,
                      '1643': 1,
                      'courteously': 1,
                      'intimate': 2,
                      'premieres': 2,
                      'outmoded': 1,
                      'solar': 3,
                      "Toscanini's": 1,
                      'smacks': 1,
                      'proletariat': 2,
                      'tombstone': 1,
                      'masterpiece': 5,
                      'Atlas': 5,
                      'Regiment': 2,
                      'improve': 2,
                      'Anderson': 2,
                      'typifying': 1,
                      'Olgivanna': 7,
                      'companions': 5,
                      'higher': 12,
                      'constructively': 1,
                      'Ptolemaists': 1,
                      'foliage': 1,
                      'Stanley': 2,
                      'Sadie': 5,
                      'firmly': 12,
                      'Until': 4,
                      'Hideous': 1,
                      'complection': 1,
                      'obscenity': 1,
                      'planetoid': 1,
                      'braweling': 1,
                      'hospitable': 2,
                      'dismissed': 3,
                      'shabbily': 1,
                      'told': 57,
                      'composers': 3,
                      'discovery': 7,
                      'transcend': 1,
                      'The': 1030,
                      'oratory': 1,
                      'heat': 5,
                      'D.': 4,
                      'facile': 1,
                      'repudiating': 1,
                      'merited': 1,
                      'of': 6289,
                      'wherefores': 1,
                      "Morgan's": 5,
                      'non-interference': 1,
                      'withdrawn': 1,
                      'law': 58,
                      'bankers': 1,
                      '1920s': 1,
                      'flappers': 1,
                      "barn-burner's": 1,
                      'two-room': 1,
                      "Pandora's": 2,
                      'police': 21,
                      'Seebohm': 4,
                      'impending': 2,
                      'Proprietorships': 3,
                      'conception': 12,
                      'bold': 4,
                      'Silk': 1,
                      'Wordsworth': 2,
                      'self-destructive': 1,
                      'wars': 6,
                      'Sierra': 1,
                      'gastronomy': 1,
                      'dispersed': 1,
                      'maid': 6,
                      'facts': 19,
                      'recreation': 1,
                      'unusually': 1,
                      'eschew': 1,
                      'terror-stricken': 1,
                      'peculiar': 8,
                      'Hooghli': 1,
                      'factory': 2,
                      'entry': 2,
                      "America's": 4,
                      'carelessly': 1,
                      'outside': 25,
                      'detain': 1,
                      'arc': 1,
                      'Apology': 1,
                      'book': 58,
                      "Ward's": 1,
                      'heliotrope': 1,
                      'immoralities': 1,
                      'mentions': 3,
                      'jacket': 2,
                      'Resolves': 1,
                      'psychopomp': 1,
                      'sullen': 1,
                      'Brigade': 1,
                      'presents': 6,
                      'pump': 1,
                      'endlessly': 1,
                      'give': 56,
                      'loathing': 1,
                      'Sentiments': 1,
                      'Dashiell': 3,
                      'presaged': 1,
                      'incompatible': 1,
                      'boring': 1,
                      'cheek': 1,
                      'dishes': 1,
                      'treason': 1,
                      'mobility': 1,
                      'infatuation': 2,
                      'hindmost': 1,
                      "Auerbach's": 1,
                      "Czarina's": 3,
                      'tunnel': 1,
                      'blond': 1,
                      'Nineteenth': 4,
                      'waving': 3,
                      'diverse': 2,
                      'strenuous': 3,
                      'protestations': 2,
                      'wholeness': 2,
                      'applying': 2,
                      'critical': 12,
                      'hoodlums': 1,
                      'missing': 2,
                      'strivings': 2,
                      'explanation': 7,
                      'softened': 1,
                      'tablet': 2,
                      'fiendish': 1,
                      'yff': 1,
                      'N': 2,
                      'after': 123,
                      'conjectures': 1,
                      'questioning': 5,
                      'reached': 21,
                      'arrangements': 6,
                      'Riesman': 1,
                      'nectar': 1,
                      'response': 16,
                      'Japan': 2,
                      'bargain': 1,
                      'authority': 9,
                      'gingham': 1,
                      'Perhaps': 19,
                      'grace': 10,
                      'fitness': 2,
                      'punctuated': 2,
                      'missile': 3,
                      'strikingly': 1,
                      'big': 18,
                      'hadd': 1,
                      'Soiree': 1,
                      'centre': 2,
                      'Sidney': 5,
                      'nourished': 1,
                      'Erde': 1,
                      'attempted': 6,
                      'completely': 15,
                      'facaded': 1,
                      'cold': 12,
                      'opened': 9,
                      'left': 48,
                      'radioactive': 1,
                      'scaffoldings': 1,
                      'Willa': 1,
                      'Holy': 7,
                      'blast': 2,
                      'stood': 19,
                      'unfortunately': 2,
                      'deck': 4,
                      'lend': 2,
                      'reminding': 1,
                      'street': 14,
                      'doorknob': 1,
                      'sins': 4,
                      'utmost': 2,
                      'Burma': 7,
                      'clothier': 1,
                      'hookup': 1,
                      'patent': 2,
                      'ludicrous': 1,
                      'priest': 2,
                      '$200': 1,
                      'cares': 1,
                      'heresy': 1,
                      'town': 35,
                      'fulfilled': 6,
                      'compresses': 1,
                      'Lymington': 1,
                      'Knightes': 1,
                      'Manley': 3,
                      'mai': 1,
                      'commonplace': 4,
                      'cab': 2,
                      'colder': 2,
                      'Yankee': 2,
                      'steer': 1,
                      'cheerfulness': 1,
                      'three': 59,
                      'shortsighted': 2,
                      'declaring': 4,
                      'dessier': 1,
                      'alcoholics': 1,
                      'wander': 1,
                      'presidential': 1,
                      'wage': 1,
                      'concerted': 1,
                      'virtue': 12,
                      'pure': 19,
                      'Court-packing': 1,
                      'extended': 8,
                      'sluices': 1,
                      'patiently': 1,
                      'intrigues': 1,
                      'singularity': 1,
                      'joy': 10,
                      'beast': 1,
                      'invasions': 9,
                      'continuity': 10,
                      'sundry': 2,
                      'rehearsed': 1,
                      'complaisance': 1,
                      'A.': 9,
                      'self-analysis': 1,
                      'treasured': 1,
                      'grows': 2,
                      'Merit': 1,
                      'Smithsonian': 3,
                      'receives': 5,
                      'be': 843,
                      'wealth': 8,
                      'names': 11,
                      'informational': 2,
                      'tarnished': 1,
                      'Crawford': 1,
                      'culture': 25,
                      'witches': 2,
                      'abundance': 4,
                      'forget': 13,
                      'heap': 1,
                      'conclusion': 10,
                      'grocers': 1,
                      'reasons': 16,
                      'uncompromising': 3,
                      'invent': 4,
                      'minutes': 11,
                      'remote': 4,
                      'touching': 4,
                      'Head': 1,
                      'joined': 10,
                      "He's": 6,
                      'bullshit': 1,
                      'docile': 1,
                      'converge': 3,
                      "Lilly's": 3,
                      'farfetched': 2,
                      'sharpened': 1,
                      'afternoon': 13,
                      'Presupposed': 1,
                      'Fascism': 2,
                      'barnsful': 1,
                      'entirety': 1,
                      'commercial': 3,
                      'exceptional': 3,
                      'central': 18,
                      'questioners': 1,
                      'soloist': 2,
                      'fail': 2,
                      'Stratford': 6,
                      'burial': 2,
                      'suitably': 1,
                      'force': 36,
                      'employ': 3,
                      'replaced': 5,
                      'youth': 15,
                      'prevented': 3,
                      'obligations': 1,
                      'dinners': 1,
                      'brushwork': 1,
                      "child's": 4,
                      'ate': 2,
                      'alone': 19,
                      'Cumberland': 1,
                      'beneath': 12,
                      'compels': 1,
                      'schoolmaster': 3,
                      'backbone': 1,
                      'into': 246,
                      'principally': 1,
                      'solstice': 1,
                      'monument': 5,
                      'edited': 2,
                      'deviation': 1,
                      'Abraham': 1,
                      'address': 9,
                      'skipped': 1,
                      'let': 53,
                      'lewdly': 1,
                      'sprung': 1,
                      "Paula's": 2,
                      'lodgment': 1,
                      'postman': 1,
                      'inhumane': 1,
                      'necessity': 8,
                      'brother': 23,
                      'unworthy': 2,
                      'managed': 5,
                      'sterile': 1,
                      'grapple': 1,
                      'render': 2,
                      'method': 22,
                      'murder': 9,
                      'shakes': 1,
                      'Travelers': 1,
                      'grown-up': 2,
                      'sweeping': 3,
                      'Lillian': 1,
                      '2': 25,
                      'emigrating': 1,
                      'serene': 3,
                      "Dick's": 1,
                      'blinked': 1,
                      'dispatched': 1,
                      'artist-author': 1,
                      'elaborated': 2,
                      'patter': 1,
                      'bicycles': 1,
                      'short-range': 1,
                      'style': 13,
                      'switch': 3,
                      '1639': 1,
                      'Amra': 1,
                      'Gordon': 1,
                      'electric': 1,
                      'reunion': 1,
                      'jammed': 1,
                      'ruffles': 1,
                      'honeymoon': 1,
                      'hypothesis': 4,
                      'absurd': 6,
                      "I'm": 9,
                      'corners': 3,
                      'convalescence': 1,
                      'volunteered': 1,
                      'engages': 1,
                      'Englishy': 1,
                      'Otherwise': 2,
                      'Date': 1,
                      'pastime': 2,
                      'Containment': 1,
                      'focused': 1,
                      'Land': 6,
                      'overly': 1,
                      'tawny': 2,
                      "Williamson's": 1,
                      'preoccupied': 4,
                      'naked': 7,
                      'commands': 10,
                      'clothes': 3,
                      'Constitution': 30,
                      'qualities': 16,
                      'converse': 1,
                      'gruff': 1,
                      'incestuous': 3,
                      'released': 2,
                      'bench': 1,
                      'aggravated': 1,
                      'alleged': 4,
                      'tortures': 2,
                      'lodging': 1,
                      'chronology': 2,
                      'attracted': 6,
                      'rechartering': 1,
                      'hide': 5,
                      'feudal': 5,
                      'ingeniously': 1,
                      'asinine': 1,
                      'dynasties': 1,
                      'malnourished': 1,
                      'obsolete': 2,
                      'invasion': 4,
                      'picket': 2,
                      'parked': 2,
                      'from': 649,
                      'atypical': 1,
                      'B.C.': 1,
                      "four-o'clock": 1,
                      'Hot': 2,
                      'ages': 1,
                      'libraries': 3,
                      'frame': 11,
                      'happenstance': 1,
                      'atheists': 1,
                      'reprinted': 1,
                      'frail': 1,
                      'modulation': 2,
                      'Mother': 2,
                      'peonies': 1,
                      'unreconstructed': 5,
                      'environing': 1,
                      'possessor': 1,
                      "han'": 4,
                      'Communion': 1,
                      'adjunct': 1,
                      'studied': 12,
                      'Moloch': 1,
                      'recognizable': 3,
                      'dystopias': 8,
                      'sticks': 2,
                      'East': 15,
                      'retainers': 1,
                      'orphans': 1,
                      'frequently': 18,
                      'minutiae': 1,
                      'During': 21,
                      'Danchin': 1,
                      'brittle': 1,
                      'scientifically-trained': 1,
                      'sells': 1,
                      'Obviously': 5,
                      'apocryphal': 1,
                      'larkspur': 1,
                      'estates': 1,
                      'unpacking': 1,
                      'destruction': 8,
                      'Prime': 1,
                      "Washington's": 2,
                      'hub': 1,
                      'city': 36,
                      'attributes': 2,
                      'sex': 32,
                      'wall': 5,
                      'active': 9,
                      'hosts': 1,
                      'keeping': 5,
                      'Bruno': 1,
                      'imbedded': 1,
                      'easier': 4,
                      'consciousness': 10,
                      'plans': 9,
                      'accordance': 3,
                      'Shotwell': 1,
                      'hearsay': 1,
                      "Hammarskjold's": 1,
                      'Sam': 7,
                      'invite': 1,
                      'heart': 22,
                      'primary': 17,
                      'national': 37,
                      'Simon': 1,
                      'enables': 1,
                      'Brownlow': 2,
                      'amenable': 1,
                      'father-brother': 1,
                      'Photographer': 1,
                      'Israelite': 1,
                      'authoritarian': 2,
                      'rage': 5,
                      'conscientious': 1,
                      "lady's": 1,
                      'called': 62,
                      'opinions': 13,
                      'slanderer': 3,
                      '1865': 2,
                      'blues': 4,
                      'fictional': 10,
                      'reverse': 2,
                      'victory': 11,
                      'divinity': 1,
                      'adjusted': 2,
                      'Straits': 1,
                      'merry': 3,
                      'Too': 1,
                      'sighted': 1,
                      'slumped': 1,
                      'G.': 4,
                      'shopping': 2,
                      'deals': 4,
                      'childlike': 3,
                      'Esther': 8,
                      'ripening': 1,
                      'Equator': 1,
                      'organized': 18,
                      'irksome': 1,
                      'solution': 6,
                      'born': 24,
                      'propel': 1,
                      'Shirt': 1,
                      'chance': 20,
                      'spirits': 7,
                      "Alfred's": 3,
                      'compel': 1,
                      'Southern': 68,
                      'fellow': 13,
                      'Market': 3,
                      'pigpens': 1,
                      'patrolling': 1,
                      'Amadee': 4,
                      'commonly': 2,
                      'Bean': 1,
                      'headquarters': 10,
                      'utopia': 3,
                      'mars': 1,
                      'Dwight': 1,
                      'basket': 2,
                      'suffer': 12,
                      'generates': 2,
                      'complained': 6,
                      'arcaded': 1,
                      '19': 3,
                      'output': 2,
                      'relationships': 10,
                      '1632': 4,
                      'sixty-five': 1,
                      'face-lifting': 1,
                      'shattering': 1,
                      'college': 27,
                      'envisioned': 1,
                      'unremitting': 1,
                      'public-school': 1,
                      'Frederik': 1,
                      'represent': 3,
                      'distinct': 8,
                      'intentions': 2,
                      'noble': 3,
                      'Nor': 4,
                      'sharers': 1,
                      'Unlike': 3,
                      'dinosaur': 1,
                      '$20,000,000': 1,
                      'hated': 3,
                      'therewith': 1,
                      'imposed': 1,
                      'campus': 5,
                      'contemptuously': 1,
                      'contrived': 1,
                      'obituaries': 1,
                      'self-employed': 1,
                      'Sut': 1,
                      'five': 27,
                      'repulsed': 1,
                      'snapped': 2,
                      'overtake': 1,
                      'common': 49,
                      'poems': 29,
                      'less': 67,
                      'outgoing': 2,
                      'scraping': 1,
                      'thrusting': 1,
                      'coined': 2,
                      'congruent': 2,
                      'participate': 5,
                      'Hearst': 48,
                      "Banks's": 1,
                      'deacon': 1,
                      'antiquarian': 1,
                      'temperament': 1,
                      'Pomham': 2,
                      'smaller': 5,
                      'ceremonies': 2,
                      'Rousseauan': 1,
                      'volens': 1,
                      'differentiate': 1,
                      'shall': 34,
                      'Greek': 23,
                      'larks': 1,
                      'innocence': 19,
                      'magnificent': 4,
                      'cynically': 1,
                      'unconsciously': 1,
                      'decreed': 1,
                      'Report': 1,
                      'detectives': 1,
                      'thinks': 4,
                      'shells': 1,
                      '$5000': 4,
                      "We'd": 1,
                      'primitive': 14,
                      'exclamations': 2,
                      'intrusive': 1,
                      'lange': 1,
                      'Crittenden': 3,
                      'eclectic': 1,
                      'father-murder': 1,
                      'topped': 2,
                      'confiscating': 1,
                      'claims': 13,
                      'executives': 1,
                      'Van': 12,
                      'scion': 1,
                      'seventeenth': 5,
                      'conductor': 7,
                      'grape-arbor': 3,
                      'wallet': 1,
                      'symbolize': 1,
                      'Cooper': 1,
                      'restitution': 1,
                      '2,000': 2,
                      'Start': 1,
                      'swim': 3,
                      'radius': 1,
                      'U.': 1,
                      'Kurd': 1,
                      'Brother': 1,
                      ...}),
             'editorial': Counter({'Without': 5,
                      'vacuum': 1,
                      'lighted': 1,
                      'contented': 1,
                      'RCA': 1,
                      'Dupont': 1,
                      'taxing': 1,
                      'More': 6,
                      '23': 5,
                      'festival': 1,
                      'speak': 4,
                      'located': 2,
                      'education': 6,
                      'house': 8,
                      'knobby-knuckled': 1,
                      'promises': 4,
                      'skilled': 1,
                      'muscles': 1,
                      'dispatch': 3,
                      'darling': 1,
                      'Wilmette': 1,
                      'epoch-making': 1,
                      'building': 4,
                      'Adams': 2,
                      'infernally': 1,
                      'Spirits': 2,
                      'operating': 2,
                      '$133': 1,
                      'use': 30,
                      'enticing': 1,
                      'Lower': 1,
                      'killed': 5,
                      'handicrafts': 1,
                      'experience': 8,
                      'work-weary': 1,
                      'served': 6,
                      'committee': 10,
                      'bars': 4,
                      'sports': 2,
                      'population': 8,
                      'Escalation': 2,
                      'tempted': 2,
                      'migrating': 1,
                      'thoroughly': 2,
                      'February': 1,
                      'surrendering': 2,
                      'Much': 2,
                      "lyin'": 1,
                      'forum': 2,
                      'enamel': 1,
                      "Mayor's": 2,
                      'certain': 13,
                      'season': 6,
                      'zeal': 2,
                      'demanded': 5,
                      'shade': 2,
                      'others': 19,
                      "Luther's": 1,
                      'budgets': 2,
                      'button-down': 1,
                      'transition': 3,
                      'Laboratory': 1,
                      'Persianesque': 1,
                      'catch': 1,
                      'absurdity': 1,
                      'chancellor': 2,
                      'beautiful': 8,
                      'intimidation': 1,
                      'shorter': 2,
                      'Loon': 1,
                      'Mar.': 2,
                      'Jemela': 1,
                      'dictator': 1,
                      'denying': 1,
                      "Democrats'": 1,
                      'second-class': 1,
                      'retains': 5,
                      'taverns': 1,
                      'grotesquely': 2,
                      'imposition': 1,
                      'tyrant': 1,
                      'Enough': 2,
                      'Stevens': 3,
                      'constituents': 1,
                      'chicken': 1,
                      'determine': 3,
                      'amend': 1,
                      '57': 1,
                      'correct': 3,
                      'toughest': 1,
                      'glance': 1,
                      'kitchen': 1,
                      'amount': 11,
                      'glances': 1,
                      'pressure': 14,
                      'Short': 1,
                      'scientists': 5,
                      'besetting': 1,
                      'adviser': 2,
                      "Times'": 1,
                      "wouldn't": 4,
                      'N.': 2,
                      'scratching': 1,
                      'through': 23,
                      'Volumes': 1,
                      'Just': 7,
                      'specific': 3,
                      'battle-cry': 1,
                      'each': 30,
                      'Because': 2,
                      "Board's": 1,
                      "Lucas's": 1,
                      'Justice': 4,
                      'tons': 4,
                      'France': 7,
                      'ceased': 1,
                      'Democrats': 12,
                      'appropriating': 1,
                      'cottage': 1,
                      'shy': 1,
                      'raised': 9,
                      'stray': 1,
                      'stream': 1,
                      'revealed': 3,
                      'grave': 4,
                      'Fortress': 1,
                      'sprung': 2,
                      'believing': 1,
                      'oversimplified': 1,
                      'fabulous': 1,
                      'eternal': 1,
                      'Kokoschka': 1,
                      'situations': 1,
                      'tacitly': 1,
                      'blackout': 2,
                      'At': 16,
                      ')': 95,
                      'seas': 3,
                      'economizing': 1,
                      'colleagues': 1,
                      'staked': 1,
                      'Those': 1,
                      'surprisingly': 1,
                      'enlist': 1,
                      'Nelson': 1,
                      'Murat': 1,
                      'operational': 1,
                      'unforseen': 1,
                      'career': 1,
                      'idly': 1,
                      'husband': 5,
                      'friends': 12,
                      'Prime': 1,
                      'shouting': 2,
                      'sands': 1,
                      'slip': 1,
                      'negative': 4,
                      'thoroughfare': 1,
                      'small': 27,
                      'disreputable': 1,
                      'jam': 2,
                      'enemy': 5,
                      'live-oak': 1,
                      'competition': 4,
                      'recruits': 1,
                      'sect': 1,
                      'Q3': 1,
                      'recession': 3,
                      'missiles': 1,
                      'recently': 11,
                      'enforced': 1,
                      'unbalanced': 2,
                      'steamship': 1,
                      'legislation': 4,
                      'next': 25,
                      'accelerating': 1,
                      'strike': 4,
                      'dead-end': 1,
                      'shall': 19,
                      'searching': 3,
                      'really': 19,
                      'reputation': 3,
                      'broad': 3,
                      'external': 2,
                      "McCone's": 1,
                      'Please': 1,
                      'comforting': 1,
                      'rushed': 2,
                      'Over': 2,
                      'signals': 1,
                      'effect': 13,
                      'whites': 5,
                      'sufficiency': 1,
                      'hunk': 1,
                      'cell': 3,
                      'King': 10,
                      'Nashville': 4,
                      'Nkrumah': 2,
                      'spending': 4,
                      'Remembering': 1,
                      'outsmarted': 1,
                      'evening': 6,
                      'prisoners': 2,
                      'probation': 1,
                      'survivors': 1,
                      'twelve': 2,
                      'Joseph': 1,
                      'hymn': 1,
                      'Dunes': 1,
                      'tip': 1,
                      'relax': 1,
                      'zoned': 1,
                      'wives': 1,
                      'Possibly': 1,
                      'intends': 2,
                      'ask': 7,
                      'assassination': 1,
                      'found': 18,
                      'Truman': 3,
                      'win': 8,
                      '$1,450,000,000': 1,
                      'Cooch': 2,
                      'domes': 1,
                      'wonders': 2,
                      'mother': 16,
                      'Caribbean': 2,
                      'Chiang': 2,
                      'things': 20,
                      'Now': 13,
                      'thank': 2,
                      'oratorical': 1,
                      'department': 6,
                      'independent': 5,
                      'divided': 2,
                      'suppose': 4,
                      'feed': 6,
                      'vertical': 1,
                      '18': 6,
                      'drawn': 4,
                      'stops': 1,
                      'bat': 1,
                      'Lebanese': 2,
                      'advisers': 3,
                      'half-million': 1,
                      'one-third': 1,
                      'hatred': 1,
                      'hooliganism': 1,
                      'German': 17,
                      'Molly': 1,
                      'compulsive': 1,
                      'Engaged': 1,
                      'signal': 1,
                      '1959': 3,
                      'Pauling': 1,
                      'musicians': 1,
                      'yielded': 1,
                      'tenacious': 1,
                      'enjoyable': 1,
                      'fabric': 1,
                      'painstakingly': 1,
                      "K's": 1,
                      'flights': 1,
                      'Hence': 2,
                      'outset': 2,
                      'dying': 3,
                      'cycles': 1,
                      'required': 5,
                      'front': 2,
                      'conceivable': 3,
                      '1961': 8,
                      'strip': 1,
                      'curiously': 1,
                      'Revised': 2,
                      'God': 11,
                      'run': 11,
                      'discussed': 1,
                      'stores': 4,
                      'I.': 3,
                      'careful': 4,
                      'restudy': 1,
                      'lilting': 1,
                      'barest': 1,
                      'feather': 4,
                      'dimension': 1,
                      'uncomfortable': 1,
                      'themselves': 22,
                      'developing': 2,
                      'academic': 1,
                      'dome': 1,
                      'sunshine': 1,
                      'loads': 1,
                      'manmade': 1,
                      '$8.5': 1,
                      'replacing': 2,
                      'Church': 15,
                      'Dec.': 1,
                      "Young's": 1,
                      'execution': 1,
                      'Howard': 1,
                      'reader': 2,
                      'clean': 3,
                      'sophomore': 1,
                      'pulled': 1,
                      'continuous': 2,
                      'cautioned': 1,
                      'framed': 1,
                      'mercy': 1,
                      'sentences': 2,
                      'methods': 4,
                      'visited': 4,
                      'disposed': 1,
                      'Also': 4,
                      'radio': 8,
                      'defensive': 1,
                      'sadder': 1,
                      'northeast': 1,
                      'Elmer': 1,
                      "they're": 3,
                      'Oscar': 1,
                      'Japanese': 2,
                      'exchange': 3,
                      'Frederick': 3,
                      'due': 12,
                      'seeming': 1,
                      'unabashed': 1,
                      'furniture': 1,
                      'sixty': 1,
                      'manpower': 1,
                      'broad-brimmed': 1,
                      'problem': 18,
                      'curtail': 1,
                      'modification': 1,
                      'winter': 3,
                      'distance': 3,
                      'walking': 2,
                      'percentage': 2,
                      'receive': 9,
                      'specialize': 1,
                      'proffered': 1,
                      'enlightened': 1,
                      'looking': 5,
                      'Journal': 3,
                      'British': 18,
                      'trading': 1,
                      'expressway': 2,
                      'Time': 2,
                      'periods': 1,
                      'spenders': 1,
                      'heyday': 1,
                      'signposts': 1,
                      'lavishing': 1,
                      'collected': 2,
                      'Big': 1,
                      'descending': 2,
                      'pointed': 2,
                      'tentacle': 1,
                      'Usually': 3,
                      'boxed': 1,
                      'believes': 3,
                      'determination': 3,
                      '21-22': 1,
                      'evacuation': 1,
                      'slowly': 2,
                      'place': 16,
                      'piece': 9,
                      'treelike': 1,
                      'committees': 2,
                      'payroll': 1,
                      'underestimate': 1,
                      'first': 66,
                      'atmosphere': 12,
                      'buildup': 1,
                      'union': 7,
                      'melting': 2,
                      "Batista's": 2,
                      '1821': 1,
                      'land': 18,
                      'announcements': 1,
                      '110': 1,
                      'unfairly': 1,
                      'could': 56,
                      'Especially': 1,
                      'enjoying': 2,
                      'Lawrence': 2,
                      'conform': 2,
                      'audience': 1,
                      'rich': 4,
                      'stopped': 4,
                      'Wilson': 3,
                      'related': 4,
                      'Mayor-elect': 1,
                      'appreciation': 3,
                      'developed': 3,
                      'pride': 2,
                      'amalgamated': 1,
                      'Franklin': 3,
                      'margins': 1,
                      'uni-directional': 1,
                      'Juan': 1,
                      'Ind.': 1,
                      'search': 1,
                      'comprising': 1,
                      'ran': 3,
                      'optimism': 1,
                      'meant': 4,
                      'reek': 1,
                      'intent': 2,
                      'tiny': 2,
                      'M-1': 1,
                      'accountability': 3,
                      'picture': 7,
                      'Yorkers': 1,
                      'curious': 1,
                      'unemployment': 8,
                      'adapted': 1,
                      'Interpretation': 2,
                      'buildings': 1,
                      'between': 36,
                      'cost': 10,
                      '1953': 1,
                      'premieres': 1,
                      'beam': 1,
                      'Inter-American': 3,
                      'outmoded': 1,
                      'proletariat': 1,
                      'masterpiece': 1,
                      'recognize': 8,
                      'Montgomery': 1,
                      "En-lai's": 1,
                      'marked': 2,
                      'wagon': 3,
                      'higher': 4,
                      "Lincoln's": 2,
                      'inflicting': 1,
                      'Security': 2,
                      'Traffic': 1,
                      'vopos': 1,
                      'Stanley': 2,
                      'nod': 1,
                      'Heights': 1,
                      'firmly': 3,
                      'Until': 4,
                      'backed': 3,
                      'banner': 2,
                      'handicapped': 7,
                      'unemployed': 3,
                      'endowments': 1,
                      'dismissed': 1,
                      'told': 9,
                      'greatness': 1,
                      'civic': 5,
                      'Transportation': 1,
                      'waits': 1,
                      'The': 453,
                      'promise': 3,
                      'D.': 6,
                      'Hodges': 3,
                      'riders': 2,
                      'of': 1976,
                      'Pets': 1,
                      'AEC': 2,
                      'celebrated': 1,
                      'law': 23,
                      'tackle': 1,
                      'Moon-faced': 1,
                      'Reply': 5,
                      'police': 4,
                      'Snodgrass': 1,
                      "sanctuary's": 1,
                      'impending': 1,
                      'transact': 1,
                      'awkwardly': 1,
                      'corner': 3,
                      'Trujillos': 2,
                      'bold': 1,
                      'contribution': 4,
                      'protest': 3,
                      'embroidery': 4,
                      'resigned': 1,
                      'lo': 1,
                      "ladies'": 1,
                      'numb': 1,
                      'Sp': 2,
                      'facts': 9,
                      'recreation': 2,
                      'Confidence': 1,
                      'Hazlitt': 1,
                      'disastrous': 5,
                      '285': 1,
                      'peculiar': 2,
                      'display': 2,
                      'well': 32,
                      'scale': 1,
                      'funny': 1,
                      'datelined': 4,
                      'factory': 4,
                      'entry': 2,
                      "America's": 4,
                      'outside': 10,
                      'zones': 1,
                      'hung': 1,
                      'relation': 3,
                      'book': 19,
                      "Rogues'": 1,
                      'suggested': 6,
                      'harshly': 1,
                      'employees': 2,
                      'whipping-boys': 1,
                      'jacket': 2,
                      'bound': 2,
                      'conventions': 3,
                      'intend': 1,
                      'presents': 2,
                      'endlessly': 1,
                      'purpose': 10,
                      'broadened': 1,
                      'give': 20,
                      'outgrow': 1,
                      'contorted': 1,
                      'absolute': 2,
                      'membership': 6,
                      'Richards': 7,
                      'housing': 2,
                      'beauty': 4,
                      'gamut': 1,
                      'Calvin': 1,
                      'glove': 1,
                      'dealers': 2,
                      'Richard': 5,
                      'sanction': 1,
                      'allow': 2,
                      'critical': 1,
                      'library': 1,
                      'bifocals': 1,
                      'boost': 3,
                      'missing': 3,
                      'actually': 9,
                      'Several': 5,
                      'explanation': 2,
                      'replied': 1,
                      'after': 26,
                      'questioning': 1,
                      'reached': 9,
                      'M-4': 1,
                      'earthly': 2,
                      'anyway': 1,
                      'response': 3,
                      'authority': 10,
                      'Perhaps': 7,
                      'grace': 3,
                      'confused': 4,
                      'Audubon': 1,
                      'commercials': 2,
                      'football': 3,
                      'gallant': 1,
                      'strikingly': 1,
                      'big': 17,
                      'Sidney': 1,
                      'grenades': 1,
                      'riddled': 1,
                      'fills': 1,
                      '13-16': 1,
                      'Cod': 2,
                      'attempted': 1,
                      'completely': 5,
                      'alike': 3,
                      'cold': 11,
                      'shantung-like': 1,
                      'opened': 9,
                      'augmented': 2,
                      'Elliott': 1,
                      'rookies': 1,
                      'radioactive': 3,
                      'Broxodent': 2,
                      'nurses': 3,
                      'Crossman': 2,
                      'fate': 2,
                      'Holy': 2,
                      'woven': 1,
                      'blast': 2,
                      'stood': 1,
                      'unfortunately': 1,
                      'drinking': 1,
                      'Korea': 6,
                      'street': 5,
                      "CDC's": 2,
                      'bow': 1,
                      'doorknob': 1,
                      'long': 36,
                      'sins': 3,
                      'Immigration': 2,
                      'tooth': 1,
                      'current': 6,
                      'locally': 1,
                      'draftee': 1,
                      'bonus': 1,
                      'Enlargement': 1,
                      '$200': 1,
                      'worst-marked': 1,
                      'wears': 2,
                      'accuse': 1,
                      'town': 9,
                      'dropouts': 2,
                      'now': 76,
                      'Starlings': 1,
                      'dramatic': 4,
                      'taunts': 1,
                      "emperor's": 1,
                      'commonplace': 1,
                      'upsets': 1,
                      'reads': 4,
                      'suddenly': 4,
                      'De': 2,
                      'gets': 3,
                      'turtle': 5,
                      'Kerr': 1,
                      "we'll": 1,
                      'lay': 4,
                      'loss': 8,
                      'bigotry': 1,
                      'three': 15,
                      'declaring': 2,
                      'Ahmad': 1,
                      'automatic': 3,
                      'cage': 1,
                      'demand': 10,
                      'stock': 4,
                      'Inauguration': 1,
                      'Spring': 1,
                      "Griffin's": 1,
                      'downed': 1,
                      'lawmakers': 1,
                      'presidential': 1,
                      'wage': 2,
                      'clearer': 6,
                      'industrial': 18,
                      'perform': 2,
                      'organize': 2,
                      'extended': 2,
                      'pursuit': 1,
                      "Louis's": 1,
                      'patiently': 1,
                      'blossom': 1,
                      'stodgy': 1,
                      'motives': 1,
                      "community's": 1,
                      'forceful': 2,
                      'car': 12,
                      'A.': 10,
                      'crickets': 1,
                      "region's": 2,
                      'salad': 1,
                      'present': 25,
                      'describes': 1,
                      'comest': 1,
                      'receives': 1,
                      'be': 421,
                      'wealth': 2,
                      'alternative': 1,
                      'racing': 1,
                      'names': 4,
                      'ready': 16,
                      'exhaustion': 1,
                      'tarnished': 1,
                      'public-spirited': 1,
                      'Ten': 1,
                      'shares': 2,
                      'cons': 1,
                      'Publications': 1,
                      'engaged': 6,
                      'conclusion': 2,
                      'expected': 13,
                      'uncompromising': 1,
                      'auto': 2,
                      'minutes': 3,
                      'Montevideo': 1,
                      'Treasury': 2,
                      'remote': 1,
                      'joined': 1,
                      'Relative': 1,
                      "He's": 3,
                      'ye': 2,
                      'Organization': 8,
                      'load': 1,
                      'Breeding': 2,
                      'Word': 1,
                      'enforcement': 4,
                      'prisoner': 1,
                      'status': 4,
                      'commercial': 3,
                      'exceptional': 1,
                      'central': 5,
                      'fall-outs': 1,
                      'restaurants': 1,
                      'Vacancy': 1,
                      'fail': 3,
                      'bully': 1,
                      'hazard': 3,
                      'Glenn': 1,
                      'dignity': 1,
                      'force': 15,
                      'employ': 1,
                      'Mekong': 1,
                      'youth': 3,
                      'reassert': 1,
                      'abolition': 1,
                      'Transit': 1,
                      'sizes': 1,
                      'obligations': 5,
                      'first-class': 1,
                      'plates': 2,
                      'blackbirds': 1,
                      "child's": 1,
                      'Antoine': 1,
                      'lighting': 1,
                      'alone': 3,
                      'statutes': 2,
                      'cent': 19,
                      'Probable': 1,
                      'beneath': 1,
                      'parole': 1,
                      "You'd": 1,
                      'breathe': 1,
                      'declaration': 1,
                      'counseling': 1,
                      'into': 78,
                      'principally': 1,
                      'Initial': 1,
                      'confirms': 1,
                      'affiliated': 1,
                      'intriguingly': 1,
                      'edited': 2,
                      'deviation': 1,
                      'haul': 1,
                      'address': 2,
                      'keel': 1,
                      'else': 6,
                      'Robby': 1,
                      'justify': 6,
                      'let': 15,
                      'forecasts': 1,
                      'libertarians': 1,
                      'genuinely': 1,
                      'becoming': 1,
                      'May': 5,
                      'promising': 2,
                      'necessity': 4,
                      'team': 5,
                      'unworthy': 1,
                      'managed': 2,
                      'Hughes': 1,
                      'ruthlessly': 1,
                      'method': 4,
                      'murder': 2,
                      'election': 11,
                      'forgiven': 4,
                      'conceptions': 1,
                      'landscapes': 1,
                      '$1,750,000': 1,
                      '2': 19,
                      'decides': 1,
                      'hold': 6,
                      '400': 1,
                      'violent': 1,
                      'expert': 2,
                      'length': 4,
                      'directors': 1,
                      'revisions': 1,
                      "Russia's": 3,
                      'pupils': 3,
                      'rockets': 4,
                      'canning': 1,
                      'straining': 1,
                      'unmoved': 1,
                      "workers'": 1,
                      'resilience': 1,
                      'bruise': 1,
                      'Legion': 4,
                      'administrator': 1,
                      'wisely': 1,
                      'Communications': 2,
                      'reunion': 4,
                      'shots': 6,
                      'moneys': 1,
                      'jammed': 1,
                      'subjugate': 1,
                      'Atty.': 2,
                      'reservations': 2,
                      'totalitarianism': 2,
                      "I'm": 2,
                      'forgive': 9,
                      'attain': 1,
                      'Single-color': 1,
                      'Parkway': 3,
                      'around': 20,
                      'understands': 1,
                      'microfilm': 1,
                      'Curtain': 1,
                      'textures': 3,
                      'Continental': 2,
                      'Otherwise': 1,
                      'S.': 3,
                      'week-ends': 2,
                      '22,000': 1,
                      'Turkish': 3,
                      'growers': 1,
                      'go-it-alone': 1,
                      'Land': 1,
                      'shockingly': 1,
                      'naked': 1,
                      'standards': 4,
                      'qualities': 1,
                      'Authority': 4,
                      'underpaid': 1,
                      'secondary': 1,
                      'released': 1,
                      'ends': 2,
                      'possibilities': 1,
                      'aggravated': 1,
                      'alleged': 1,
                      'armaments': 3,
                      'Kirk': 1,
                      'Guevara': 1,
                      'Objects': 1,
                      'attracted': 2,
                      'controlled': 2,
                      'hide': 1,
                      'asinine': 1,
                      'Jagan': 2,
                      'interposed': 1,
                      'invasion': 2,
                      'substance': 1,
                      'from': 214,
                      'bogey-symbol': 1,
                      'ages': 3,
                      'warning': 1,
                      'appearance': 1,
                      'Karshilama': 1,
                      'ball': 4,
                      'migrates': 1,
                      'demonstrates': 1,
                      'grants': 1,
                      'donates': 1,
                      '72': 1,
                      'reality': 6,
                      'Taken': 1,
                      'continent': 3,
                      'jealousy': 1,
                      'instruction': 3,
                      'studied': 4,
                      'twenty-five': 2,
                      'Moloch': 2,
                      'appreciate': 1,
                      'fields': 2,
                      'years': 63,
                      'sticks': 1,
                      'East': 52,
                      'probably': 11,
                      'Representative': 1,
                      'Mo.': 1,
                      'frequently': 4,
                      "Sunday's": 1,
                      'During': 6,
                      'cans': 3,
                      'dusty-green': 1,
                      'Cold': 2,
                      'wry': 2,
                      'Obviously': 1,
                      'unsurpassed': 1,
                      'Paint': 1,
                      'graduates': 2,
                      'McCloy': 2,
                      '2-year-old': 3,
                      'or': 194,
                      'Law': 1,
                      "Don't": 3,
                      'states': 12,
                      'fifteen': 2,
                      'destruction': 3,
                      'Skill': 1,
                      "Washington's": 2,
                      'city': 40,
                      'flaming': 1,
                      'sex': 2,
                      'middle-age': 1,
                      'stomach': 1,
                      'folk': 1,
                      'chestnuts': 1,
                      "Congo's": 1,
                      'paramilitary': 1,
                      'lesser': 1,
                      'three-man': 1,
                      'backing': 1,
                      'five-hundred-year-old': 1,
                      'easier': 3,
                      'consciousness': 1,
                      'plans': 9,
                      'accordance': 1,
                      'uplift': 1,
                      'allowing': 3,
                      'general': 11,
                      'sprawled': 1,
                      "Hammarskjold's": 4,
                      'Gore': 1,
                      'presentations': 1,
                      'Sam': 8,
                      'invite': 1,
                      'diplomat': 1,
                      'heart': 12,
                      'primary': 4,
                      'bleak': 2,
                      'national': 25,
                      'tomorrow': 1,
                      'wishful': 2,
                      'top-drawer': 1,
                      'half-city': 1,
                      'expanding': 1,
                      'grasp': 2,
                      'conscientious': 3,
                      'collections': 1,
                      'called': 19,
                      'opinions': 3,
                      'financed': 1,
                      'Felix': 2,
                      'eye-to-eye': 1,
                      'pervades': 1,
                      'year': 52,
                      'Carbondale': 2,
                      'Moluccas': 1,
                      'victory': 3,
                      'Inquirer': 13,
                      'Too': 4,
                      'courage': 4,
                      'sighted': 1,
                      'painted': 4,
                      'agricultural': 5,
                      'Radio': 1,
                      'owes': 1,
                      'G.': 2,
                      'shopping': 2,
                      'year-earlier': 1,
                      'encampment': 1,
                      'organized': 2,
                      'wonder': 4,
                      'ascend': 1,
                      'objective': 7,
                      'solution': 7,
                      'squirms': 1,
                      'directly': 5,
                      'born': 5,
                      'propel': 2,
                      'sharply': 2,
                      'preference': 1,
                      'chance': 7,
                      'kinda': 1,
                      'watchdog': 1,
                      'Methodist': 3,
                      'situation': 15,
                      'Market': 5,
                      'mission': 1,
                      'optimistic': 2,
                      'not': 301,
                      'avail': 1,
                      'hard-liquor': 2,
                      'falsehood': 1,
                      'Morale': 1,
                      'taught': 5,
                      'headquarters': 2,
                      'atone': 1,
                      'Lili': 1,
                      'Dwight': 2,
                      'basket': 1,
                      'isolating': 1,
                      'M': 1,
                      'minorities': 1,
                      'suggesting': 1,
                      '19': 4,
                      'growth': 11,
                      'respective': 5,
                      'ethical': 6,
                      'suspensions': 1,
                      'pray': 3,
                      'college': 16,
                      'inborn': 1,
                      '15th': 2,
                      'placed': 7,
                      'distinct': 2,
                      'intentions': 3,
                      'Marcos': 1,
                      'noble': 3,
                      'illustrated': 2,
                      'retraction': 1,
                      'Nor': 3,
                      'Bell': 1,
                      'sliding': 1,
                      'peace-treaty': 1,
                      'Streets': 1,
                      'Bust': 1,
                      'arrangement': 2,
                      'campus': 2,
                      ...}),
             'fiction': Counter({'Without': 3,
                      'vacuum': 1,
                      'lighted': 4,
                      'laden': 1,
                      'dispatching': 1,
                      'More': 2,
                      'Anthony': 3,
                      'anyhow': 2,
                      'speak': 11,
                      'economics': 1,
                      'infectious': 1,
                      'education': 1,
                      'house': 54,
                      'skilled': 1,
                      "She'd": 2,
                      'building': 7,
                      'Apparel': 1,
                      'waters': 2,
                      'Between': 2,
                      'Ballestre': 2,
                      'junction': 1,
                      'restrain': 1,
                      'highboard': 1,
                      'use': 6,
                      'crags': 1,
                      'killed': 3,
                      'Ludie': 13,
                      'experience': 8,
                      'served': 5,
                      'jubilantly': 1,
                      'sports': 1,
                      'Plunking': 1,
                      'colts': 1,
                      'tempted': 1,
                      'Sex': 1,
                      'nods': 1,
                      'thoroughly': 1,
                      'February': 1,
                      'author': 1,
                      'pleading': 1,
                      'Bishopsgate': 2,
                      'plush': 1,
                      'abrupt': 1,
                      'certain': 8,
                      'wages': 3,
                      'heaved': 1,
                      'Tomorrow': 2,
                      'deal': 6,
                      'demanded': 7,
                      'shade': 1,
                      'others': 16,
                      'Wait': 2,
                      'half-understood': 1,
                      'mercy': 1,
                      'taut': 1,
                      'catch': 2,
                      'nauseated': 1,
                      'beautiful': 5,
                      'amazing': 3,
                      'shorter': 1,
                      'marriages': 1,
                      'truthfulness': 1,
                      'indefinable': 1,
                      'Mystery': 1,
                      'cheer': 1,
                      'cha-chas': 1,
                      'believes': 1,
                      'chicken': 3,
                      'determine': 1,
                      'drab': 1,
                      'slow-moving': 1,
                      'brimmed': 1,
                      'reborn': 1,
                      'cabins': 1,
                      'beadsman': 1,
                      'cups': 2,
                      'kitchen': 21,
                      'damnation': 1,
                      'amount': 2,
                      'Burt': 1,
                      'few': 28,
                      'crack': 3,
                      'Seeing': 2,
                      'cheek': 3,
                      'northward': 2,
                      'pompously': 1,
                      "wouldn't": 9,
                      'cheated': 1,
                      'scratching': 7,
                      'through': 60,
                      'Leona': 5,
                      'Know': 3,
                      'Just': 5,
                      "we'll": 1,
                      "Jessica's": 1,
                      'each': 33,
                      'Because': 1,
                      'hacked': 1,
                      'proclamation': 2,
                      'Coffee': 1,
                      'Sumter': 1,
                      'stakes': 1,
                      'Dimes': 1,
                      'France': 3,
                      "yesterday's": 1,
                      'ye': 2,
                      'urns': 1,
                      'postscript': 1,
                      'cottage': 1,
                      'shy': 2,
                      'raised': 9,
                      'stray': 3,
                      'stream': 4,
                      'ate': 6,
                      'Half': 2,
                      'clubbed': 1,
                      'Joshua': 2,
                      'Rousseau': 17,
                      'buggy': 2,
                      'keyhole': 1,
                      'fabulous': 1,
                      'eternal': 4,
                      'blackout': 1,
                      'fences': 1,
                      'At': 37,
                      "I'm": 28,
                      'apex': 1,
                      'colleagues': 1,
                      'Those': 5,
                      'surprisingly': 2,
                      'citations': 1,
                      'sizzled': 1,
                      'drop': 4,
                      'career': 4,
                      'baggage': 1,
                      'husband': 6,
                      'friends': 11,
                      'bet': 3,
                      'concierge': 2,
                      'shouting': 8,
                      'worse': 1,
                      'full-grown': 1,
                      'slip': 1,
                      'Suddenly': 2,
                      'consented': 2,
                      'small': 31,
                      'bull': 1,
                      'jam': 1,
                      'doorways': 1,
                      'enemy': 6,
                      'amuse': 1,
                      'tidings': 1,
                      'bitters': 1,
                      'knife': 7,
                      'hostess': 3,
                      'recently': 4,
                      'distinct': 1,
                      'darkly': 2,
                      'next': 29,
                      'armchair': 2,
                      'reared': 5,
                      'disagreed': 1,
                      'strike': 4,
                      'four': 14,
                      'searching': 3,
                      'really': 25,
                      'putting': 3,
                      'broad': 7,
                      'securely': 1,
                      'striving': 1,
                      'Styka': 16,
                      'borrowed': 1,
                      'stiffness': 1,
                      'effect': 8,
                      'bunk': 2,
                      'exact': 1,
                      'shawl': 2,
                      'whites': 4,
                      'wraith-like': 2,
                      'oranges': 2,
                      'shift': 3,
                      'giggled': 2,
                      'King': 3,
                      'poultry': 1,
                      'decrees': 1,
                      'spending': 3,
                      'evening': 17,
                      'prisoners': 2,
                      'dripped': 4,
                      'maniacal': 1,
                      'pungent': 1,
                      'Houses': 1,
                      'cannon': 2,
                      'telegraph': 1,
                      'Joseph': 4,
                      'tip': 3,
                      'coward': 1,
                      'relax': 3,
                      'Bureau': 1,
                      'McFeeley': 12,
                      'relief': 7,
                      'sixty-eight': 1,
                      'ask': 16,
                      'invested': 1,
                      'found': 42,
                      'hope': 5,
                      "one's": 5,
                      'Laurel': 1,
                      'mittens': 1,
                      'pretty': 10,
                      "afternoon's": 1,
                      'barges': 1,
                      'mother': 9,
                      'Jorge': 3,
                      'things': 31,
                      'Now': 30,
                      'independent': 1,
                      'squealed': 1,
                      'lewd': 1,
                      'wattles': 2,
                      'permission': 2,
                      'divided': 1,
                      'suppose': 5,
                      'Hall': 1,
                      'lyrics': 1,
                      'vertical': 1,
                      'cocked': 2,
                      'Wised': 1,
                      'drawn': 2,
                      'spur': 1,
                      'mischief': 1,
                      'sir': 6,
                      'Gilkson': 1,
                      'Verdi': 2,
                      'dark-brown': 1,
                      'yellowing': 2,
                      'hatred': 2,
                      'German': 1,
                      'signpost': 1,
                      'needled': 1,
                      'scream': 4,
                      'signal': 2,
                      'stropping': 1,
                      'deportees': 1,
                      'musicians': 4,
                      'Timon': 1,
                      'shoot': 2,
                      'shadow': 4,
                      "Eileen's": 3,
                      'marched': 4,
                      'stiffened': 3,
                      'Standing': 1,
                      'Tuscany': 1,
                      'dying': 10,
                      'required': 2,
                      'Cadillac': 4,
                      'season': 2,
                      'Eyke': 1,
                      'Cat': 1,
                      'Lord': 6,
                      'Double': 1,
                      'betrays': 1,
                      'gritty': 1,
                      'strip': 3,
                      'curiously': 2,
                      'run': 12,
                      'discussed': 5,
                      'merging': 1,
                      'stores': 2,
                      'textiles': 1,
                      'lust': 1,
                      "C'est": 2,
                      'web': 1,
                      'poncho': 3,
                      'effected': 2,
                      'unconcerned': 1,
                      'dimension': 1,
                      'uncomfortable': 2,
                      'themselves': 5,
                      'developing': 1,
                      'Anabaptists': 1,
                      'experiences': 2,
                      'Corault': 1,
                      'floundering': 1,
                      'stumps': 1,
                      'Church': 17,
                      'Santa': 1,
                      'Plymouth': 1,
                      'campmate': 1,
                      'sluggishly': 1,
                      'Concorde': 1,
                      'Howard': 13,
                      'reader': 1,
                      'clean': 12,
                      'sophomore': 1,
                      'pulled': 6,
                      'framed': 2,
                      'sentences': 1,
                      'visited': 2,
                      'loaded': 1,
                      'messieurs': 1,
                      'meticulously': 1,
                      'Also': 1,
                      'wanting': 3,
                      'coat': 3,
                      'delivery': 1,
                      'trees': 17,
                      'northeast': 1,
                      "mother's": 3,
                      "they're": 4,
                      'lecturing': 1,
                      'Japanese': 3,
                      'exchange': 2,
                      'Frederick': 3,
                      'due': 4,
                      'Fletcher': 5,
                      'seeming': 2,
                      'warmed': 1,
                      'furniture': 1,
                      'sixty': 3,
                      'checked': 4,
                      'overcomes': 1,
                      'problem': 2,
                      'curtail': 1,
                      'unashamedly': 1,
                      'winter': 5,
                      'distance': 6,
                      'walking': 12,
                      'crashing': 1,
                      'silly': 2,
                      'insurmountable': 1,
                      'looking': 19,
                      'fourth': 6,
                      'embellished': 1,
                      'Rivers': 1,
                      'nuns': 1,
                      'Murray': 1,
                      'sixth': 1,
                      'suggest': 1,
                      'tasting': 2,
                      'British': 6,
                      'celebrate': 1,
                      'Time': 4,
                      'humor': 4,
                      "John's": 5,
                      'Smiling': 2,
                      'entranceway': 1,
                      "We're": 4,
                      'collected': 1,
                      'ungodly': 2,
                      'Big': 16,
                      'descending': 2,
                      'seahorse': 1,
                      'rebutted': 1,
                      'pointed': 9,
                      'wounded': 6,
                      'stoutly': 1,
                      'hey': 2,
                      'slowly': 14,
                      'unlocked': 1,
                      'place': 36,
                      'piece': 15,
                      'rude': 1,
                      'drunk': 6,
                      'rode': 5,
                      'Carrara': 1,
                      'tasted': 1,
                      'atmosphere': 3,
                      'blessed': 1,
                      'melting': 1,
                      'urgent': 2,
                      'track': 1,
                      'bait': 1,
                      'files': 1,
                      'Over': 3,
                      'concussion': 1,
                      'riffle': 1,
                      'instrument': 1,
                      'Heart': 1,
                      'could': 166,
                      "liar's": 1,
                      'enjoying': 2,
                      'Lawrence': 10,
                      'coffeecup': 1,
                      'Federal': 3,
                      'rich': 7,
                      'tap': 1,
                      'Wilson': 19,
                      'Ernest': 2,
                      'aloud': 5,
                      "Pa'd": 1,
                      'appreciation': 2,
                      'planting': 1,
                      'Along': 1,
                      'pride': 5,
                      'reprovingly': 1,
                      'tapping': 2,
                      'rattle': 1,
                      'Nassau': 1,
                      'rites': 2,
                      'Prickly': 1,
                      'Juan': 1,
                      'search': 2,
                      'berated': 1,
                      'ran': 23,
                      'wagons': 1,
                      'optimism': 2,
                      'meant': 7,
                      'pimples': 1,
                      'tiny': 4,
                      'Christianity': 1,
                      'forbids': 1,
                      'Thirty-four': 1,
                      'picture': 8,
                      "Pilate's": 1,
                      'bassinet': 1,
                      'sympathetic': 1,
                      'adapted': 1,
                      'gaunt': 2,
                      'buildings': 2,
                      'Immediately': 1,
                      'between': 13,
                      'cost': 3,
                      'host': 1,
                      'intimate': 3,
                      'beam': 1,
                      'seed': 2,
                      'converts': 1,
                      'eighth': 1,
                      'diagrams': 1,
                      'recognize': 1,
                      'marked': 1,
                      'Anderson': 1,
                      'wagon': 1,
                      'self-flagellation': 1,
                      'higher': 3,
                      "Lincoln's": 1,
                      'Dutch': 2,
                      'swells': 1,
                      'foliage': 1,
                      'transverse': 1,
                      'hearth': 3,
                      'Heights': 3,
                      'Until': 1,
                      'backed': 1,
                      'banner': 1,
                      'movers': 3,
                      'placid': 1,
                      'dismissed': 1,
                      'wall': 17,
                      'oughta': 1,
                      'told': 48,
                      'velvet': 2,
                      'distinctly': 1,
                      'wiping': 3,
                      'The': 369,
                      'heat': 6,
                      'downstairs': 3,
                      'coincidence': 1,
                      'Boy': 2,
                      'candles': 1,
                      'merited': 1,
                      'of': 1419,
                      'wrapper': 1,
                      'shores': 2,
                      'Defrost': 1,
                      'celebrated': 1,
                      'law': 2,
                      'stained': 1,
                      'Sleepy-eyed': 1,
                      "Gre't": 1,
                      'manservant': 1,
                      'police': 1,
                      'whole': 22,
                      'wrack': 1,
                      'rear': 3,
                      'impending': 1,
                      'metal': 5,
                      'yearned': 1,
                      'corner': 18,
                      'sworn': 1,
                      'wanna': 1,
                      'bold': 4,
                      'lived': 9,
                      'trestle': 1,
                      'protest': 5,
                      'spoiled': 1,
                      "Warren's": 1,
                      'roused': 1,
                      'synagogue': 1,
                      "ladies'": 1,
                      'maid': 6,
                      'numb': 1,
                      'stupidest': 1,
                      'facts': 1,
                      'recreation': 1,
                      'Hudson': 8,
                      'fried': 1,
                      'disastrous': 1,
                      'announcing': 1,
                      'craved': 1,
                      'clearness': 1,
                      'display': 1,
                      'Cover': 1,
                      'straightening': 1,
                      '7:25': 1,
                      'swing': 1,
                      'Supplies': 1,
                      'funny': 5,
                      'belligerently': 1,
                      'factory': 1,
                      'Carlson': 2,
                      'outside': 14,
                      'hung': 14,
                      'Stranger': 1,
                      'book': 5,
                      'balls': 2,
                      'suggested': 5,
                      'Ole': 3,
                      'barrack': 1,
                      'stirred': 1,
                      'jacket': 2,
                      'sullen': 2,
                      'intend': 1,
                      'Brigade': 1,
                      'presents': 1,
                      'tableau': 1,
                      'pump': 3,
                      'purpose': 5,
                      'broadened': 1,
                      'give': 22,
                      'clamped': 1,
                      'absolute': 2,
                      'boring': 1,
                      'dishes': 3,
                      'housing': 1,
                      'beauty': 3,
                      'moralistic': 1,
                      'coop': 2,
                      'ammunition': 3,
                      'flatly': 2,
                      'Calvin': 3,
                      'Coughlin': 5,
                      'protestations': 1,
                      'crest': 2,
                      'applying': 1,
                      'allow': 3,
                      'rag': 1,
                      'critical': 1,
                      'blushed': 3,
                      'library': 3,
                      'hawked': 1,
                      'anticipations': 1,
                      'cancer': 1,
                      'missing': 5,
                      'actually': 2,
                      'languishing': 1,
                      'Several': 1,
                      'rocked': 1,
                      'explanation': 2,
                      'replied': 2,
                      'Billy': 1,
                      'jabbed': 2,
                      'everywhere': 2,
                      'nicotine-choked': 1,
                      'questioning': 1,
                      'percolator': 1,
                      'young': 26,
                      'anyway': 3,
                      'sculpture': 5,
                      'bargain': 1,
                      'procurer': 1,
                      'authority': 2,
                      'Point': 2,
                      'Perhaps': 8,
                      'grace': 4,
                      'confused': 3,
                      'cherries': 1,
                      'football': 1,
                      'big': 36,
                      'Fowler': 1,
                      'midair': 1,
                      'Westwood': 1,
                      'fetid': 1,
                      'inspect': 1,
                      'grenades': 1,
                      'services': 1,
                      'nourished': 1,
                      'completely': 3,
                      'Waited': 1,
                      'C': 1,
                      'wooden': 4,
                      'cold': 15,
                      'opened': 17,
                      'augmented': 1,
                      'left': 41,
                      'spoonful': 1,
                      'Stroked': 1,
                      'handlebars': 1,
                      'Holy': 2,
                      'clip': 1,
                      'blast': 1,
                      'stood': 41,
                      'worst': 4,
                      'dreamin': 1,
                      'lipstick': 1,
                      'Demanded': 1,
                      'drinking': 4,
                      'misanthrope': 1,
                      'roughed': 1,
                      'tar': 1,
                      'street': 17,
                      'bow': 4,
                      'doorknob': 1,
                      'long': 53,
                      'sins': 2,
                      'utmost': 1,
                      'current': 2,
                      'senseless': 1,
                      'Huge': 1,
                      'skullcap': 1,
                      'cares': 1,
                      'valet': 2,
                      "H'all": 1,
                      'Rich': 2,
                      'compresses': 1,
                      'now': 100,
                      'composer': 8,
                      'peeked': 1,
                      'dramatic': 4,
                      'Calmly': 1,
                      'suddenly': 23,
                      'De': 6,
                      'gets': 3,
                      'specific': 1,
                      'wig': 1,
                      'bandaged': 2,
                      'lay': 20,
                      'loss': 3,
                      'entanglement': 1,
                      'three': 27,
                      'declaring': 1,
                      'shut': 7,
                      'Jee-sus': 1,
                      'cage': 2,
                      'demand': 1,
                      'stock': 7,
                      'privy': 1,
                      'Scrub': 1,
                      'downed': 1,
                      'San': 2,
                      'pokerfaced': 1,
                      'clearer': 1,
                      'pat': 1,
                      'pure': 1,
                      'extended': 1,
                      'patiently': 1,
                      'lips': 15,
                      'Rathbone': 2,
                      'joy': 5,
                      'detective': 3,
                      'car': 12,
                      'godless': 1,
                      'clambered': 3,
                      'cap': 4,
                      'grows': 1,
                      'sonofabitch': 1,
                      'science': 2,
                      'mackinaw': 1,
                      'be': 254,
                      'wealth': 1,
                      'names': 6,
                      'ready': 17,
                      'Mutton': 5,
                      'glistened': 1,
                      'Ten': 3,
                      'abundance': 1,
                      'forget': 2,
                      'posing': 1,
                      'heap': 2,
                      'engaged': 4,
                      'Fujimoto': 2,
                      'expected': 5,
                      'janitor': 1,
                      'minutes': 12,
                      'Treasury': 1,
                      'remote': 2,
                      'intertwined': 1,
                      'touching': 3,
                      'joined': 2,
                      "He's": 13,
                      'bullshit': 1,
                      'hush': 1,
                      'stalking': 1,
                      'tyrannis': 1,
                      'lettering': 1,
                      'slovenly': 3,
                      'covering': 2,
                      'afternoon': 17,
                      'teasing': 1,
                      'meditate': 1,
                      'status': 1,
                      'rut': 1,
                      'elapsed': 2,
                      'brandishing': 1,
                      'fail': 3,
                      'boarding': 1,
                      'burial': 3,
                      'weeds': 2,
                      'battling': 1,
                      'dignity': 2,
                      'Discourse': 3,
                      'force': 4,
                      'replaced': 1,
                      'overlook': 1,
                      'pipers': 1,
                      'sizes': 1,
                      'first-class': 1,
                      'hire': 1,
                      'wildly': 4,
                      "child's": 2,
                      'tepid': 1,
                      'lighting': 1,
                      'alone': 16,
                      'worshippers': 1,
                      'cent': 2,
                      'Dalloway': 3,
                      'beneath': 5,
                      'bullet': 2,
                      "You'd": 1,
                      'declaration': 1,
                      'into': 147,
                      'strife': 1,
                      'Sunday-school': 1,
                      'Fooled': 1,
                      'snoring': 5,
                      'countenance': 3,
                      'else': 15,
                      'Kahler': 7,
                      'let': 28,
                      'becoming': 5,
                      'May': 2,
                      'promising': 1,
                      'grovel': 1,
                      'necessity': 1,
                      'brother': 7,
                      'Somebody': 3,
                      'managed': 2,
                      'separating': 1,
                      'coyness': 1,
                      'relationship': 2,
                      'murder': 2,
                      'Rabbi': 4,
                      'hocking': 1,
                      'scarf': 2,
                      'mantrap': 1,
                      'sweeping': 2,
                      'associates': 1,
                      'calamities': 1,
                      'bawling': 1,
                      'cleat': 1,
                      'blinked': 3,
                      'ferociously': 1,
                      'length': 6,
                      'involved': 3,
                      'nothings': 1,
                      'straining': 1,
                      'style': 3,
                      'Soothsayer': 1,
                      'unmoved': 1,
                      'switch': 1,
                      'lentils': 1,
                      "Shakespeare's": 1,
                      'electric': 4,
                      'participated': 1,
                      'outdoors': 1,
                      'muttered': 5,
                      'Bud': 1,
                      'absurd': 2,
                      ')': 23,
                      'secrets': 1,
                      'attain': 2,
                      'hoist': 1,
                      'ledgers': 1,
                      'Zionists': 1,
                      'volunteered': 1,
                      'edges': 2,
                      'Otherwise': 1,
                      "I'll": 15,
                      'mosaic': 1,
                      'Turkish': 1,
                      'default': 1,
                      'substituted': 1,
                      'crunched': 1,
                      'Waldensian': 1,
                      'naked': 3,
                      'indefinitely': 1,
                      'merit': 1,
                      'growling': 1,
                      'clothes': 17,
                      'playful': 1,
                      'shutters': 4,
                      'tail': 1,
                      'released': 1,
                      'housework': 1,
                      'aggravated': 1,
                      'long-shanked': 1,
                      'letter': 11,
                      'hide': 3,
                      'feudal': 1,
                      'blueberry': 1,
                      'Its': 2,
                      'Slice': 2,
                      'picket': 1,
                      'parked': 2,
                      'from': 222,
                      'devilish': 1,
                      'jumble': 1,
                      'Ever': 2,
                      'Fat': 1,
                      'warning': 1,
                      'appearance': 4,
                      'Murder': 1,
                      'ball': 9,
                      'Mother': 2,
                      'peonies': 1,
                      'mantel': 1,
                      'melodies': 3,
                      'reality': 4,
                      'undo': 1,
                      'hamper': 2,
                      'instruction': 1,
                      'studied': 8,
                      'twenty-five': 2,
                      'appreciate': 1,
                      'Cleaned': 1,
                      'years': 44,
                      'sticks': 3,
                      'East': 6,
                      'probably': 6,
                      'voulez': 1,
                      'frequently': 1,
                      "Sunday's": 1,
                      'During': 4,
                      "Heat's": 1,
                      'Drury': 1,
                      'chinked': 1,
                      'belch': 1,
                      'tore': 2,
                      'swamps': 2,
                      'drunkenly': 1,
                      'or': 167,
                      'obscenities': 1,
                      'Law': 2,
                      'estates': 1,
                      "Don't": 10,
                      'fifteen': 3,
                      "Alix's": 1,
                      'likes': 1,
                      'city': 18,
                      'sex': 2,
                      'obstruct': 1,
                      'Looking': 2,
                      "'mon": 1,
                      'active': 2,
                      'sensing': 1,
                      'Heaven': 2,
                      'armful': 1,
                      'verandah': 1,
                      'backing': 2,
                      'Shelley': 5,
                      'easier': 2,
                      'thoughtfully': 1,
                      'drive': 5,
                      'general': 2,
                      'sprawled': 3,
                      'Sam': 1,
                      'Afterwards': 1,
                      'heart': 15,
                      'bleak': 2,
                      'national': 2,
                      'History': 1,
                      'ah': 1,
                      'specters': 1,
                      'tomorrow': 8,
                      'foreman': 1,
                      'excellences': 1,
                      'notes': 2,
                      'Myra': 3,
                      'corsage': 1,
                      'Andruses': 1,
                      'called': 28,
                      'opinions': 1,
                      'financed': 1,
                      'workshop': 1,
                      'boaters': 1,
                      'fictional': 1,
                      'year': 9,
                      'Majdan-Tartarski': 1,
                      "Seward's": 3,
                      'victory': 6,
                      'ginmill': 2,
                      'adjusted': 3,
                      'merry': 1,
                      'Pardon': 1,
                      'Too': 3,
                      'courage': 2,
                      'subdued': 1,
                      'red': 13,
                      'bring': 10,
                      'pitcher': 3,
                      'deals': 1,
                      'matriarch': 1,
                      'uneasy': 5,
                      'wonder': 2,
                      'near-absence': 1,
                      'objective': 3,
                      'solution': 1,
                      'directly': 2,
                      'born': 3,
                      'sharply': 2,
                      'makeshift': 1,
                      'Jew': 9,
                      'chance': 7,
                      'Corcoran': 1,
                      'black-market': 1,
                      'scuff': 1,
                      'spirits': 2,
                      'Methodist': 2,
                      'mutilated': 1,
                      'steeple': 9,
                      'Southern': 2,
                      'fellow': 5,
                      'halfway': 2,
                      'Market': 1,
                      'geneticist': 1,
                      'not': 290,
                      'perturbation': 1,
                      'sons': 1,
                      'taught': 4,
                      'fuss': 1,
                      'crook': 1,
                      'headquarters': 3,
                      'basket': 4,
                      'arthritis': 1,
                      'suffer': 1,
                      'minorities': 1,
                      'expiating': 1,
                      'soared': 1,
                      'Rebel': 4,
                      'Eyes': 1,
                      'herding': 1,
                      'pray': 3,
                      'college': 3,
                      'yawning': 2,
                      'climbed': 6,
                      'placed': 7,
                      "Rheinholdt's": 1,
                      'boat': 6,
                      'noble': 1,
                      'Buckra': 2,
                      'set': 17,
                      'sliding': 1,
                      'calumny': 1,
                      'glanced': 5,
                      'winked': 2,
                      'worried': 6,
                      'hated': 6,
                      'arrangement': 1,
                      'campus': 2,
                      'Rebs': 1,
                      'alongside': 1,
                      'contemptuously': 1,
                      'contrived': 1,
                      'crowd': 6,
                      'five': 14,
                      'Bowman': 1,
                      'draper': 1,
                      'Luisa': 1,
                      'common': 4,
                      'poems': 1,
                      'less': 2,
                      'crowed': 1,
                      "rooster's": 1,
                      'Moccasin': 1,
                      'movement': 8,
                      'decorator': 1,
                      'scraping': 1,
                      'thrusting': 1,
                      'dwindling': 1,
                      'united': 2,
                      'Geneva': 3,
                      'Vienot': 1,
                      'Warmongering': 1,
                      'deacon': 1,
                      'straight-backed': 1,
                      'smaller': 1,
                      'cheerful': 2,
                      'gravity': 1,
                      'protect': 1,
                      ...}),
             'government': Counter({'Without': 2,
                      'read': 4,
                      '$1.8': 2,
                      'taxing': 5,
                      'More': 8,
                      '23': 4,
                      'Anthony': 1,
                      'weekend': 1,
                      'entails': 1,
                      'speak': 3,
                      'located': 14,
                      'relocation': 1,
                      'avoiding': 2,
                      'grow': 3,
                      'education': 12,
                      'house': 4,
                      'promises': 1,
                      '1450': 1,
                      'skilled': 8,
                      'Buy': 1,
                      'thickness': 1,
                      'Covington': 1,
                      'tube': 1,
                      'Ballet': 5,
                      'travelers': 1,
                      'endorsement': 1,
                      '607-608': 1,
                      'automotive': 3,
                      'alleged': 2,
                      'individuals': 4,
                      'Between': 2,
                      'threshold': 1,
                      'OBE': 1,
                      'apparatus': 2,
                      'restrain': 2,
                      'operating': 20,
                      'Swift': 1,
                      'softwood': 1,
                      'use': 85,
                      'Procedure': 1,
                      'antennae': 1,
                      'shifts': 1,
                      'lodging': 1,
                      'experience': 10,
                      'applies': 3,
                      'served': 8,
                      'committee': 10,
                      '139': 1,
                      'uncertified': 1,
                      'sports': 5,
                      'population': 12,
                      'Force': 6,
                      'warrants': 2,
                      'attracted': 2,
                      'has': 153,
                      'controlled': 6,
                      'thoroughly': 4,
                      'February': 4,
                      "day's": 1,
                      'maintenance': 22,
                      'when': 56,
                      'our': 144,
                      'along': 7,
                      'procedural': 4,
                      'economics': 2,
                      'Land-based': 1,
                      'certain': 18,
                      'season': 3,
                      'God': 3,
                      'inward': 1,
                      'Reine': 1,
                      'Specifically': 1,
                      'Budgeting': 1,
                      'others': 20,
                      'republic': 2,
                      'budgets': 1,
                      'editorial': 2,
                      'manufacturers': 9,
                      'more': 118,
                      'anyone': 2,
                      'Stamford': 2,
                      'transition': 12,
                      'Laboratory': 6,
                      'contributes': 2,
                      'resolve': 2,
                      'beautiful': 2,
                      'averaged': 2,
                      'mailing': 1,
                      'Electronic': 2,
                      'marriages': 1,
                      'unabated': 1,
                      'initiating': 1,
                      'first': 58,
                      'subsequent': 4,
                      'self-respect': 1,
                      'wires': 2,
                      'related': 12,
                      'developed': 32,
                      'Christiana': 9,
                      'introducing': 1,
                      'films': 2,
                      'sorted': 1,
                      'adding': 1,
                      'cosponsored': 1,
                      'imposition': 1,
                      'company': 11,
                      'constituents': 1,
                      'Radiation': 2,
                      'wall': 6,
                      'page': 10,
                      'greatness': 1,
                      'determine': 15,
                      'Soccer': 2,
                      'under': 89,
                      'Tumor': 4,
                      'correct': 1,
                      'assigns': 2,
                      'corporation': 12,
                      'close': 21,
                      'metal': 9,
                      'roughly': 2,
                      'microscopes': 1,
                      'kitchen': 1,
                      'amount': 43,
                      'NAIR': 1,
                      'blunt': 2,
                      'orders': 7,
                      'tables': 2,
                      'virile': 1,
                      'sue': 1,
                      'ship': 1,
                      'planes': 3,
                      'arranges': 1,
                      'Surgery': 2,
                      'pressure': 11,
                      'seldom': 1,
                      'Md.': 1,
                      '60%': 1,
                      'scientists': 1,
                      'directed': 15,
                      'diminishes': 1,
                      'amended': 11,
                      'Keynotes': 1,
                      'priority': 5,
                      'man': 12,
                      'evils': 3,
                      '606': 1,
                      'draft': 1,
                      'Valley': 1,
                      'sunspot': 1,
                      'action-oriented': 1,
                      'through': 56,
                      'Section': 33,
                      'Following': 2,
                      'anew': 1,
                      'Mussolinis': 1,
                      'reads': 2,
                      'specific': 9,
                      'each': 71,
                      'undue': 1,
                      'Because': 4,
                      'Delaware': 13,
                      'proclamation': 6,
                      'boon': 1,
                      'motion': 1,
                      'Justice': 11,
                      'waters': 2,
                      '1.8': 2,
                      'plain': 1,
                      'forget': 1,
                      'resolute': 1,
                      'burden': 5,
                      'controls': 4,
                      'vegetation': 1,
                      'molecular': 1,
                      'favored': 1,
                      'balanced': 5,
                      'wisdom': 2,
                      'annually': 3,
                      'excludes': 2,
                      'raised': 3,
                      'love': 1,
                      'rise': 3,
                      'approval': 11,
                      'commemorating': 1,
                      'mileage': 8,
                      'sewing': 5,
                      'repeated': 1,
                      'eventually': 5,
                      'Center': 3,
                      'Players': 2,
                      'eternal': 1,
                      'features': 7,
                      'situations': 2,
                      'thereunder': 1,
                      'service': 68,
                      'Haven': 1,
                      'hard': 3,
                      'At': 18,
                      ')': 345,
                      'seas': 3,
                      'Coordination': 1,
                      'mastery': 1,
                      'spindle': 2,
                      'merit': 3,
                      'vessels': 3,
                      'colleagues': 7,
                      'merges': 1,
                      'april': 1,
                      'exigencies': 1,
                      'wire': 4,
                      'Those': 1,
                      'ought': 1,
                      'enlist': 1,
                      '1868': 2,
                      'rejection': 1,
                      '1776': 1,
                      'facility': 3,
                      '125th': 1,
                      'drop': 1,
                      'career': 7,
                      'idly': 1,
                      'calculated': 1,
                      'cause': 13,
                      'friends': 2,
                      'roll': 1,
                      'integral': 2,
                      "pool's": 2,
                      'battery-powered': 2,
                      'incipient': 1,
                      'correlation': 1,
                      'acceptable': 1,
                      'manage': 1,
                      'treasury': 1,
                      'consented': 1,
                      'purchasing': 6,
                      'small': 54,
                      'removal': 1,
                      'Miss': 7,
                      'Times': 11,
                      '6-inch': 1,
                      'hostility': 1,
                      'performances': 1,
                      'continue': 10,
                      'genuinely': 1,
                      'discharged': 2,
                      'warning': 1,
                      'revolutions': 1,
                      'weather': 2,
                      'forward': 8,
                      'recession': 1,
                      'generates': 1,
                      'recently': 10,
                      'tournament': 1,
                      'Macropathology': 1,
                      'distinct': 3,
                      'built-in': 1,
                      'allocable': 3,
                      'legislation': 4,
                      'next': 13,
                      'Registry': 5,
                      'p.': 1,
                      'incurred': 4,
                      "Brown's": 9,
                      'strike': 1,
                      'revenue': 4,
                      'searching': 1,
                      'examined': 1,
                      'appropriating': 1,
                      'really': 1,
                      'peoples': 2,
                      'broad': 11,
                      'lends': 1,
                      'Tube': 1,
                      'passed': 2,
                      'external': 4,
                      'contention': 2,
                      'understanding': 13,
                      'amazement': 1,
                      'mountains': 1,
                      'rushed': 1,
                      'borrowed': 2,
                      'efforts': 17,
                      'signals': 7,
                      'effect': 15,
                      'mountainous': 1,
                      'shielding': 4,
                      'heretofore': 2,
                      'selected': 9,
                      'clothed': 1,
                      'reception': 4,
                      'Anything': 1,
                      'Uses': 1,
                      'ad': 1,
                      'sciences': 1,
                      'embodying': 1,
                      'shift': 5,
                      'King': 1,
                      'Transition': 1,
                      'refund': 8,
                      'Open-flame': 1,
                      'recommending': 4,
                      'family': 10,
                      'physiological': 1,
                      'Lighting': 2,
                      '64-page': 1,
                      'evening': 1,
                      'characteristically': 1,
                      "nation's": 2,
                      'lessen': 2,
                      'heater': 2,
                      'double-wall': 4,
                      'Joseph': 5,
                      'participates': 4,
                      'merits': 1,
                      'define': 2,
                      'Bureau': 10,
                      'daylight': 1,
                      'wives': 1,
                      '106,500': 1,
                      'destructive': 8,
                      'needed': 20,
                      'late': 6,
                      'open': 12,
                      'Communism': 3,
                      'Waltham': 1,
                      'thyratron': 1,
                      'intends': 1,
                      'ask': 3,
                      'invested': 3,
                      'Mediterranean': 1,
                      'found': 18,
                      'Witnesses': 2,
                      'dissent': 1,
                      'overgrazing': 1,
                      'Cement': 1,
                      'Nevertheless': 3,
                      'Major': 1,
                      'machinist': 1,
                      "one's": 3,
                      ':': 110,
                      'cosponsors': 1,
                      'sixty-one': 7,
                      '72': 1,
                      'Membership': 2,
                      'Manuscript': 2,
                      'schedule': 7,
                      'Ventilation': 2,
                      'scholarships': 1,
                      'supported': 3,
                      'foodstuffs': 1,
                      'things': 5,
                      'mycobacteria': 1,
                      'thank': 2,
                      'can': 117,
                      'oratorical': 1,
                      'department': 8,
                      'independent': 7,
                      'similarly': 2,
                      'trailers': 4,
                      'duces': 1,
                      '348': 4,
                      'permission': 1,
                      'divided': 5,
                      'Hall': 3,
                      'Oral': 2,
                      'poured': 1,
                      'manned': 3,
                      "Churchill's": 1,
                      '18': 5,
                      'fleet': 3,
                      'appreciative': 1,
                      'spur': 1,
                      'skeptically': 1,
                      'maps': 3,
                      'cities': 26,
                      'investigations': 2,
                      "Motors'": 3,
                      'advisers': 1,
                      'assessed': 1,
                      'aspire': 1,
                      'formal': 1,
                      'documents': 2,
                      'feelings': 1,
                      'constituted': 1,
                      'intermolecular': 1,
                      'activities': 35,
                      'Early': 1,
                      'great-grandson': 1,
                      'voting': 3,
                      'twister-coners': 1,
                      'Welfare': 1,
                      'leave': 4,
                      'York': 33,
                      'chairman': 2,
                      'signal': 5,
                      'vehicles': 37,
                      '1959': 34,
                      'importance': 17,
                      'power': 17,
                      'leprae': 1,
                      'isolate': 2,
                      'Practical': 1,
                      'apportion': 1,
                      'Westerly': 1,
                      'classes': 7,
                      'any': 142,
                      'executives': 2,
                      'computes': 2,
                      "attorney's": 2,
                      'tended': 1,
                      'factors': 11,
                      'deductible': 1,
                      'scope': 6,
                      'Hence': 2,
                      'outset': 1,
                      'dying': 1,
                      'observe': 1,
                      'perhaps': 4,
                      'required': 35,
                      'talent': 2,
                      'emphasis': 7,
                      'meaningless': 1,
                      'wages': 8,
                      'gross': 4,
                      'Merchant': 4,
                      'Lord': 9,
                      'Division': 25,
                      'crews': 1,
                      '1961': 55,
                      'breadth': 1,
                      '6th': 2,
                      '1680': 1,
                      'resource': 2,
                      'implementation': 1,
                      'discussed': 2,
                      'persons': 9,
                      'experimental': 1,
                      'East': 3,
                      'I.': 3,
                      'equity': 1,
                      'K': 1,
                      'probably': 13,
                      'Atlantic': 3,
                      'shutter': 5,
                      'ordinary': 5,
                      'Projects': 1,
                      'importantly': 2,
                      'addresses': 1,
                      'effected': 2,
                      'Newark': 1,
                      'compilation': 1,
                      'dimension': 3,
                      'uncomfortable': 1,
                      'thousand': 10,
                      'File': 1,
                      'Planning': 7,
                      'govern': 3,
                      'themselves': 19,
                      'developing': 11,
                      'sketch': 1,
                      'academic': 6,
                      'occur': 1,
                      'Shelter': 1,
                      'music': 6,
                      'visit': 1,
                      'experiences': 2,
                      'moral': 1,
                      'simultaneously': 2,
                      'manmade': 1,
                      'compress': 1,
                      'interference': 31,
                      'fine': 1,
                      'remainder': 2,
                      'trustees': 2,
                      'Banker': 1,
                      'showed': 3,
                      'beachhead': 1,
                      'specially': 4,
                      'dropped': 2,
                      'parties': 3,
                      'Virgil': 1,
                      'energies': 2,
                      'terminal': 1,
                      'Mexico': 2,
                      'newsletter': 3,
                      'Forensic': 6,
                      'semi-conductors': 1,
                      '16.7': 1,
                      'unnecessary': 2,
                      'repaid': 2,
                      'her': 3,
                      'Chapter': 11,
                      'bottom': 2,
                      'dysplasia': 1,
                      'fuel': 8,
                      'continuous': 4,
                      'role': 6,
                      'polls': 1,
                      'longer': 4,
                      '328': 1,
                      'induction': 2,
                      'automobiles': 14,
                      'illustration': 3,
                      'correctly': 1,
                      'cost-billing': 1,
                      'Manchester': 3,
                      'domestic': 16,
                      'band': 2,
                      'trends': 1,
                      'visited': 1,
                      'Manufacturers': 3,
                      'ministries': 1,
                      'exploding-wire': 1,
                      'steam': 1,
                      'Also': 4,
                      'radio': 15,
                      'societies': 3,
                      'delivery': 1,
                      'treaties': 1,
                      "petitioner's": 2,
                      'expansions': 3,
                      'Some': 12,
                      'ill': 1,
                      'brochures': 1,
                      'scales': 2,
                      'exchange': 7,
                      'exploding': 3,
                      'due': 29,
                      'considering': 1,
                      'grand': 1,
                      'Fletcher': 1,
                      'selection': 7,
                      'borne': 1,
                      'devastating': 2,
                      'establishments': 1,
                      'Revise': 1,
                      'contraband': 1,
                      'Russian': 6,
                      'checked': 1,
                      'serve': 19,
                      'manpower': 6,
                      '$18.9': 1,
                      'problem': 34,
                      'curtail': 1,
                      'U.s.': 1,
                      'winter': 1,
                      'distance': 3,
                      'Despite': 2,
                      'continuously': 3,
                      'support': 31,
                      'percentage': 16,
                      'allocation': 9,
                      'stage': 4,
                      'Evening': 1,
                      'Yugoslavia': 2,
                      'Purchases': 1,
                      'Trustees': 7,
                      'Catherine': 1,
                      'approach': 13,
                      'reflect': 5,
                      'looking': 3,
                      'finance': 10,
                      'commission': 2,
                      'Rivers': 1,
                      'silicon': 1,
                      'denied': 11,
                      'sugar': 1,
                      'Journal': 11,
                      'Pestle': 1,
                      'thanks': 3,
                      'atmospheres': 3,
                      'manufacture': 9,
                      'British': 3,
                      'hereby': 6,
                      'trading': 1,
                      'moderately': 1,
                      'Time': 1,
                      'past': 13,
                      'incipiency': 2,
                      'Kaisers': 1,
                      'periods': 8,
                      'picnics': 1,
                      'fibers': 2,
                      'tubing': 1,
                      'qualified': 5,
                      'delegation': 1,
                      'exclusive': 3,
                      'collected': 4,
                      'Varying': 1,
                      'attract': 2,
                      'unexpended': 1,
                      'rebutted': 1,
                      'pointed': 3,
                      'infestation': 1,
                      'realized': 4,
                      'determination': 12,
                      'jeopardize': 1,
                      'same': 51,
                      'Americas': 1,
                      'Clinico-pathologic': 3,
                      'servicing': 1,
                      'place': 30,
                      'piece': 3,
                      'foreknowledge': 1,
                      'Education': 3,
                      'vents': 1,
                      'hardly': 1,
                      'committees': 6,
                      'bombers': 6,
                      'variables': 1,
                      'Stalins': 1,
                      'reform': 1,
                      'substantiates': 1,
                      'buildup': 1,
                      'trend': 3,
                      'Communist': 4,
                      'urgent': 5,
                      'track': 2,
                      'mandamus': 1,
                      'safety': 4,
                      '$184': 1,
                      'state-owned': 12,
                      '162,400': 1,
                      'greater': 22,
                      'A.M.A.': 1,
                      'therefore': 19,
                      'expend': 1,
                      'remain': 5,
                      'files': 1,
                      'Over': 2,
                      'three-dimensional': 1,
                      'instrument': 3,
                      'component': 3,
                      'could': 38,
                      'framing': 1,
                      'forte': 1,
                      'Smith': 2,
                      'Lawrence': 1,
                      'conform': 1,
                      'Federal': 43,
                      'rich': 2,
                      'stopped': 1,
                      '10.4': 1,
                      'elaborate': 2,
                      'planting': 1,
                      'Rusk': 1,
                      'creating': 3,
                      'pride': 3,
                      'sequence': 2,
                      'tapping': 1,
                      'Courses': 2,
                      'N.Y.': 2,
                      'wise': 2,
                      'monopolize': 2,
                      'Soviet': 5,
                      'intereference': 1,
                      '1': 107,
                      'Dr.': 7,
                      'optimism': 2,
                      'Cars': 2,
                      'Heywood': 2,
                      'meant': 1,
                      'Monroe': 5,
                      'Refund': 1,
                      'integration': 1,
                      'Prior': 3,
                      'rupees': 14,
                      'sentenced': 1,
                      'utensils': 1,
                      'retains': 1,
                      'endow': 1,
                      'post-World': 1,
                      'lengthened': 1,
                      'inventory': 14,
                      'Rico': 18,
                      'contain': 3,
                      'underground': 4,
                      'guarding': 1,
                      'prevents': 1,
                      'substantially': 13,
                      'for': 806,
                      'adapted': 1,
                      'performance': 10,
                      'count': 2,
                      'calorimeter': 2,
                      'slipped': 1,
                      'equally': 3,
                      'buildings': 5,
                      'between': 44,
                      'rare': 2,
                      'amici': 2,
                      'cost': 34,
                      'original': 8,
                      'host': 7,
                      'reading': 3,
                      'welding': 1,
                      '185th': 1,
                      'corrosive': 1,
                      'Richmond': 3,
                      'heads': 1,
                      'Inter-American': 1,
                      'External': 2,
                      'employee': 5,
                      'report': 39,
                      'ring': 1,
                      'extinction': 1,
                      'subscription': 1,
                      'dweller': 1,
                      'appear': 3,
                      'Instrument': 1,
                      'Roman': 3,
                      'Atlas': 6,
                      'recognize': 2,
                      'improve': 3,
                      'World': 7,
                      'Body': 2,
                      'inhabitants': 1,
                      '603': 1,
                      '$15': 1,
                      'Greene': 1,
                      'higher': 14,
                      '270,000': 1,
                      'Pakistan': 1,
                      'Vice': 2,
                      'diplomatic': 1,
                      'Embassy': 1,
                      '$10.1': 1,
                      'eyes': 3,
                      'Security': 8,
                      "Sprague's": 1,
                      'filled': 5,
                      'energy': 7,
                      'simple': 5,
                      'Harvard': 1,
                      'Until': 1,
                      'Airpark': 1,
                      'student-directed': 1,
                      'marketing': 7,
                      'handicapped': 2,
                      'rehearsals': 1,
                      'retained': 1,
                      'fruit': 1,
                      'sixties': 1,
                      'dismissed': 2,
                      'oppression': 1,
                      'Wildlife': 1,
                      'composers': 1,
                      'they': 92,
                      'done': 6,
                      'delivered': 4,
                      'discovery': 1,
                      'Provost': 1,
                      'civic': 4,
                      'banks': 4,
                      'may': 153,
                      'The': 478,
                      'casual': 1,
                      'promise': 1,
                      'D.': 6,
                      'gases': 2,
                      'cycled': 1,
                      '$.027': 2,
                      'Boston': 4,
                      'Pooling': 1,
                      'Pageants': 2,
                      'cm': 1,
                      'of': 3031,
                      'Addabbo': 2,
                      'Application': 4,
                      'materially': 1,
                      '542,250': 1,
                      'celebrated': 1,
                      '47': 1,
                      'redistributed': 1,
                      'law': 21,
                      '$.07': 3,
                      'abroad': 19,
                      '1920s': 1,
                      'nearing': 2,
                      'Illustrations': 1,
                      'defense': 17,
                      'quarter': 1,
                      'combined': 3,
                      'Selling': 1,
                      'acting': 2,
                      'observed': 3,
                      'sixteen': 1,
                      'noticeable': 1,
                      'White': 1,
                      'felt': 6,
                      'private': 12,
                      'corner': 3,
                      'contribution': 6,
                      'Forty-seven': 1,
                      'Migs': 1,
                      'Low': 1,
                      'wars': 1,
                      'Rehabilitation': 2,
                      'seized': 1,
                      'spoiled': 1,
                      'directionally': 1,
                      'drivers': 2,
                      'lack': 8,
                      'together': 8,
                      "producers'": 1,
                      'High': 1,
                      'compute': 1,
                      'unchallenged': 1,
                      'low': 5,
                      'facts': 7,
                      'recreation': 8,
                      'microwaves': 1,
                      'musical': 3,
                      'cetera': 1,
                      'assembled': 4,
                      'Subsequent': 1,
                      'survival': 1,
                      'L.': 4,
                      "Armada's": 1,
                      'B-47': 1,
                      'reflects': 3,
                      'pre-1960': 1,
                      'display': 2,
                      'documentary': 1,
                      'faster': 1,
                      'too': 13,
                      'thermoplastic': 1,
                      'jacketed': 1,
                      'quirk': 1,
                      'expended': 3,
                      'exciting': 1,
                      'Theater': 1,
                      'factory': 1,
                      'entry': 2,
                      'overhead': 1,
                      'undigested': 1,
                      'Remarks': 4,
                      'outside': 16,
                      'general': 34,
                      'astrophysics': 2,
                      'arc': 3,
                      'faces': 1,
                      'relation': 7,
                      'affect': 1,
                      'aids': 6,
                      'upheld': 1,
                      'sets': 23,
                      'following': 16,
                      'Adjusting': 1,
                      'Leesona-Holt': 2,
                      'calendar': 20,
                      'Basic': 2,
                      'event': 7,
                      'encouragement': 2,
                      'employees': 25,
                      'stabilization': 2,
                      'Washington': 29,
                      'bearer': 1,
                      'mentions': 1,
                      'subtracting': 2,
                      'decree': 1,
                      'recommendation': 15,
                      'Rural': 2,
                      'how': 16,
                      'publishes': 1,
                      'towns': 29,
                      'International': 5,
                      'Musical': 2,
                      'occupants': 1,
                      'marking': 1,
                      'presents': 1,
                      'watchmaker': 1,
                      'above': 35,
                      'fees': 5,
                      'purpose': 22,
                      'relative': 5,
                      'olim': 1,
                      'broadened': 1,
                      'give': 21,
                      'one-twentieth': 1,
                      'dire': 1,
                      'absolute': 7,
                      'Thus': 9,
                      'membership': 3,
                      'assemble': 1,
                      'successfully': 3,
                      'medieval': 1,
                      'portable': 1,
                      'enormous': 1,
                      'Splinting': 1,
                      'Indirectly': 1,
                      'practice': 11,
                      'housing': 2,
                      'beauty': 1,
                      'minute': 3,
                      'ammunition': 1,
                      'unsuccessful': 1,
                      'tunnel': 1,
                      'dealers': 1,
                      'Amendment': 2,
                      'Richard': 1,
                      'thermometer': 6,
                      'semester': 3,
                      'Continuation': 1,
                      'mapping': 5,
                      'mutual': 2,
                      'solid-fueled': 1,
                      'statistics': 2,
                      'Chambers': 1,
                      'applying': 1,
                      'allow': 3,
                      'critical': 7,
                      'library': 2,
                      'fertilizer': 1,
                      'payments': 27,
                      'invariably': 1,
                      'Leading': 1,
                      'identified': 2,
                      '10-year': 5,
                      'withholding': 5,
                      'inadequacy': 1,
                      'performing': 3,
                      'actually': 2,
                      'advice': 10,
                      'Several': 2,
                      'Paragraph': 1,
                      'explanation': 2,
                      'priced': 1,
                      'coveted': 1,
                      'overload': 1,
                      'Investigation': 5,
                      'sea': 2,
                      'public': 43,
                      'thought': 7,
                      'camera': 3,
                      'employed': 7,
                      'Very': 1,
                      'corrugated': 1,
                      'on': 401,
                      'response': 5,
                      'conviction': 4,
                      'upgrading': 2,
                      'favorable': 6,
                      'authority': 14,
                      'Our': 20,
                      'Perhaps': 3,
                      'confused': 1,
                      'missile': 9,
                      'big': 4,
                      'plant': 30,
                      'commutation': 1,
                      'earnestly': 1,
                      'long-term': 15,
                      'zinc': 4,
                      'quantities': 2,
                      'inspect': 3,
                      'Agreement': 25,
                      'Saddle': 2,
                      'Vapor': 1,
                      'clearly': 5,
                      'completely': 4,
                      'abides': 1,
                      'Paced': 1,
                      'C': 28,
                      'wooden': 2,
                      'civilization': 1,
                      'cold': 2,
                      'financial': 28,
                      'notes': 13,
                      'coverage': 3,
                      'Expenditures': 2,
                      'ration': 1,
                      'Employee': 1,
                      'Cambridge': 1,
                      'normally': 1,
                      'utterly': 2,
                      'repayable': 1,
                      'hip': 1,
                      'twenty-three': 2,
                      'mere': 1,
                      '1833': 1,
                      'identifying': 2,
                      'warm': 2,
                      'proclaim': 7,
                      'stood': 2,
                      'worst': 4,
                      "Railroad's": 1,
                      'transitions': 2,
                      'lend': 1,
                      'reminding': 2,
                      'contracted': 2,
                      'impact': 5,
                      'desks': 1,
                      'physical': 6,
                      'Heritage': 3,
                      'long': 16,
                      'specimens': 2,
                      'tooth': 3,
                      'current': 21,
                      ...}),
             'hobbies': Counter({'Without': 4,
                      'vacuum': 4,
                      'festival': 1,
                      'shimmering': 1,
                      '25-cents': 1,
                      'dig': 1,
                      '23': 2,
                      'Outside': 3,
                      'lighted': 1,
                      'anyhow': 1,
                      'speak': 2,
                      'located': 12,
                      'economics': 1,
                      'infectious': 3,
                      'practising': 1,
                      'education': 13,
                      'house': 47,
                      'displaced': 1,
                      'promises': 1,
                      'skilled': 3,
                      'muscles': 9,
                      'escutcheons': 1,
                      'non-job-connected': 1,
                      'thickness': 13,
                      'darling': 1,
                      'tube': 5,
                      'building': 26,
                      'waters': 3,
                      'helpless': 1,
                      'Minnie': 1,
                      'Between': 2,
                      'More': 5,
                      'threshold': 1,
                      'monomer': 1,
                      'junction': 1,
                      'restrain': 1,
                      'operating': 10,
                      '90-degree': 2,
                      'use': 90,
                      'crags': 1,
                      'Limit': 1,
                      'experience': 20,
                      'served': 10,
                      'Raceway': 2,
                      'octagonal': 1,
                      'equalize': 1,
                      'xenon': 1,
                      'bars': 23,
                      'Girl': 1,
                      'population': 3,
                      'Sr.': 2,
                      'tempted': 1,
                      'overhangs': 2,
                      'thoroughly': 9,
                      'February': 6,
                      'Presto': 1,
                      'gaggle': 1,
                      'Much': 1,
                      'Renaults': 1,
                      'worth-while': 1,
                      'certain': 16,
                      'season': 9,
                      "NRLDA's": 1,
                      'shade': 6,
                      'inspector': 2,
                      'others': 22,
                      'stronghold': 1,
                      'vicinity': 2,
                      'manufacturers': 15,
                      'grow': 9,
                      'strains': 1,
                      'transition': 1,
                      'Only': 5,
                      'catch': 5,
                      'handiest': 1,
                      'beautiful': 10,
                      'amazing': 1,
                      'shorter': 3,
                      "They're": 6,
                      'Greenleaf': 1,
                      'Mar.': 1,
                      'well-designed': 1,
                      'denying': 1,
                      'Frisco': 1,
                      'lapses': 1,
                      'breather': 1,
                      'retains': 1,
                      'Eighty': 1,
                      '15,500-lb.': 1,
                      'releasing': 1,
                      'cosponsored': 1,
                      'company': 31,
                      'Enough': 1,
                      'Stevens': 2,
                      'mon': 1,
                      'chic': 1,
                      'greatness': 2,
                      'determine': 15,
                      'foreleg': 1,
                      'Limits': 1,
                      'correct': 8,
                      'toughest': 1,
                      'corporation': 2,
                      'predetermined': 1,
                      'cups': 5,
                      'kitchen': 9,
                      'Debonnie': 3,
                      'amount': 13,
                      'crack': 1,
                      'Called': 2,
                      'Seeing': 1,
                      'Beccaria': 2,
                      'Md.': 1,
                      'cheek': 1,
                      'Challenge': 1,
                      'lets': 2,
                      'undertow': 1,
                      'serratus': 2,
                      'reinforcing': 1,
                      'through': 77,
                      'Following': 5,
                      'company-paid': 1,
                      'Buckhannon': 1,
                      'Apple': 1,
                      'Just': 7,
                      'specific': 9,
                      'expendable': 1,
                      'each': 82,
                      'loosening': 1,
                      'Because': 11,
                      'Plus': 1,
                      'Coffee': 2,
                      'ink': 1,
                      'tons': 3,
                      'Dennis': 1,
                      'eye-filling': 1,
                      "grocer's": 1,
                      'Physique': 1,
                      'forget': 7,
                      'apparently': 2,
                      'sturgeon': 1,
                      'Shade': 1,
                      'craft': 6,
                      'fernery': 1,
                      'cottage': 3,
                      'tongue-in-cheek': 1,
                      'raised': 6,
                      'magnificence': 1,
                      'stray': 1,
                      'extent': 4,
                      'probability': 3,
                      'stream': 6,
                      'encompasses': 1,
                      'Half': 1,
                      'buggy': 1,
                      'fabulous': 2,
                      'suburbs': 1,
                      'Figs.': 4,
                      'situations': 3,
                      'Haven': 4,
                      'fences': 2,
                      'At': 25,
                      'lb-plus': 1,
                      "I'm": 2,
                      'focused': 1,
                      'economizing': 1,
                      'vessels': 3,
                      'jewels': 1,
                      'balconies': 1,
                      'scoop': 1,
                      'Those': 6,
                      'surprisingly': 2,
                      'treasures': 1,
                      'Nelson': 1,
                      'drop': 4,
                      'thermoforming': 1,
                      'career': 3,
                      'cartilage': 1,
                      'husband': 1,
                      'friends': 8,
                      'chop': 2,
                      'twists': 3,
                      'integral': 1,
                      'shouting': 1,
                      'correlation': 1,
                      'kiloton': 1,
                      'slip': 8,
                      'acetonemia': 3,
                      "patient's": 1,
                      'mattresses': 1,
                      'terry': 1,
                      'small': 55,
                      'bull': 2,
                      'jam': 1,
                      'enemy': 1,
                      'amuse': 2,
                      'lifters': 4,
                      'understatement': 1,
                      'Hanover-Misty': 1,
                      'skyline': 1,
                      'snag': 2,
                      'knife': 3,
                      'drilled': 3,
                      'missiles': 9,
                      'recently': 11,
                      'earthy': 1,
                      'landmarks': 3,
                      'built-in': 1,
                      'sportsmen': 6,
                      'Heel-Holiday': 1,
                      'next': 19,
                      'mustard': 12,
                      'strike': 3,
                      'screens': 1,
                      'shall': 5,
                      'searching': 1,
                      'e.g.': 2,
                      'really': 14,
                      'peoples': 2,
                      'broad': 6,
                      'Foliage': 1,
                      'external': 2,
                      'inflicted': 1,
                      'Eddie': 2,
                      'soup': 3,
                      'signals': 7,
                      'effect': 12,
                      'Pete': 1,
                      'exact': 6,
                      'stitch': 2,
                      'artists': 3,
                      'advantages': 3,
                      'lag': 1,
                      'hammerless': 1,
                      'Jacky': 1,
                      'masculine': 3,
                      'evening': 4,
                      'twelve': 4,
                      'Gables': 1,
                      'impersonated': 1,
                      'furbishing': 1,
                      'Joseph': 2,
                      'Drawn': 1,
                      'keen': 2,
                      'KCs': 1,
                      'relax': 1,
                      'Bureau': 3,
                      'P': 1,
                      'colts': 4,
                      'Health': 2,
                      'three-bedroom': 1,
                      "Prokofieff's": 6,
                      'life-long': 1,
                      '``': 227,
                      'assets': 1,
                      'ask': 2,
                      'pleasures': 1,
                      'found': 33,
                      'pegging': 1,
                      'profile': 1,
                      'owning': 2,
                      'win': 9,
                      'read': 8,
                      'domes': 3,
                      'pretty': 8,
                      'scholarships': 1,
                      'barges': 1,
                      "clown's": 1,
                      'things': 23,
                      'Now': 14,
                      'job': 35,
                      'tranquilizer': 1,
                      'Anders': 2,
                      'department': 13,
                      'provision': 3,
                      'independent': 3,
                      'boatels': 1,
                      'divided': 6,
                      'suppose': 2,
                      'feed': 84,
                      'Bent-Arm': 1,
                      'vertical': 7,
                      'bits': 2,
                      'tops': 2,
                      '18': 3,
                      'darkest': 1,
                      'cocked': 1,
                      'spur': 1,
                      'stops': 1,
                      'sir': 1,
                      'region': 5,
                      'bat': 2,
                      'Acadia': 1,
                      'LP': 1,
                      'pricing': 2,
                      'our': 77,
                      'German': 6,
                      'Molly': 1,
                      'Acorns': 1,
                      'dissolved': 1,
                      'signal': 14,
                      'vehicles': 3,
                      '1959': 4,
                      'Pullover': 2,
                      'managing': 1,
                      'Traditionalist': 1,
                      'Discovery': 2,
                      'taper': 1,
                      'shoot': 6,
                      'improves': 7,
                      'floured': 1,
                      'fabric': 1,
                      'deductible': 1,
                      'spacecraft': 3,
                      'unrifled': 1,
                      'Bright': 1,
                      'employee-contributed': 1,
                      'outset': 2,
                      'unaccountably': 1,
                      'cycles': 1,
                      'required': 15,
                      'Sammy': 1,
                      'wages': 1,
                      'Division': 6,
                      '1961': 9,
                      'Beautiful': 1,
                      'strip': 8,
                      'breadth': 3,
                      'gloss': 1,
                      'hunting': 17,
                      'Beach': 2,
                      'run': 23,
                      'discussed': 1,
                      'stores': 2,
                      'sentimentality': 1,
                      'pressure-sensing': 1,
                      'diarrhea': 7,
                      'wooded': 3,
                      'importantly': 1,
                      'runabout': 1,
                      'wire': 5,
                      'diameters': 1,
                      'feather': 1,
                      'dimension': 2,
                      'install': 5,
                      'handyman': 2,
                      'themselves': 17,
                      'developing': 5,
                      'academic': 10,
                      'dome': 7,
                      'timbre': 1,
                      'valves': 2,
                      'experiences': 3,
                      'eligible': 3,
                      'stumps': 1,
                      'positive': 1,
                      'Santa': 10,
                      'lawsuits': 1,
                      'now-famous': 1,
                      'provoke': 1,
                      'horse-radish': 2,
                      'Nina': 1,
                      'sizzling': 1,
                      'speaks': 1,
                      'anyone': 8,
                      'vies': 1,
                      'reader': 2,
                      'shank': 1,
                      'four-sided': 1,
                      '13/16-inch': 2,
                      'pulled': 7,
                      'background': 12,
                      'dual-ladder': 1,
                      'continuous': 4,
                      'sprinkling': 3,
                      'framed': 1,
                      'buns': 8,
                      'methods': 11,
                      'novelty': 2,
                      'soften': 1,
                      'Manufacturers': 3,
                      'loaded': 5,
                      'meticulously': 2,
                      'Greentree': 1,
                      'Also': 10,
                      'radio': 2,
                      'coat': 7,
                      'delivery': 6,
                      'seacoast': 1,
                      'trees': 15,
                      'Made': 3,
                      'biology': 1,
                      'lecturing': 2,
                      'scales': 1,
                      'Japanese': 1,
                      'exchange': 1,
                      'Frederick': 1,
                      'due': 13,
                      'attainment': 1,
                      'congratulated': 1,
                      'Turnpike': 2,
                      'unabashed': 1,
                      'sixty': 1,
                      'checked': 5,
                      'manpower': 2,
                      'midwestern': 2,
                      'problem': 29,
                      'curtail': 1,
                      'pulse': 2,
                      'winter': 13,
                      'distance': 14,
                      '5000': 2,
                      'percentage': 4,
                      'bevels': 1,
                      'tidbits': 1,
                      'dares': 1,
                      'midweek': 1,
                      'floorboards': 1,
                      'informal': 1,
                      'reflect': 5,
                      'looking': 7,
                      'Journal': 2,
                      'pleas': 1,
                      'sangaree': 1,
                      'British': 1,
                      'Unlimited': 1,
                      'Well-stretched': 1,
                      'clever': 3,
                      'niceties': 1,
                      'humor': 2,
                      'anchovy': 1,
                      'fibers': 1,
                      'collected': 2,
                      'Big': 1,
                      'bottle': 3,
                      'roofed': 1,
                      'vastly': 1,
                      'recoil': 3,
                      'donating': 1,
                      'Usually': 3,
                      'Position': 2,
                      'microscope': 4,
                      'slowly': 9,
                      'place': 48,
                      'piece': 42,
                      'Itasca': 1,
                      'Sturdy': 1,
                      'rode': 4,
                      'payroll': 1,
                      'Vases': 2,
                      'bombers': 6,
                      '1861': 1,
                      'tasted': 1,
                      'atmosphere': 2,
                      'electronically': 1,
                      'union': 4,
                      'melting': 2,
                      '1821': 2,
                      'land': 18,
                      'galvanism': 1,
                      'railroad': 2,
                      'files': 1,
                      'Over': 2,
                      'instrument': 1,
                      'think': 11,
                      'could': 58,
                      'enjoying': 1,
                      'Federal': 1,
                      'rich': 7,
                      'stopped': 1,
                      'Wilson': 1,
                      'Ernest': 2,
                      'tappet': 15,
                      'spaciousness': 1,
                      'elaborate': 2,
                      'Along': 1,
                      'Bottoms': 1,
                      'twisting': 2,
                      'tapping': 1,
                      'brute': 1,
                      'metered': 2,
                      'customs': 1,
                      'ran': 2,
                      'wagons': 4,
                      'meant': 3,
                      'grease': 2,
                      'rocket': 2,
                      'intent': 1,
                      'tiny': 4,
                      'showmanship': 1,
                      'Displacement': 1,
                      'inventory': 1,
                      '1882': 1,
                      'picture': 11,
                      'Rubber': 2,
                      'gussets': 1,
                      'curious': 2,
                      'sympathetic': 1,
                      'watercolorist': 2,
                      'buildings': 14,
                      'between': 41,
                      'cost': 49,
                      'ordinarius': 1,
                      'Cumhuriyet': 1,
                      'host': 4,
                      'intimate': 1,
                      'premieres': 1,
                      'beam': 3,
                      'butyrate': 9,
                      'outmoded': 1,
                      'diagrams': 1,
                      'masterpiece': 1,
                      'buckle-on': 1,
                      'Atlas': 1,
                      'recognize': 3,
                      'improve': 10,
                      'irreparable': 1,
                      'marked': 7,
                      'wagon': 4,
                      'plastisols': 1,
                      'higher': 13,
                      'Dutch': 1,
                      'cabinets': 2,
                      'foliage': 2,
                      'piston': 7,
                      'convenient-type': 1,
                      'firmly': 7,
                      'Until': 2,
                      'marketing': 30,
                      'backed': 1,
                      'kob': 1,
                      'gymnast': 1,
                      'extras': 1,
                      'amplified': 2,
                      'fruit': 8,
                      '$.50': 2,
                      'endowments': 1,
                      'Factors': 1,
                      'Trackdown': 1,
                      'composers': 2,
                      '1-ton': 1,
                      '2:22': 2,
                      'distinctly': 1,
                      'wiping': 1,
                      '$310': 1,
                      'Simple': 2,
                      'Transportation': 1,
                      'The': 458,
                      'cowboys': 1,
                      'promise': 4,
                      'D.': 2,
                      'vinyl': 2,
                      'sportsman': 1,
                      'rascal': 1,
                      'candles': 1,
                      'of': 2390,
                      'under': 52,
                      'screwed': 10,
                      '47': 2,
                      '$10,000': 1,
                      'law': 5,
                      'tackle': 1,
                      'decimal': 3,
                      'Rim-Fire': 1,
                      'displacement': 12,
                      'nearing': 1,
                      'stained': 1,
                      'inboard': 1,
                      'rear': 9,
                      'Creed': 1,
                      'chimiques': 1,
                      'perky': 1,
                      'conception': 1,
                      'bold': 1,
                      'intra-company': 1,
                      'upland': 2,
                      'contribution': 4,
                      'Pump': 1,
                      'gallonage': 1,
                      'Information': 1,
                      'fractions': 4,
                      'Not': 8,
                      'fecundity': 1,
                      'Juniors': 29,
                      'Bridges': 1,
                      'rehearing': 1,
                      "Dexter's": 1,
                      'sleek': 1,
                      'Finals': 7,
                      'Ritter': 9,
                      'facts': 3,
                      'recreation': 21,
                      'Hudson': 3,
                      "70's": 1,
                      'assembled': 2,
                      'tempos': 1,
                      'Showmanship': 8,
                      'Cover': 1,
                      'policing': 1,
                      'straightening': 1,
                      'jacketed': 2,
                      'Outdoor': 3,
                      'warehousing': 1,
                      'blacked-in': 1,
                      'factory': 1,
                      'entry': 3,
                      "America's": 4,
                      'outside': 15,
                      'tall-growing': 1,
                      'gloves': 2,
                      'arc': 1,
                      'relation': 9,
                      'Timothy': 8,
                      'book': 3,
                      'droves': 1,
                      'suggested': 6,
                      'minimal': 5,
                      'Basic': 1,
                      'fame': 1,
                      'employees': 15,
                      'velvety': 2,
                      'refrigeration': 2,
                      '3-1/2': 1,
                      'Mechanized': 1,
                      'pressure': 19,
                      'bound': 3,
                      'intend': 2,
                      'heartbeat': 2,
                      'old-time': 1,
                      'presents': 3,
                      'pump': 6,
                      'Reducing': 1,
                      'purpose': 11,
                      'Caution': 5,
                      'give': 23,
                      'town': 13,
                      'clamped': 2,
                      'one-twentieth': 1,
                      'absolute': 3,
                      'membership': 9,
                      'Sugar': 1,
                      'boring': 2,
                      'scientists': 2,
                      'dishes': 2,
                      'mobility': 2,
                      'beauty': 6,
                      'gamut': 1,
                      'ammunition': 6,
                      "Henri's": 2,
                      'inhibition': 1,
                      'soldering': 1,
                      'rooting': 2,
                      '1777': 1,
                      'dealers': 11,
                      'Richard': 1,
                      'directed': 2,
                      'multiplication': 3,
                      'aesthetic': 3,
                      'strenuous': 1,
                      'applying': 3,
                      'allow': 12,
                      'rag': 1,
                      'Boissoneault': 1,
                      'similarity': 1,
                      'Aj': 16,
                      'chutney': 1,
                      'Simca': 1,
                      'travelogue': 1,
                      'actually': 11,
                      'Several': 5,
                      'rocked': 1,
                      'Westinghouse': 1,
                      'softened': 1,
                      'juice': 2,
                      'Billy': 1,
                      'Glazed': 1,
                      'Gardens': 5,
                      'after': 49,
                      'young': 14,
                      'vertical-takeoff-and-landing': 1,
                      'Very': 1,
                      'Arms': 2,
                      'corrugated': 2,
                      'man': 16,
                      'response': 1,
                      'article': 3,
                      'Trim': 4,
                      'pool-side': 2,
                      'authority': 6,
                      'Point': 2,
                      'Perhaps': 1,
                      'grace': 1,
                      'confused': 1,
                      'Audubon': 1,
                      'coverings': 1,
                      'missile': 19,
                      'big': 21,
                      'long-term': 2,
                      'unbelievable': 1,
                      'inspect': 1,
                      'E': 3,
                      '260': 2,
                      'trajectory': 1,
                      'Cod': 1,
                      'attempted': 2,
                      'completely': 13,
                      'C': 7,
                      'wooden': 17,
                      'cold': 10,
                      'blanks': 2,
                      'opened': 6,
                      'left': 39,
                      'Abilene': 2,
                      '$.10-a-minute': 1,
                      'Simmer': 2,
                      'fortified': 1,
                      'Charlottesville': 1,
                      'Holy': 1,
                      'clip': 3,
                      'stood': 5,
                      'worst': 1,
                      'rounds': 1,
                      'deck': 1,
                      'lend': 1,
                      'drinking': 2,
                      'toasted': 1,
                      'AAA': 1,
                      'street': 8,
                      'ex-President': 1,
                      'bow': 2,
                      'patterns': 10,
                      'Hanover-Bertie': 1,
                      'Konzerthaus': 1,
                      'Strongheart': 1,
                      'adjoined': 1,
                      'locally': 2,
                      'award': 3,
                      'fill': 7,
                      'modernism': 1,
                      'rusting': 1,
                      '$200': 2,
                      'wears': 1,
                      'transom': 13,
                      'Harlan-Marcia': 1,
                      'now': 61,
                      'Column': 4,
                      'electrocardiograph': 2,
                      'dramatic': 2,
                      'radiant': 1,
                      'taunts': 1,
                      'calibers': 3,
                      "'round": 2,
                      "emperor's": 1,
                      'commonplace': 2,
                      'upsets': 1,
                      'colder': 1,
                      'De': 4,
                      'Sporting': 2,
                      'Yankee': 1,
                      'Karet': 1,
                      '2:06.1': 1,
                      'reckon': 1,
                      'five-cent': 1,
                      'loss': 4,
                      'three': 44,
                      'shortsighted': 1,
                      'automatic': 3,
                      'Latest': 1,
                      'demand': 14,
                      'batch': 1,
                      'stock': 14,
                      'Spring': 3,
                      'allot': 1,
                      'planks': 1,
                      'San': 4,
                      'bridge': 40,
                      'clearer': 2,
                      'industrial': 4,
                      'pure': 3,
                      'organize': 1,
                      'extended': 3,
                      'timers': 1,
                      'refers': 1,
                      'handstand': 1,
                      '06-05': 2,
                      'accorded': 1,
                      'blossom': 4,
                      'Cylinder': 2,
                      'suave': 1,
                      'joy': 1,
                      "community's": 1,
                      'continuity': 1,
                      "Wouldn't": 1,
                      'apartment': 2,
                      'motion': 3,
                      'car': 37,
                      'Analyzer': 1,
                      'miscalculations': 1,
                      'expanse': 1,
                      'Quintet': 1,
                      'A.': 1,
                      'finely': 1,
                      'Methyl': 1,
                      'salad': 5,
                      'present': 26,
                      'describes': 1,
                      'throws': 1,
                      'galley': 1,
                      'science': 8,
                      'persistence': 1,
                      'be': 508,
                      'wealth': 3,
                      'respiratory': 3,
                      'racing': 1,
                      'device': 2,
                      'ready': 8,
                      'reconsidered': 1,
                      'public-spirited': 1,
                      'formulated': 1,
                      'glistened': 1,
                      'culture': 2,
                      'abundance': 1,
                      'shares': 3,
                      'Hawaii': 2,
                      'Hackmann': 1,
                      'Plastics': 1,
                      'conclusion': 1,
                      'ant': 1,
                      'reasons': 5,
                      'auto': 8,
                      'essay': 3,
                      'minutes': 17,
                      'Missiles': 1,
                      'Gas': 2,
                      'Treasury': 1,
                      'equipped': 11,
                      'peerless': 1,
                      'joined': 4,
                      "He's": 2,
                      'grower': 1,
                      'purchasers': 1,
                      'Saracens': 1,
                      'injection-molded': 1,
                      'locale': 1,
                      'circus': 1,
                      'load': 12,
                      'distributor': 6,
                      'favored': 1,
                      'enforcement': 1,
                      'afternoon': 3,
                      'side-looking': 1,
                      'scored': 6,
                      'indistinguishable': 1,
                      'Dream': 2,
                      'drained': 1,
                      'Fascism': 1,
                      'concentrates': 1,
                      'status': 12,
                      'commercial': 7,
                      'specify': 2,
                      'exceptional': 3,
                      'central': 12,
                      'scooping': 1,
                      'tablespoons': 7,
                      'grand-looking': 1,
                      'hazard': 2,
                      'weeds': 1,
                      'vacuuming': 2,
                      'force': 23,
                      'wing-shooting': 1,
                      'scenic': 6,
                      'replaced': 12,
                      'youth': 1,
                      'prevented': 2,
                      'Dean': 2,
                      'sizes': 5,
                      'obligations': 1,
                      'cringing': 1,
                      'Aureomycin': 5,
                      'eastern': 1,
                      'Pansies': 4,
                      'RCA': 2,
                      'vacationers': 1,
                      'blood-filled': 1,
                      'Eating': 1,
                      'Naval': 1,
                      "child's": 2,
                      'lighting': 6,
                      'alone': 16,
                      'Buries': 1,
                      'pickle': 1,
                      'bullet': 4,
                      '989': 1,
                      'handgun': 3,
                      '70-mile-long': 1,
                      'into': 108,
                      'principally': 2,
                      'Initial': 1,
                      'Oranges': 2,
                      'denouncing': 1,
                      'affiliated': 1,
                      'millimeter': 1,
                      'haul': 2,
                      'Abraham': 1,
                      'Jones-Imboden': 1,
                      'keel': 4,
                      'else': 6,
                      'skipped': 1,
                      'justify': 1,
                      'let': 13,
                      'lengthwise': 2,
                      'forecasts': 1,
                      'checking': 1,
                      'localities': 1,
                      'May': 12,
                      'Laguna': 1,
                      'promising': 5,
                      'outraged': 1,
                      'encourages': 2,
                      'discovery': 5,
                      'excelsior': 1,
                      'team': 8,
                      'managed': 2,
                      'thirty-foot': 1,
                      'method': 16,
                      'accommodating': 1,
                      'election': 1,
                      'sweeping': 1,
                      'landscapes': 1,
                      'Years': 1,
                      '2': 37,
                      'hold': 15,
                      'trimming': 2,
                      'seven-shot': 1,
                      '400': 3,
                      'Chlortetracycline': 1,
                      'violent': 1,
                      'length': 14,
                      'mayonnaise': 1,
                      "Russia's": 2,
                      'pupils': 2,
                      'rockets': 4,
                      'rolls': 1,
                      'Bring': 1,
                      'switch': 5,
                      "workers'": 1,
                      'chowders': 1,
                      'SD': 1,
                      'Rodney-The': 1,
                      'entourage': 1,
                      'administrator': 1,
                      'simplified': 5,
                      'all-purpose': 1,
                      'liners': 1,
                      'electric': 9,
                      'shots': 2,
                      'participated': 3,
                      'outdoors': 2,
                      'S': 5,
                      'secretaries': 1,
                      ')': 280,
                      'garlic': 2,
                      'Contrary': 2,
                      'attain': 1,
                      'lecture': 2,
                      'corners': 4,
                      'around': 41,
                      'sayonara': 1,
                      'shakers': 1,
                      'understands': 1,
                      'over-all': 5,
                      'Topography': 2,
                      'edges': 19,
                      'Otherwise': 1,
                      'verisimilitude': 1,
                      'S.': 6,
                      'germinate': 2,
                      'Turkish': 2,
                      'verandas': 1,
                      'smoothest': 1,
                      'Sweepstakes': 1,
                      'high-velocity': 1,
                      'Push-ups': 2,
                      'Land': 1,
                      'startling': 1,
                      'clicks': 1,
                      'clothes': 3,
                      'rockbound': 1,
                      'playful': 1,
                      'relative': 4,
                      'qualities': 3,
                      'forums': 1,
                      'tail': 1,
                      'Shortage': 1,
                      'secondary': 1,
                      'ceramic': 3,
                      'released': 4,
                      'ends': 16,
                      'possibilities': 6,
                      'scanning': 3,
                      ...}),
             'humor': Counter({'Without': 1,
                      'vacuum': 1,
                      'read': 8,
                      'dig': 1,
                      'anyhow': 1,
                      'speak': 1,
                      'grow': 2,
                      'house': 13,
                      'company': 3,
                      'barrage': 1,
                      'monies': 1,
                      'darling': 1,
                      'theater': 2,
                      'building': 1,
                      'Maria': 1,
                      'bangish': 1,
                      'generation': 1,
                      'Between': 1,
                      'More': 1,
                      'summoned': 1,
                      'walked': 2,
                      'use': 10,
                      'worse': 2,
                      'killed': 1,
                      'contraptions': 1,
                      'deigned': 1,
                      'experience': 1,
                      'disorderly': 1,
                      'belonged': 2,
                      'served': 3,
                      'vase': 1,
                      'Eddie': 2,
                      'bars': 1,
                      'odious': 1,
                      'tonal': 1,
                      'gardening': 1,
                      'titanic': 1,
                      'trip': 2,
                      'Oddly': 1,
                      'has': 26,
                      'thwarted': 1,
                      "That'll": 1,
                      'thoroughly': 1,
                      "Roylott's": 1,
                      'instances': 1,
                      'pleading': 1,
                      'boosting': 1,
                      'our': 30,
                      'along': 5,
                      'forum': 1,
                      'certain': 8,
                      'God': 1,
                      'Couple': 1,
                      'demanded': 3,
                      'shade': 2,
                      'maximal': 1,
                      'others': 4,
                      'thespians': 1,
                      'editorial': 1,
                      'prowlers': 1,
                      'anyone': 6,
                      'entertain': 1,
                      'tangled': 1,
                      'grew': 2,
                      'imperilled': 1,
                      'scenario': 1,
                      'treacherous': 1,
                      'contributes': 1,
                      'sleeve': 1,
                      'beautiful': 3,
                      'marriages': 1,
                      'incident': 1,
                      'armadillo': 2,
                      'first': 13,
                      'dictator': 1,
                      'admit': 1,
                      'developed': 1,
                      'Mystery': 1,
                      'lapses': 1,
                      'updated': 1,
                      'cadenza': 1,
                      'messenger': 1,
                      'films': 1,
                      'taverns': 1,
                      'adding': 2,
                      'Reichstag': 1,
                      'promises': 1,
                      'constituents': 1,
                      'chortled': 1,
                      'wall': 6,
                      'railway': 1,
                      'chicken': 3,
                      'determine': 2,
                      'Spector': 1,
                      'under': 8,
                      'chaperon': 1,
                      'glance': 2,
                      'kitchen': 10,
                      'amount': 1,
                      'ship': 1,
                      'pressure': 1,
                      'cheek': 1,
                      "you'll": 1,
                      'directed': 1,
                      'Berlin': 1,
                      'insanity': 1,
                      'hear': 8,
                      'man': 21,
                      'martinis': 1,
                      "wouldn't": 5,
                      'Valley': 1,
                      'through': 18,
                      'Yvette': 1,
                      "victim's": 1,
                      'instigation': 1,
                      'Kodaks': 1,
                      'Just': 4,
                      'square': 2,
                      'each': 9,
                      'Downfall': 1,
                      'fashionable': 2,
                      'battle': 2,
                      'Ah': 1,
                      'boon': 1,
                      'motion': 2,
                      'Justice': 1,
                      'so-so': 1,
                      'pavement': 1,
                      "grocer's": 1,
                      'forget': 2,
                      'it': 162,
                      'burden': 1,
                      'humiliated': 1,
                      'cottage': 1,
                      'tongue-in-cheek': 1,
                      'raised': 2,
                      'misty-eyed': 1,
                      'love': 4,
                      'meanings': 1,
                      'hoping': 2,
                      'snapshots': 1,
                      'repeated': 2,
                      'eventually': 3,
                      'Readers': 1,
                      'apron': 1,
                      'Expressionism': 1,
                      'depressed': 2,
                      'Doris': 2,
                      'hard': 4,
                      "Shakespeare's": 1,
                      'At': 7,
                      'eludes': 1,
                      "I'm": 11,
                      'garlic': 1,
                      'focused': 1,
                      'Station': 1,
                      'merit': 1,
                      'Take': 1,
                      'ridiculed': 1,
                      'Those': 3,
                      'entertainments': 1,
                      'merely': 3,
                      'career': 3,
                      'husband': 8,
                      'friends': 1,
                      'Mesta': 1,
                      'slip': 1,
                      'manage': 1,
                      'vindictive': 1,
                      'small': 9,
                      'fuse': 1,
                      'Apollo': 1,
                      'bump': 1,
                      'red': 4,
                      'performances': 1,
                      'understatement': 1,
                      'commonest': 1,
                      'retracted': 1,
                      'hostess': 2,
                      'recently': 4,
                      'earthy': 1,
                      'Scandal': 2,
                      'distinct': 1,
                      'lenient': 1,
                      "Rossilini's": 1,
                      'babes': 1,
                      'peas': 1,
                      'snapped': 1,
                      'next': 8,
                      'crowed': 1,
                      'screens': 1,
                      'four': 8,
                      'badly': 1,
                      'searching': 2,
                      'really': 6,
                      'slightly': 2,
                      'predictable': 1,
                      'please': 2,
                      'crudely': 1,
                      'rushed': 2,
                      'female': 1,
                      'efforts': 1,
                      'Serbantian': 1,
                      'effect': 5,
                      'selected': 1,
                      'thin': 3,
                      'ad': 3,
                      'falcon': 1,
                      'cell': 3,
                      'cereal': 1,
                      'artists': 1,
                      'family': 6,
                      'imploring': 1,
                      'evening': 5,
                      'Spain': 1,
                      'contradiction': 1,
                      'Tschilwyk': 1,
                      'Replied': 1,
                      'fear': 5,
                      'keen': 1,
                      'rice': 2,
                      'These': 2,
                      'Cal': 1,
                      'late': 5,
                      'indefinity': 1,
                      'Communism': 1,
                      'Possibly': 1,
                      'personalities': 2,
                      'ask': 2,
                      'lemon-meringue': 1,
                      'flautist': 1,
                      'found': 12,
                      'rebuffed': 1,
                      'triumph': 1,
                      'symptoms': 1,
                      'demonstrable': 1,
                      'Nevertheless': 1,
                      'Major': 2,
                      "one's": 1,
                      ':': 40,
                      'pretty': 3,
                      'coverlet': 1,
                      'tangos': 1,
                      'mother': 12,
                      'things': 27,
                      'Now': 14,
                      "mayor's": 1,
                      'Abstract': 1,
                      'can': 16,
                      'department': 3,
                      'sees': 1,
                      'defines': 1,
                      'divided': 1,
                      'suppose': 3,
                      'charging': 1,
                      'scandal': 1,
                      'afghan': 1,
                      'sir': 2,
                      'underwear': 1,
                      'Yorker': 1,
                      'Don': 1,
                      'larder': 1,
                      'constituted': 1,
                      'activities': 1,
                      'German': 1,
                      'leave': 1,
                      'York': 3,
                      'scream': 1,
                      'reporters': 1,
                      'importance': 1,
                      'managing': 1,
                      'power': 7,
                      'gushed': 1,
                      'till': 2,
                      'cheers': 1,
                      'any': 18,
                      'stiffened': 1,
                      'painstakingly': 1,
                      'ordinarily': 1,
                      'reliably': 1,
                      'Fing': 2,
                      'observe': 1,
                      'required': 1,
                      'talent': 1,
                      'toes': 2,
                      'Lord': 1,
                      'housewives': 1,
                      'years': 21,
                      'curiously': 1,
                      'drink': 5,
                      'committing': 1,
                      'Coopers': 1,
                      'run': 4,
                      'discussed': 2,
                      'persons': 2,
                      'entitle': 1,
                      'Barrymores': 1,
                      'indelible': 1,
                      'haunted': 1,
                      'impatience': 1,
                      'playwright': 1,
                      'poised': 1,
                      'themselves': 4,
                      'guest': 1,
                      'send': 4,
                      'music': 4,
                      'visit': 3,
                      'moral': 1,
                      'reactionary': 1,
                      'fallen': 1,
                      'fine': 2,
                      'remainder': 1,
                      'replacing': 1,
                      'Santa': 1,
                      'dropped': 1,
                      'disabuse': 1,
                      'Good': 1,
                      'victim': 1,
                      'speaks': 1,
                      'berth': 1,
                      'reader': 1,
                      'insinuation': 1,
                      'clean': 2,
                      'symbolizing': 2,
                      'her': 62,
                      'pulled': 1,
                      'continuous': 1,
                      'role': 4,
                      'longer': 2,
                      'sing': 1,
                      'strange': 1,
                      'framed': 1,
                      'accents': 1,
                      'sentences': 1,
                      'band': 1,
                      'visited': 3,
                      'loaded': 1,
                      'steam': 2,
                      'disposed': 2,
                      'Also': 2,
                      'radio': 3,
                      'myriad': 1,
                      'ruse': 1,
                      'engagingly': 1,
                      "mother's": 3,
                      'Some': 1,
                      'dove': 1,
                      'Oscar': 1,
                      'due': 1,
                      'grand': 1,
                      'guys': 1,
                      'cut': 4,
                      'formation': 1,
                      'shrewdly': 1,
                      'plea': 2,
                      'appearing': 1,
                      'precious': 1,
                      'Russian': 1,
                      'checked': 1,
                      'serve': 1,
                      'meadow': 1,
                      'problem': 5,
                      'It-wit': 1,
                      'distance': 2,
                      'chancellor': 1,
                      'pegboard': 6,
                      'stage': 8,
                      'tongue-twister': 1,
                      "bee's": 1,
                      'looking': 2,
                      'iniquitous': 1,
                      'hubris': 1,
                      'sugar': 1,
                      'sandpaper': 1,
                      'stack': 1,
                      'British': 2,
                      'Time': 1,
                      'yapping': 1,
                      'livelihood': 1,
                      'heyday': 1,
                      'astounding': 1,
                      'plumber': 1,
                      'exclusive': 1,
                      'sentimental': 1,
                      'Minute': 1,
                      'bottle': 2,
                      'farmed': 1,
                      'thrills': 1,
                      'Usually': 1,
                      'realized': 2,
                      'same': 7,
                      'dragged': 2,
                      'possess': 3,
                      'finances': 1,
                      'piece': 4,
                      'relict': 1,
                      'hardly': 3,
                      'jimmied': 1,
                      'committees': 1,
                      'Marquess': 1,
                      'Belle': 1,
                      "What's": 2,
                      'pangs': 1,
                      'urgent': 1,
                      'inspiring': 1,
                      'greater': 2,
                      'therefore': 2,
                      'remain': 1,
                      'Eh': 1,
                      'Pickfair': 1,
                      'single-handedly': 1,
                      'think': 14,
                      'could': 30,
                      'Smith': 1,
                      'winking': 2,
                      'stripes': 1,
                      'rich': 2,
                      'stopped': 2,
                      'repressed': 1,
                      'blissfully': 1,
                      'pride': 2,
                      'Franklin': 1,
                      'compulsion': 2,
                      'remarkably': 1,
                      'tapping': 1,
                      'darkened': 1,
                      'reread': 1,
                      'Juan': 1,
                      'reflexes': 1,
                      'ran': 1,
                      'littered': 1,
                      'Heywood': 1,
                      'meant': 2,
                      'objets': 1,
                      'ivory': 5,
                      'inventory': 1,
                      "devil's-food": 1,
                      'picture': 7,
                      'Multiple': 1,
                      'deities': 1,
                      'readable': 1,
                      'performance': 5,
                      'count': 4,
                      'equally': 4,
                      'between': 3,
                      'rare': 2,
                      'woodwork': 1,
                      'blonde': 1,
                      'collapsing': 1,
                      'host': 5,
                      'Mad': 2,
                      'Gold': 1,
                      'intimate': 1,
                      'collation': 1,
                      'report': 1,
                      'solar': 1,
                      'subscription': 1,
                      'appear': 2,
                      'metal': 1,
                      'masterpiece': 1,
                      'Regiment': 1,
                      'World': 1,
                      'inhabitants': 1,
                      'wagon': 2,
                      "Bradley's": 1,
                      'Fascio-Communist': 1,
                      'matrimonial': 1,
                      'eyes': 10,
                      'filled': 1,
                      'simple': 1,
                      'petting': 1,
                      'Soak': 1,
                      'culprit': 1,
                      'Hastening': 1,
                      'Senator': 2,
                      'fruit': 2,
                      'remind': 1,
                      'Tasti-Freeze': 1,
                      'lit': 1,
                      'favorite': 1,
                      'oughta': 1,
                      'wither': 1,
                      'told': 20,
                      'they': 70,
                      'done': 13,
                      'panels': 1,
                      'instructed': 1,
                      'banks': 1,
                      'may': 8,
                      'paraphrase': 2,
                      'Best': 1,
                      'maxim': 1,
                      'portion': 1,
                      'bowl': 1,
                      'existentialist': 1,
                      'Pioneers': 1,
                      'convertible': 1,
                      'law': 2,
                      'abroad': 1,
                      'walrus': 1,
                      'autos': 1,
                      'twofold': 1,
                      'stained': 1,
                      'charming': 2,
                      'quarter': 2,
                      'police': 7,
                      'visiting': 2,
                      'acting': 1,
                      'observed': 1,
                      'state': 5,
                      'White': 1,
                      'felt': 7,
                      'private': 3,
                      'beaten': 1,
                      'separate': 1,
                      'bold': 1,
                      'actors': 1,
                      'lived': 2,
                      'climbed': 1,
                      'Horror': 1,
                      'applejack': 1,
                      'autumn': 1,
                      'lack': 2,
                      "ladies'": 2,
                      'maid': 2,
                      'over-spent': 1,
                      'facts': 1,
                      'Travancore': 1,
                      'musical': 1,
                      'antic': 1,
                      'Tibetan': 1,
                      'L.': 1,
                      "Letch's": 2,
                      'peculiar': 1,
                      'stumbles': 1,
                      'too': 19,
                      'balkiness': 1,
                      'Siberia': 1,
                      'bulwark': 1,
                      'bathtubs': 1,
                      'aspect': 1,
                      'funny': 17,
                      'homosexual': 1,
                      'staircase': 1,
                      'entry': 1,
                      "America's": 1,
                      'outside': 3,
                      'hung': 4,
                      'faces': 1,
                      'language': 1,
                      'book': 3,
                      'sprue': 1,
                      'following': 2,
                      'suggested': 2,
                      'helped': 2,
                      'harshly': 1,
                      'unidentified': 1,
                      'bearer': 1,
                      'instantly': 1,
                      'bound': 1,
                      'homesteads': 1,
                      'aided': 1,
                      'occupants': 1,
                      'above': 2,
                      'endlessly': 1,
                      'heartily': 2,
                      'give': 2,
                      'clumsy': 1,
                      'Hubba': 1,
                      'misfortune': 1,
                      'Thus': 2,
                      'Evidently': 1,
                      'implications': 1,
                      'enormous': 1,
                      'upset': 1,
                      'dishes': 3,
                      'practice': 1,
                      'minute': 1,
                      'comedians': 1,
                      'Coughlin': 1,
                      'bothering': 1,
                      'Richard': 1,
                      'operator': 2,
                      'horns': 1,
                      'obsessed': 1,
                      'Sanitary': 1,
                      'tells': 2,
                      'identified': 2,
                      'boost': 1,
                      'fragment': 1,
                      'wander': 1,
                      'missing': 2,
                      'actually': 2,
                      'Shapes': 1,
                      'precocious': 1,
                      'screenland': 1,
                      'explanation': 1,
                      'replied': 3,
                      'preserve': 1,
                      'barbarian': 1,
                      'N': 2,
                      'after': 19,
                      'questioning': 1,
                      'thought': 14,
                      'Diet': 1,
                      'employed': 1,
                      'world-renowned': 1,
                      'vanishes': 1,
                      'Hollywood': 5,
                      'sculpture': 2,
                      'Musee': 1,
                      'cousin': 1,
                      'Our': 2,
                      'Perhaps': 4,
                      'confused': 3,
                      'big': 3,
                      'unwittingly': 1,
                      'slips': 3,
                      'inspect': 2,
                      'clearly': 4,
                      'attempted': 1,
                      'wooden': 3,
                      'funnier': 1,
                      'cold': 3,
                      'ambiguities': 1,
                      'financial': 2,
                      'opened': 2,
                      'left': 8,
                      'hats': 1,
                      'illegitimate': 1,
                      'utterly': 1,
                      'appealed': 1,
                      'mere': 2,
                      'fate': 2,
                      'insults': 1,
                      'warm': 2,
                      'return': 3,
                      'clergyman': 1,
                      'stood': 1,
                      'unfortunately': 1,
                      'drinking': 2,
                      'jest': 1,
                      'glasses': 1,
                      'street': 2,
                      'saner': 1,
                      'sins': 1,
                      'current': 1,
                      'View': 1,
                      'bitten': 1,
                      'slob': 2,
                      'worth': 1,
                      'fill': 2,
                      'scimitar': 1,
                      'squat': 1,
                      'called': 15,
                      'town': 3,
                      'now': 14,
                      'non-time': 1,
                      'puzzled': 2,
                      'robber': 1,
                      'smart': 1,
                      'Rattzhenfuut': 1,
                      'dramatic': 1,
                      'vacated': 1,
                      'faint': 2,
                      'Meinckian': 1,
                      'suddenly': 1,
                      'Interlude': 1,
                      'Sponge': 1,
                      'gets': 2,
                      'bilharziasis': 1,
                      'lay': 2,
                      'workshop': 4,
                      'definite': 1,
                      'loss': 2,
                      'three': 6,
                      'shut': 1,
                      'alas': 1,
                      'demand': 1,
                      'stock': 1,
                      'musicals': 1,
                      'rooms': 2,
                      'bout-de-souffle': 1,
                      'San': 2,
                      'industrial': 1,
                      'pure': 1,
                      'Anna': 1,
                      'nervous': 3,
                      'choice': 1,
                      'Fredrico': 1,
                      'reluctant': 1,
                      'stretched': 1,
                      'creamed': 1,
                      'stands': 1,
                      'detective': 1,
                      'cafeteria': 1,
                      'typed': 1,
                      'gag': 2,
                      'curvaceously': 1,
                      'interweaving': 1,
                      'murders': 1,
                      'personal': 4,
                      'difficulty': 2,
                      'Carroll': 1,
                      'remarking': 1,
                      'recorded': 1,
                      'seriously': 2,
                      'Nebraska': 1,
                      'Margo': 1,
                      'afternoons': 1,
                      'present': 3,
                      'describes': 1,
                      'leitmotiv': 1,
                      'statue': 1,
                      'science': 1,
                      'resting': 1,
                      'Noel': 1,
                      'hit': 1,
                      'be': 78,
                      'beaches': 2,
                      'must': 9,
                      'names': 5,
                      'helpful': 1,
                      'ready': 3,
                      'kissing': 1,
                      'guided': 1,
                      'acknowledged': 1,
                      'outfit': 1,
                      'an': 75,
                      'abundance': 1,
                      'France': 1,
                      'enthusiasm': 1,
                      'heap': 1,
                      'questioned': 1,
                      'jail': 3,
                      'expected': 1,
                      'garlanded': 1,
                      'minutes': 4,
                      'Fogg': 1,
                      "He's": 2,
                      'Saracens': 1,
                      'leaping': 1,
                      'friendly': 1,
                      'afternoon': 3,
                      'indistinguishable': 1,
                      'central': 1,
                      'carefully': 2,
                      'cartons': 1,
                      'soloist': 1,
                      'jocular': 2,
                      'Salamander': 1,
                      'residence': 1,
                      'bantered': 1,
                      'dignity': 1,
                      'equivocal': 1,
                      'force': 1,
                      'scenic': 1,
                      'youth': 2,
                      'proper': 2,
                      'throw': 4,
                      'blossomed': 1,
                      'middle-Gaelic': 1,
                      'recapitulate': 1,
                      'collector': 1,
                      'dinners': 2,
                      'trail': 1,
                      'catch': 1,
                      'alone': 2,
                      'humans': 1,
                      'commercially': 2,
                      'rackets': 1,
                      'dumping': 1,
                      'troops': 1,
                      'unrelieved': 1,
                      'ransack': 1,
                      'breathe': 2,
                      'Haumd': 1,
                      'into': 35,
                      'vowed': 1,
                      'exterminate': 1,
                      'question': 4,
                      'rostrum': 1,
                      'haul': 1,
                      'policemen': 2,
                      'address': 2,
                      'else': 7,
                      'V': 1,
                      'cult': 1,
                      'local': 2,
                      'Nixon': 1,
                      'let': 5,
                      'doweling': 1,
                      'unborn': 1,
                      'devout': 1,
                      'gave': 5,
                      'becoming': 1,
                      'apparent': 2,
                      'inhumane': 1,
                      'seeks': 1,
                      'grown': 2,
                      'managed': 1,
                      'Hughes': 1,
                      'coyness': 1,
                      'murder': 5,
                      'existed': 1,
                      'because': 16,
                      'abstract': 1,
                      'sweeping': 1,
                      'cryptic': 1,
                      'pink': 1,
                      'hours': 3,
                      'gagline': 1,
                      'Bill': 2,
                      'variety': 2,
                      'Doors': 1,
                      'involved': 2,
                      '``': 343,
                      'short': 3,
                      'apprehended': 1,
                      'glad': 1,
                      'tragicomic': 1,
                      'arranged': 1,
                      'wait': 3,
                      'chair': 2,
                      'stardom': 2,
                      "Crombie's": 1,
                      'electric': 2,
                      'shots': 1,
                      'second': 6,
                      'absurd': 1,
                      ')': 31,
                      'forgive': 1,
                      'pinball': 1,
                      'rediscovery': 1,
                      'gage': 1,
                      'many': 15,
                      'corners': 1,
                      'around': 15,
                      'comprises': 1,
                      'spite': 2,
                      'followed': 3,
                      'reached': 4,
                      'genius': 1,
                      'gourmet': 2,
                      "I'll": 4,
                      'unreal': 1,
                      'stupefying': 1,
                      'According': 1,
                      'derision': 1,
                      'Furiouser': 1,
                      'neurotic': 1,
                      'secretary': 2,
                      'refused': 4,
                      'about': 36,
                      'suspended': 1,
                      'indefinitely': 1,
                      'Kline': 1,
                      'clothes': 1,
                      'laughingly': 1,
                      'indefinite': 4,
                      'playful': 1,
                      'unpaintable': 1,
                      'Callas': 1,
                      'dies': 1,
                      'dispose': 1,
                      'feat': 1,
                      'somehow': 1,
                      'argument': 1,
                      'released': 1,
                      'ends': 1,
                      'convulsed': 1,
                      'outer': 2,
                      'Such': 5,
                      'attracted': 1,
                      'letter': 4,
                      'Excuse': 1,
                      'proportions': 1,
                      'Ye': 1,
                      'make': 13,
                      'another': 11,
                      'disappearing': 1,
                      'from': 59,
                      'devilish': 1,
                      'sweet': 2,
                      'collapse': 1,
                      'sensed': 1,
                      'built': 3,
                      'speaker': 1,
                      'Ever': 1,
                      'Toynbee': 1,
                      'park': 3,
                      'affronted': 1,
                      'Pollock': 1,
                      'availed': 2,
                      'area': 4,
                      'Mother': 12,
                      'systems': 1,
                      'naturally': 1,
                      'studied': 1,
                      'twenty-five': 2,
                      'philosophic': 1,
                      'appreciate': 2,
                      'parent-teacher': 1,
                      'prancing': 1,
                      'Steuben': 1,
                      'forgotten': 3,
                      'fellow': 3,
                      'much-needed': 1,
                      'probably': 5,
                      'frequently': 1,
                      'During': 3,
                      'porch': 1,
                      'tension': 1,
                      'ahead': 1,
                      'speech': 1,
                      'sculptures': 2,
                      'tasks': 1,
                      'wry': 1,
                      'borrow': 1,
                      'danced': 1,
                      'speeches': 1,
                      'drunkenly': 1,
                      'coral-colored': 1,
                      'or': 66,
                      'Law': 1,
                      'states': 1,
                      'romance': 2,
                      'destruction': 2,
                      'Khrushchev': 1,
                      'English': 13,
                      'playing': 5,
                      'sexy': 1,
                      'city': 3,
                      'balding': 1,
                      'chemistry': 1,
                      'sex': 1,
                      'fat': 1,
                      'increased': 2,
                      'original': 2,
                      'actress': 2,
                      'stomach': 3,
                      'fairy-tale': 1,
                      'revolt': 1,
                      'dentist': 1,
                      'shops': 1,
                      'mud': 2,
                      'deal': 2,
                      'Vera': 1,
                      'consciousness': 1,
                      'plans': 1,
                      'duly': 1,
                      'reaches': 1,
                      'Skolkau': 1,
                      'lumbar': 1,
                      'arenas': 1,
                      'general': 2,
                      'unearthed': 1,
                      'sidle': 1,
                      'invite': 1,
                      'kicking': 1,
                      'heart': 14,
                      'saleslady': 3,
                      'shop': 1,
                      'French': 6,
                      'artistically': 1,
                      'anywhere': 1,
                      'combat': 1,
                      'notes': 1,
                      'Miss': 8,
                      'skillful': 1,
                      'conscientious': 1,
                      'Chalmers': 1,
                      'inexorable': 1,
                      'opinions': 1,
                      'dismay': 1,
                      'amid': 2,
                      'urges': 1,
                      'came': 12,
                      'snobbishly': 1,
                      'Felix': 1,
                      'plumbed': 1,
                      'year': 8,
                      'independence': 1,
                      'reverse': 1,
                      'victory': 1,
                      'divinity': 1,
                      'adjusted': 1,
                      'Too': 1,
                      'both': 9,
                      'painted': 1,
                      'Radio': 1,
                      'vary': 1,
                      'somersaulting': 1,
                      "murderer's": 1,
                      'window': 3,
                      'Classified': 1,
                      'jaded': 1,
                      'co-star': 1,
                      "Marshall's": 1,
                      'organized': 1,
                      ...}),
             'learned': Counter({'laden': 1,
                      'elaborates': 1,
                      'builders': 6,
                      'dig': 1,
                      '23': 10,
                      'Homeric': 15,
                      'speak': 13,
                      'located': 14,
                      'economics': 2,
                      'house': 20,
                      'Abelson': 1,
                      'polymeric': 2,
                      'promises': 2,
                      'adultery': 1,
                      'indispensible': 1,
                      'building': 18,
                      'waters': 4,
                      'Between': 1,
                      'More': 22,
                      'monomer': 1,
                      'suds': 6,
                      'restrain': 1,
                      'use': 120,
                      'suffixes': 1,
                      'killed': 1,
                      'experience': 53,
                      'Cumulative': 1,
                      'endure': 2,
                      'served': 17,
                      'appendages': 1,
                      'irradiation': 9,
                      'population': 64,
                      'warrants': 2,
                      'initiator': 2,
                      'all-over': 1,
                      'nomination': 1,
                      'pleading': 2,
                      'Systems': 3,
                      'Entry': 1,
                      'forum': 1,
                      'fluctuating': 1,
                      'wages': 9,
                      'zeal': 1,
                      'K': 7,
                      'demanded': 5,
                      'vicinity': 1,
                      'Wait': 5,
                      'mercy': 3,
                      'Laboratory': 3,
                      'chancellor': 1,
                      'beautiful': 2,
                      'averaged': 10,
                      'sixth': 5,
                      'cerebellum': 1,
                      'marriages': 9,
                      'programming': 4,
                      'denying': 1,
                      'Believing': 1,
                      'sorted': 1,
                      "supervisor's": 1,
                      'intrapulmonary': 1,
                      'constituents': 4,
                      'chicken': 6,
                      'materially': 1,
                      'shrapnel': 2,
                      'cups': 1,
                      'quiescent': 1,
                      'amount': 52,
                      'paleoexplosion': 1,
                      'pressure': 68,
                      'Short': 1,
                      'amended': 1,
                      '76.7': 1,
                      'legitimacy': 1,
                      'Input/Output': 4,
                      'evils': 1,
                      "wouldn't": 1,
                      'proportionally': 1,
                      'through': 141,
                      '10-foot': 1,
                      'accomplishes': 1,
                      'Just': 6,
                      'specific': 51,
                      'deemed': 5,
                      'Because': 13,
                      'ascertain': 1,
                      'Justice': 13,
                      'emission': 31,
                      'expanse': 1,
                      'surreptitiously': 1,
                      'vegetable': 1,
                      'lookup': 7,
                      'resolute': 1,
                      'W.G.': 1,
                      "yesterday's": 1,
                      'Democrats': 1,
                      'behave': 3,
                      'cottage': 1,
                      'raised': 17,
                      'fertility': 8,
                      'meanings': 14,
                      'stream': 18,
                      'revealed': 9,
                      'Ernst': 3,
                      'grave': 6,
                      'pathogenic': 2,
                      'sprung': 3,
                      'eternal': 4,
                      'epitomized': 1,
                      'tacitly': 1,
                      'experimenter': 3,
                      'Fractions': 1,
                      'At': 48,
                      'roundup': 1,
                      ')': 800,
                      'Poynting-Robertson': 5,
                      'stimulated': 1,
                      'mastery': 1,
                      'Morphophonemic': 1,
                      'accumulates': 1,
                      'exigencies': 1,
                      'Those': 6,
                      'surprisingly': 3,
                      'fibrous': 4,
                      'recurred': 1,
                      'prophets': 1,
                      'hesitant': 2,
                      'subdivisions': 1,
                      'thyrotoxic': 1,
                      'sands': 3,
                      'slip': 2,
                      'negative': 23,
                      "patient's": 13,
                      'indexes': 2,
                      'underlie': 2,
                      'small': 107,
                      'Ruling': 4,
                      'centrifuged': 7,
                      'enemy': 14,
                      'Typically': 1,
                      'directed': 11,
                      'quo': 2,
                      'revolutions': 2,
                      'knife': 36,
                      'perceptual': 3,
                      'Polaroid': 1,
                      'tournament': 2,
                      'enforced': 4,
                      'Removal': 1,
                      'strike': 10,
                      "Congress'": 2,
                      'four': 46,
                      'e.g.': 14,
                      'really': 18,
                      'Laying': 1,
                      'singled': 2,
                      'substrate': 23,
                      'lends': 1,
                      'Geiger': 1,
                      'comforting': 1,
                      'Finnsburg': 1,
                      'signals': 2,
                      'effect': 67,
                      'shielding': 1,
                      "growers'": 1,
                      'DUF': 5,
                      'smoother': 2,
                      'memory-images': 2,
                      'non-existent': 1,
                      'King': 6,
                      'lanthanum': 1,
                      'artists': 17,
                      'refund': 12,
                      'constant': 26,
                      'Stoics': 1,
                      'confronting': 1,
                      'evening': 2,
                      'Thyroglobulin': 1,
                      'hymn': 3,
                      'relax': 9,
                      '43*0C.': 2,
                      'Bureau': 3,
                      'wives': 2,
                      'retrieved': 7,
                      'tangents': 6,
                      'Annual': 1,
                      'assets': 9,
                      'ask': 5,
                      'found': 117,
                      'contingent-fee': 2,
                      'Nakamura': 3,
                      'statutory': 7,
                      'carbonates': 1,
                      'excretion': 1,
                      'sixty-one': 1,
                      'ethos': 1,
                      'mother': 6,
                      'Caribbean': 2,
                      'Chiang': 1,
                      'things': 36,
                      'powders': 4,
                      'Abstract': 2,
                      'capacities': 2,
                      'independent': 28,
                      'corporeal': 1,
                      'abrogated': 1,
                      'Boeing': 1,
                      'introjects': 1,
                      'feed': 13,
                      'nickel-iron': 1,
                      'bits': 3,
                      'transported': 1,
                      "Post's": 1,
                      '18': 13,
                      'hr.': 4,
                      'x': 2,
                      'stops': 1,
                      'operand': 16,
                      'gadgetry': 1,
                      'molten': 2,
                      'Verdi': 1,
                      'academics': 2,
                      'pricing': 2,
                      'chap.': 3,
                      'confabulation': 2,
                      'German': 2,
                      'survive': 4,
                      'antithyroid': 5,
                      'vehicles': 3,
                      '1959': 31,
                      'nucleated': 1,
                      'Routine': 2,
                      'yielded': 6,
                      'pierced': 2,
                      'shadow': 1,
                      'generalized': 7,
                      'fabric': 8,
                      'dramatizing': 1,
                      'Loomis': 4,
                      'Exceptional': 1,
                      'hogs': 1,
                      'narrowing': 2,
                      'Cox': 1,
                      'tropocollagen': 1,
                      'viability': 1,
                      'outset': 3,
                      'dying': 2,
                      'automatically': 10,
                      'milestone': 1,
                      'constructing': 2,
                      'Merchant': 2,
                      'Grabbing': 1,
                      'conceivable': 3,
                      '463-degrees-C': 1,
                      '1961': 9,
                      'strip': 3,
                      'breadth': 1,
                      'hunting': 1,
                      'Resolved': 4,
                      'run': 13,
                      'cocao': 1,
                      'Saturn': 3,
                      'equity': 1,
                      'abdominis': 1,
                      'importantly': 2,
                      'entitle': 1,
                      'unconcerned': 1,
                      'accelerometer': 17,
                      'high-positive': 1,
                      'themselves': 35,
                      'audited': 1,
                      'Niccolo': 1,
                      'Supervisor': 1,
                      'Woven': 2,
                      'diminish': 3,
                      'humanist': 2,
                      'speaks': 3,
                      'pulled': 3,
                      'thousand-fold': 1,
                      'domestic': 15,
                      'novelty': 2,
                      'soften': 2,
                      'loaded': 4,
                      'radio': 28,
                      'coat': 4,
                      'treaties': 2,
                      'scales': 2,
                      'Frederick': 1,
                      'due': 45,
                      'attainment': 1,
                      'seeming': 2,
                      'vocalic': 1,
                      'marry': 2,
                      'proviso': 1,
                      'lexicostatistics': 4,
                      'relativistic': 1,
                      'problem': 82,
                      'turrets': 1,
                      'pulse': 1,
                      '1930': 3,
                      'percentage': 6,
                      'Rhine': 1,
                      'decompose': 1,
                      'looking': 7,
                      'dictate': 1,
                      'Journal': 13,
                      'Belgium': 1,
                      'amphibious': 1,
                      'moderately': 3,
                      'clever': 1,
                      'Time-servers': 1,
                      'picnics': 1,
                      'delegation': 1,
                      'sentimental': 1,
                      'bolster': 1,
                      '22%': 1,
                      "beef's": 1,
                      'determination': 10,
                      'microscope': 2,
                      'slowly': 10,
                      '2.26': 1,
                      'transcendence': 1,
                      'neuroses': 4,
                      'frail': 1,
                      "'?'": 1,
                      'atmosphere': 20,
                      'gulf': 1,
                      '1859': 8,
                      'announcements': 1,
                      'equal': 29,
                      'instrument': 15,
                      'pork': 6,
                      'Especially': 1,
                      'unleveled': 1,
                      'Lawrence': 3,
                      'triumphant': 1,
                      'censuses': 3,
                      'rich': 6,
                      'spurious': 1,
                      'pride': 5,
                      'Franklin': 2,
                      'advocates': 1,
                      'semi-major': 2,
                      'rites': 1,
                      'inflexible': 1,
                      'monopolize': 1,
                      'reflexes': 3,
                      'ran': 3,
                      'optimism': 1,
                      '2-degrees-C': 1,
                      'unmanageably': 1,
                      'Refund': 1,
                      'tiny': 1,
                      'Rottger': 2,
                      'nemesis': 2,
                      'inventory': 1,
                      'mechanochemically': 1,
                      'phosphors': 1,
                      'Yorkers': 1,
                      'butyl-lithium': 1,
                      'indium': 1,
                      'A.M.': 1,
                      'adapted': 3,
                      'hypothalamus': 18,
                      'colossal': 1,
                      'buildings': 8,
                      'between': 191,
                      'woodwork': 2,
                      'operands': 1,
                      'monks': 3,
                      'lifeless': 1,
                      'solar': 8,
                      'sucess': 1,
                      'involutorial': 3,
                      'Jackman': 1,
                      'improve': 8,
                      'Tasmania': 1,
                      'cut-and-dried': 1,
                      'trigonal': 2,
                      'companions': 1,
                      'higher': 42,
                      'all-Negro': 6,
                      'willow': 7,
                      'foliage': 1,
                      'Alpha': 5,
                      'Stanley': 1,
                      'residences': 1,
                      'decomposing': 1,
                      'firmly': 7,
                      'Until': 6,
                      'deplorable': 1,
                      'Achievement': 1,
                      'meets': 23,
                      'movers': 1,
                      'amplified': 4,
                      'paradoxically': 1,
                      'dismissed': 1,
                      'universality': 2,
                      'told': 14,
                      'composers': 1,
                      's-values': 1,
                      'greatness': 1,
                      'Transportation': 1,
                      'The': 1458,
                      'heat': 27,
                      'D.': 10,
                      'gases': 1,
                      'Hoijer': 3,
                      'Raymond': 2,
                      'spray-dried': 1,
                      'of': 7418,
                      'restrictions': 5,
                      'law': 85,
                      'legibility': 1,
                      'Asher': 1,
                      '10.6%': 1,
                      'red-bellied': 1,
                      "2'": 1,
                      'Gertrude': 1,
                      'police': 5,
                      'visiting': 4,
                      'alienates': 1,
                      'corner': 15,
                      'vocalize': 1,
                      'directivity': 1,
                      'pregnancy': 4,
                      'circumstance': 2,
                      'protest': 1,
                      'self-confidence': 1,
                      'dispersed': 5,
                      'totality': 1,
                      "ladies'": 1,
                      'monograph': 1,
                      'facts': 12,
                      'thyrotrophin': 1,
                      'unusually': 1,
                      'announcing': 1,
                      'peculiar': 5,
                      'well': 122,
                      'linguists': 6,
                      'inciting': 1,
                      'factory': 9,
                      'entry': 9,
                      'overhead': 3,
                      'two-step': 1,
                      'outside': 23,
                      '$230,000': 1,
                      'arc': 31,
                      'interruptions': 1,
                      'book': 13,
                      'unrest': 1,
                      'linguistics': 2,
                      'Towards': 2,
                      'mentions': 1,
                      'refrigeration': 10,
                      'nightly': 1,
                      'blame': 2,
                      'EQU': 5,
                      'knowingly': 1,
                      'precaution': 1,
                      'aligned': 5,
                      'presents': 6,
                      'optics': 1,
                      'give': 56,
                      'membership': 4,
                      'dishes': 1,
                      'treason': 3,
                      'tape': 16,
                      'bushels': 1,
                      'Hollingshead': 1,
                      'over-hand': 1,
                      'inhibition': 5,
                      'Emission': 1,
                      'formalities': 1,
                      'drugstore': 5,
                      'diverse': 5,
                      'applying': 12,
                      'critical': 17,
                      'anhydrously': 1,
                      '1159': 1,
                      'anticipations': 1,
                      'withholding': 1,
                      'missing': 3,
                      'economist': 1,
                      'explanation': 17,
                      'softened': 1,
                      'N': 27,
                      'after': 95,
                      'conjectures': 1,
                      'questioning': 2,
                      'reached': 25,
                      'nectar': 2,
                      'response': 30,
                      'homemaker': 2,
                      'bargain': 2,
                      'authority': 16,
                      'Perhaps': 4,
                      '1844': 2,
                      'fitness': 3,
                      'punctuated': 1,
                      'football': 2,
                      'strikingly': 4,
                      'big': 12,
                      'Ghoreyeb': 2,
                      'Fowler': 1,
                      'long-term': 11,
                      'quantities': 5,
                      'arresting': 2,
                      '180-degrees': 1,
                      'fills': 2,
                      'attempted': 7,
                      'completely': 23,
                      'C': 74,
                      'cores': 3,
                      'cold': 16,
                      'opened': 6,
                      'Cambridge': 2,
                      'radioactive': 4,
                      'illegitimate': 2,
                      'ambitiously': 1,
                      'twenty-three': 1,
                      'Cattle': 1,
                      'prechlorination': 1,
                      'stood': 7,
                      'unfortunately': 4,
                      'deck': 2,
                      'lend': 2,
                      'street': 5,
                      'diagonalizable': 14,
                      'utmost': 1,
                      '8-mm': 1,
                      'adjectives': 3,
                      'Redevelopment': 1,
                      'mV': 1,
                      'proliferation': 1,
                      'town': 15,
                      '10-degrees-C': 1,
                      'Taft-Hartley': 2,
                      'laxness': 1,
                      'multipactor': 2,
                      'Column': 7,
                      'Boris': 10,
                      'commonplace': 3,
                      "Street's": 1,
                      'Maser': 1,
                      "l'Independance": 1,
                      'gets': 11,
                      'high-density': 1,
                      'inserts': 1,
                      'three': 103,
                      'shortsighted': 2,
                      'declaring': 1,
                      'wage': 45,
                      'concerted': 2,
                      'virtue': 4,
                      'pure': 14,
                      'organize': 6,
                      'extended': 22,
                      'recount': 1,
                      'JEDEC': 1,
                      'pursuit': 6,
                      'patiently': 1,
                      'motives': 1,
                      'beast': 1,
                      'resignations': 1,
                      'continuity': 4,
                      'forceful': 1,
                      'forge': 4,
                      'oceanography': 2,
                      'murders': 1,
                      'eerie': 1,
                      'pavement': 1,
                      'over-simplification': 1,
                      'A.': 17,
                      'finely': 2,
                      'airmail': 3,
                      'Martinique': 1,
                      'grows': 3,
                      "Whipple's": 1,
                      'carrots': 1,
                      'receives': 2,
                      'be': 1363,
                      'wealth': 1,
                      'names': 23,
                      'tarnished': 1,
                      '1875': 4,
                      'Ten': 3,
                      'transference': 2,
                      'transforms': 3,
                      'France': 1,
                      'posing': 1,
                      'Plastics': 1,
                      'orderliness': 1,
                      'conclusion': 26,
                      'reasons': 21,
                      'uncompromising': 1,
                      'invent': 1,
                      'minutes': 30,
                      'Middle-South': 3,
                      'Agriculture': 1,
                      'remote': 12,
                      'intertwined': 1,
                      'confabulations': 1,
                      'Temperatures': 1,
                      'joined': 4,
                      'loan': 4,
                      'DiGiorgio': 1,
                      'photomicrograph': 1,
                      'farfetched': 1,
                      'sharpened': 2,
                      'afternoon': 1,
                      'Money': 3,
                      'attrition': 3,
                      'entirety': 4,
                      'commercial': 25,
                      'pitfalls': 1,
                      'exceptional': 8,
                      'central': 28,
                      'steam-generation': 1,
                      'psychosomatic': 1,
                      'probes': 3,
                      'bully': 1,
                      'mergers': 1,
                      '362': 1,
                      'force': 63,
                      'demineralization': 1,
                      'employ': 3,
                      'replaced': 11,
                      'youth': 9,
                      'prevented': 7,
                      'coincide': 7,
                      'Transit': 1,
                      "N'": 6,
                      'obligations': 6,
                      'Burford': 1,
                      'dinners': 1,
                      'hardness': 1,
                      "child's": 10,
                      'refractory': 1,
                      'alone': 26,
                      'beneath': 4,
                      'compels': 1,
                      'Aerospace': 4,
                      'slugs': 1,
                      'counteract': 3,
                      'into': 222,
                      'principally': 1,
                      'confirms': 2,
                      'affiliated': 1,
                      'deviation': 11,
                      'address': 34,
                      'skipped': 1,
                      'foamed-in-place': 1,
                      'let': 23,
                      'lengthwise': 2,
                      'pertains': 5,
                      'Oder': 3,
                      'localities': 2,
                      'Dufresne': 1,
                      'ingredient': 1,
                      'ascertainable': 1,
                      'necessity': 5,
                      'unworthy': 1,
                      'depredations': 1,
                      'render': 2,
                      'method': 59,
                      'murder': 5,
                      'shakes': 2,
                      'Statistique': 1,
                      'sanctified': 1,
                      'repressive': 1,
                      'appetizing': 2,
                      'unfertilized': 1,
                      'supervisors': 1,
                      '2': 197,
                      'hold': 19,
                      'thiouracil': 3,
                      'violent': 1,
                      'elaborated': 1,
                      'pupils': 10,
                      'rockets': 2,
                      'appraisals': 1,
                      'switch': 17,
                      'incline': 1,
                      'delineating': 2,
                      'administrator': 3,
                      'profit-maximizing': 3,
                      'liners': 1,
                      'electric': 16,
                      'reunion': 1,
                      'shots': 7,
                      'lower-middle': 1,
                      'hypothesis': 11,
                      'absurd': 1,
                      'corners': 4,
                      'volunteered': 1,
                      'force-rate': 1,
                      'wage-rate': 1,
                      'RDW': 2,
                      'infer': 1,
                      'mosaic': 1,
                      'polynomial': 28,
                      'growers': 2,
                      'focused': 4,
                      'Land': 2,
                      'overly': 1,
                      'scorned': 1,
                      'DA': 6,
                      'naked': 1,
                      'commands': 1,
                      '1915': 1,
                      'integrates': 1,
                      'clothes': 4,
                      '0.3M': 1,
                      'amoral': 1,
                      'merest': 1,
                      'Constitution': 2,
                      'qualities': 11,
                      'custody': 1,
                      'converse': 1,
                      'non-pathogenic': 1,
                      'released': 5,
                      'gyro-stabilized': 6,
                      'alleged': 1,
                      'ninth': 5,
                      'chronology': 1,
                      'attracted': 3,
                      'letter': 11,
                      'hide': 3,
                      'Elaborate': 1,
                      'efficiencies': 2,
                      'Junkerdom': 1,
                      'picket': 1,
                      'Meats': 1,
                      'Yearbook': 1,
                      'from': 768,
                      'pseudophloem': 6,
                      'predominantly': 5,
                      'B.C.': 7,
                      'jumble': 2,
                      'ages': 18,
                      'stabilizing-conserving': 1,
                      'demonstrates': 3,
                      'grants': 1,
                      'Catskill': 4,
                      'Corrections': 1,
                      '72': 2,
                      'adjunct': 3,
                      'Humanism': 1,
                      'studied': 16,
                      'mg.': 11,
                      '0.24': 1,
                      're': 1,
                      'recognizable': 2,
                      'years': 87,
                      'sticks': 1,
                      'East': 14,
                      'Aces': 1,
                      'compilation': 8,
                      'burning': 8,
                      'frequently': 18,
                      'During': 32,
                      'brittle': 1,
                      'left-justified': 1,
                      'belch': 1,
                      'tore': 1,
                      'voluntary': 7,
                      'Obviously': 10,
                      'donors': 5,
                      "voice's": 1,
                      'obtrudes': 1,
                      "l'": 2,
                      'unhappy': 3,
                      'destruction': 6,
                      'Wind': 1,
                      'prototypical': 1,
                      'spiraling': 1,
                      'attributes': 7,
                      'sex': 9,
                      'active': 25,
                      'cuckoo-bumblebee': 1,
                      'hosts': 1,
                      'keeping': 7,
                      'imbedded': 1,
                      'easier': 14,
                      'consciousness': 3,
                      'plans': 13,
                      'accordance': 7,
                      'locality': 2,
                      'median': 1,
                      'heart': 7,
                      'primary': 24,
                      'bleak': 2,
                      'national': 36,
                      'wakefulness': 3,
                      'enables': 3,
                      'amenable': 2,
                      'formulae': 3,
                      'authoritarian': 2,
                      'Julio': 1,
                      "lady's": 1,
                      'called': 36,
                      'opinions': 13,
                      '1865': 1,
                      '2500': 1,
                      'reverse': 6,
                      'victory': 1,
                      'p': 6,
                      'merry': 2,
                      'AH6': 1,
                      'Too': 1,
                      'sighted': 2,
                      'agricultural': 7,
                      'susceptible': 1,
                      'G.': 9,
                      'shopping': 8,
                      'evade': 1,
                      'accelerometers': 8,
                      'deals': 7,
                      'subparagraph': 1,
                      'non-wage': 1,
                      'grindings': 1,
                      'organized': 9,
                      'partial': 4,
                      'solution': 18,
                      'demonstrations': 1,
                      'born': 10,
                      'propel': 1,
                      'Viator': 1,
                      'substructure': 1,
                      'chance': 9,
                      'roleplaying': 13,
                      'spirits': 2,
                      'dimethylglyoxime': 2,
                      'semiquantitative': 1,
                      'resonance': 10,
                      'Southern': 2,
                      'fellow': 3,
                      'halfway': 1,
                      'salty': 1,
                      'Preparation': 6,
                      'W-region': 5,
                      'Attributes': 1,
                      'pioneering': 1,
                      'taught': 1,
                      'headquarters': 3,
                      '**zq': 2,
                      'Dwight': 1,
                      'basket': 3,
                      'isolating': 3,
                      'M': 8,
                      'generates': 1,
                      'complained': 3,
                      '19': 8,
                      'legislate': 1,
                      'space-time': 1,
                      'Extensive': 1,
                      'constrictors': 1,
                      'intentions': 3,
                      'noble': 2,
                      'Nor': 4,
                      'hated': 3,
                      'imposed': 6,
                      'campus': 5,
                      'Meaning': 2,
                      'Phonemes': 1,
                      'manages': 1,
                      'five': 33,
                      'audacity': 1,
                      'dividends': 3,
                      'common': 61,
                      'Celso': 1,
                      'less': 109,
                      'outgoing': 1,
                      '153': 2,
                      'upturn': 1,
                      'participate': 1,
                      'graphed': 1,
                      'chronologically': 1,
                      'pantheon': 1,
                      'smaller': 23,
                      'purification': 2,
                      'gravity': 2,
                      'thyroxine-binding': 1,
                      'shall': 40,
                      'Greek': 17,
                      'innocence': 3,
                      'magnificent': 2,
                      'retailing': 4,
                      'circumscriptions': 1,
                      'Report': 11,
                      'thinks': 2,
                      'low-duty': 1,
                      'electrophoresis': 5,
                      'instrumentalities': 1,
                      'intrusive': 1,
                      'forward-moving': 1,
                      'usages': 2,
                      'Literal': 1,
                      'peoples': 4,
                      'accusingly': 1,
                      'Tucson': 1,
                      'claims': 22,
                      'executives': 1,
                      'Van': 4,
                      'seventy-two': 1,
                      'charred': 1,
                      'seventeenth': 1,
                      'justification': 3,
                      'symbolize': 4,
                      'Groth': 4,
                      'rationally': 1,
                      'oval': 1,
                      '2,000': 1,
                      'Start': 1,
                      'radius': 5,
                      'U.': 9,
                      'deep-tendon': 1,
                      'dioxide': 1,
                      'flow': 29,
                      'nuisances': 1,
                      'witnessing': 1,
                      'yielding': 1,
                      'concealing': 1,
                      'race': 7,
                      'task': 12,
                      'culminated': 1,
                      'bales': 1,
                      'overshoots': 1,
                      'optimization': 1,
                      'unsuitable': 1,
                      'sort': 21,
                      'hypothesized': 2,
                      'PQ': 1,
                      'sorption-desorption': 1,
                      '-': 67,
                      'uncluttered': 1,
                      'kind': 30,
                      'questionable': 1,
                      'mid-1948': 1,
                      'constitutional': 3,
                      'disbelief': 1,
                      'thermopile': 1,
                      'potential': 23,
                      'argon': 4,
                      'carboxymethyl': 1,
                      'Student': 1,
                      'recalled': 2,
                      'Byron': 1,
                      'core-jacket': 1,
                      'casually': 1,
                      'Rothko': 1,
                      'Sympathy': 1,
                      'Substantive': 1,
                      'visitors': 4,
                      'Hotel': 1,
                      'canyon': 1,
                      'swift': 1,
                      "Erasmus's": 1,
                      'Clemenceau': 1,
                      'shearing': 4,
                      'countryside': 1,
                      'anti-Soviet': 2,
                      'represents': 17,
                      'traffic': 7,
                      'ear': 1,
                      'Amaral': 1,
                      'dramatically': 2,
                      'pins': 2,
                      'ring-labeled': 1,
                      'lower': 48,
                      'nostalgically': 1,
                      'intensifiers': 6,
                      'consanguineous': 1,
                      '325-degrees-C': 1,
                      'neocortical-hypothalamic': 2,
                      'though': 57,
                      'architect': 1,
                      'successful': 15,
                      'causing': 6,
                      'AAb/': 2,
                      "Homer's": 1,
                      'defeat': 2,
                      'anorthic': 1,
                      'motif': 3,
                      'adsorbed': 3,
                      'misrepresenting': 1,
                      'saved': 9,
                      'antics': 1,
                      'kinetic': 8,
                      'grade-constructed': 1,
                      'princess': 2,
                      "aren't": 3,
                      'NC': 1,
                      'permitting': 3,
                      'designation': 1,
                      'unhealthy': 3,
                      'past': 44,
                      'non-competitive': 1,
                      'twisted': 2,
                      ...}),
             'lore': Counter({'Without': 1,
                      'vacuum': 1,
                      'lighted': 1,
                      'Parish': 1,
                      'laden': 1,
                      'judgments': 1,
                      'More': 11,
                      'tribunals': 1,
                      '23': 1,
                      'Anthony': 2,
                      'assent': 1,
                      'speak': 8,
                      'located': 7,
                      'economics': 1,
                      'Ai': 1,
                      'education': 24,
                      'house': 42,
                      'promises': 1,
                      '1906': 1,
                      'skilled': 3,
                      'conspirators': 1,
                      'overdeveloped': 1,
                      'owned': 5,
                      'tube': 1,
                      'building': 15,
                      'Adams': 1,
                      'waters': 7,
                      'preoccupation': 2,
                      'Between': 1,
                      'dig': 1,
                      'twenty-nine': 1,
                      'skirted': 1,
                      'Plaster': 1,
                      'junction': 1,
                      'roundup': 1,
                      'intrinsic': 1,
                      'Pagans': 1,
                      'use': 68,
                      'Sleight': 1,
                      'killed': 13,
                      'experience': 30,
                      'incurably': 1,
                      'served': 24,
                      'committee': 8,
                      'Doria': 1,
                      'ailments': 3,
                      'bars': 1,
                      'sports': 4,
                      'population': 9,
                      "you'uns": 1,
                      'tempted': 1,
                      'stared': 1,
                      'Sex': 1,
                      'Deck': 1,
                      'thoroughly': 4,
                      'nomination': 2,
                      'plunder': 1,
                      'procedural': 1,
                      'fluctuating': 1,
                      'certain': 31,
                      'season': 9,
                      'zeal': 2,
                      'Jenny': 1,
                      'deal': 17,
                      'demanded': 4,
                      'others': 31,
                      'child-bearing': 1,
                      'sunshine': 1,
                      'psycho-physiology': 1,
                      'manufacturers': 3,
                      'Wait': 1,
                      'tarry': 1,
                      'strains': 2,
                      'button-down': 1,
                      'transition': 2,
                      'Fuji': 1,
                      'Funari': 1,
                      'British-American': 1,
                      'catch': 3,
                      'beautiful': 17,
                      'gesture': 1,
                      'sou': 1,
                      'intimidation': 2,
                      'shorter': 3,
                      'marriages': 7,
                      'browsing': 1,
                      'programming': 1,
                      'self-respect': 1,
                      'Soukhouma': 1,
                      'denying': 2,
                      'Eternal': 1,
                      'dilapidated': 1,
                      'own': 80,
                      'Maier': 1,
                      'cheer': 2,
                      'retains': 1,
                      "countin'": 3,
                      'grotesquely': 1,
                      'imposition': 1,
                      'crasher': 1,
                      'indiscriminate': 1,
                      'Enough': 1,
                      'socked': 1,
                      'constituents': 1,
                      'paradoxically': 3,
                      'chicken': 2,
                      'determine': 4,
                      'navigation': 2,
                      'barrage': 1,
                      'materially': 1,
                      'correct': 8,
                      'volcanic': 2,
                      'Drake': 1,
                      'cabins': 4,
                      "Sorrentine's": 1,
                      'sage': 1,
                      "Side's": 1,
                      'kitchen': 6,
                      'amount': 7,
                      'colleague': 1,
                      'pursuits': 2,
                      '51': 1,
                      'Rake': 1,
                      'crack': 3,
                      'Called': 1,
                      'pressure': 26,
                      'winded': 1,
                      'Creator': 1,
                      'Grands': 3,
                      'Staffe': 4,
                      'northward': 3,
                      'way': 86,
                      'mud': 4,
                      'evils': 1,
                      'expressions': 1,
                      "wouldn't": 4,
                      'munch': 1,
                      'cheated': 1,
                      'Warfield': 1,
                      'through': 97,
                      'Following': 1,
                      "denyin'": 1,
                      'picturesque': 1,
                      'Rugged': 1,
                      'renamed': 1,
                      'Just': 11,
                      'preferring': 1,
                      'specific': 6,
                      'Sicilians': 2,
                      'each': 64,
                      'Because': 7,
                      'hacked': 1,
                      'proclamation': 2,
                      'boon': 1,
                      'Justice': 5,
                      'ink': 3,
                      'tons': 10,
                      'lift': 1,
                      'Huston': 1,
                      'vegetable': 3,
                      'France': 10,
                      'ceased': 2,
                      'Democrats': 3,
                      'craft': 1,
                      'postscript': 1,
                      'cottage': 1,
                      'shy': 1,
                      'raised': 8,
                      'fertility': 1,
                      'stray': 2,
                      'embraces': 1,
                      'stream': 5,
                      'revealed': 9,
                      'grave': 3,
                      'smell': 4,
                      'pastoral': 4,
                      'believing': 3,
                      'keyhole': 1,
                      'fabulous': 2,
                      'hustle': 1,
                      'situations': 3,
                      'irreconcilable': 1,
                      'experimenter': 1,
                      'Quacks': 1,
                      'Related': 1,
                      'Doris': 1,
                      'At': 42,
                      "I'm": 5,
                      'seas': 2,
                      'prolongation': 1,
                      'stimulated': 1,
                      'mastery': 2,
                      'operating': 3,
                      'custody': 1,
                      'mauling': 1,
                      'Messenger': 1,
                      'staked': 1,
                      'scoop': 1,
                      'Those': 4,
                      'surprisingly': 2,
                      'enlist': 1,
                      'fibrous': 1,
                      'rejection': 3,
                      'rights': 5,
                      'lucky': 3,
                      'career': 6,
                      'baggage': 1,
                      'husband': 36,
                      'friends': 15,
                      'chop': 1,
                      'bet': 1,
                      'sleet': 1,
                      'integral': 3,
                      'Prime': 4,
                      'shouting': 3,
                      'worse': 7,
                      'full-grown': 1,
                      'poark': 1,
                      'negative': 1,
                      "patient's": 5,
                      'suns': 1,
                      'small': 37,
                      'bull': 4,
                      'jam': 1,
                      'enemy': 9,
                      'Typically': 2,
                      'competition': 1,
                      'recruits': 2,
                      '$5-8,000': 1,
                      'Thevenow': 1,
                      'knife': 3,
                      'drilled': 2,
                      'chaperone': 1,
                      'generates': 1,
                      'recently': 8,
                      'earthy': 2,
                      'reined': 1,
                      'landmarks': 1,
                      'soda': 2,
                      'peas': 5,
                      'legislation': 3,
                      'next': 28,
                      'armchair': 1,
                      'reared': 1,
                      'mustard': 6,
                      'gravity': 1,
                      'shall': 12,
                      'reinstated': 1,
                      'e.g.': 2,
                      'really': 29,
                      'peoples': 9,
                      'broad': 4,
                      'lends': 1,
                      'idling': 1,
                      'Barrow': 1,
                      'external': 1,
                      'Brevet': 1,
                      'drinker': 1,
                      'disdain': 1,
                      'overloaded': 1,
                      'Please': 1,
                      'striving': 1,
                      'slated': 1,
                      'rushed': 4,
                      'borrowed': 1,
                      'uncorked': 1,
                      'fragrances': 1,
                      'signals': 4,
                      'effect': 10,
                      'rafts': 1,
                      'exact': 4,
                      'huckster': 1,
                      'attractive': 5,
                      'hunk': 1,
                      'Sibley': 1,
                      'peridontal': 1,
                      'officio': 1,
                      'shift': 4,
                      'accounted': 1,
                      'cereal': 12,
                      'fastened': 1,
                      'artists': 3,
                      'masculine': 1,
                      'confronting': 1,
                      'Force': 2,
                      'Amateur': 1,
                      'survivors': 3,
                      'twelve': 4,
                      'cannon': 2,
                      'telegraph': 1,
                      'Joseph': 5,
                      'Drawn': 1,
                      'keen': 1,
                      'chastity': 1,
                      'relax': 1,
                      'eyewitness': 1,
                      'Bureau': 5,
                      'terribly': 3,
                      'wives': 9,
                      'muddy-tasting': 1,
                      "cowhand'd": 1,
                      'runaway': 1,
                      'psychotherapy': 1,
                      'palms': 1,
                      'stead': 2,
                      'ask': 7,
                      'orgasm': 1,
                      'retrovision': 1,
                      'found': 51,
                      'detachment': 1,
                      'banditos': 1,
                      'pronouns': 1,
                      'owning': 1,
                      "one's": 14,
                      'muck': 1,
                      'cultivating': 2,
                      'mittens': 1,
                      'inflame': 1,
                      'pretty': 6,
                      'mother': 19,
                      'Caribbean': 1,
                      'Garry': 7,
                      'things': 31,
                      'Now': 18,
                      'thank': 1,
                      'Bethlehem': 1,
                      'department': 5,
                      'colorful': 3,
                      'independent': 3,
                      'lewd': 1,
                      'exported': 1,
                      'yardstick': 1,
                      'permission': 3,
                      'divided': 6,
                      'suppose': 2,
                      'feed': 7,
                      'fodder': 1,
                      'ambush': 1,
                      'prompt': 3,
                      'bits': 1,
                      'transported': 2,
                      '4,000-foot': 1,
                      '18': 2,
                      'drawn': 4,
                      'inaugural': 1,
                      'ecclesiastical': 2,
                      'Smoke': 1,
                      'rebelled': 2,
                      'bat': 1,
                      'vainly': 1,
                      'euphoric': 1,
                      'investigations': 2,
                      'gallop': 1,
                      'chiropractor': 1,
                      'one-third': 3,
                      'Anti-Americanism': 1,
                      'hatred': 4,
                      'snack': 3,
                      'reflects': 6,
                      'Doubtful': 1,
                      'Wright': 2,
                      'German': 13,
                      'obliterate': 1,
                      'well-modulated': 1,
                      'crofters': 1,
                      'dissolved': 3,
                      'neighbor': 2,
                      'scream': 2,
                      'signal': 3,
                      'vehicles': 1,
                      '1959': 2,
                      "rustlin'": 1,
                      'musicians': 2,
                      'managing': 2,
                      'objectiveness': 1,
                      'Ramsperger': 1,
                      'gushed': 1,
                      'parts-suppliers': 1,
                      'Fretting': 1,
                      'Discovery': 10,
                      'Census': 1,
                      'shoot': 5,
                      'shadow': 2,
                      'prints': 1,
                      'marched': 1,
                      'stiffened': 1,
                      'writer-turned-painter': 1,
                      'rearguard': 1,
                      'Kamchatka': 1,
                      'evidence': 20,
                      'narrowing': 1,
                      'flights': 1,
                      'virginity': 3,
                      'outset': 1,
                      'Scholar': 2,
                      'required': 13,
                      'one-reel': 1,
                      'milestone': 1,
                      'Massachusetts': 9,
                      'wages': 4,
                      'Merchant': 1,
                      'Lord': 4,
                      'Division': 2,
                      'institutionalized': 1,
                      'sticky': 1,
                      'housewives': 1,
                      'bullyboys': 1,
                      'strip': 1,
                      '6th': 1,
                      'hunting': 2,
                      'overran': 1,
                      'run': 20,
                      'discussed': 5,
                      'speedy': 1,
                      'stores': 9,
                      'equity': 2,
                      'veering': 1,
                      'manly': 2,
                      'careful': 3,
                      'Continental': 1,
                      'wooded': 1,
                      'importantly': 2,
                      'fronting': 1,
                      'ricocheted': 1,
                      'unconcerned': 2,
                      'feather': 1,
                      'dimension': 2,
                      "Sande's": 1,
                      'not-less-deadly': 1,
                      'themselves': 26,
                      'complaint': 2,
                      'academic': 14,
                      'dome': 3,
                      'Deputy': 1,
                      'phis': 1,
                      'experiences': 15,
                      'wicker': 1,
                      'Hopedale': 1,
                      'stumps': 1,
                      'reconciliation': 1,
                      'positive': 10,
                      'Santa': 4,
                      'actuality': 3,
                      'Cheyennes': 1,
                      'provoke': 1,
                      'Good': 3,
                      'Howard': 2,
                      'flier': 1,
                      'speaks': 4,
                      'Pincian': 2,
                      'reader': 1,
                      'aviator': 2,
                      'radar-controlled': 3,
                      'entertain': 2,
                      'appended': 1,
                      'pulled': 3,
                      'blacked': 1,
                      'continuous': 3,
                      'framed': 2,
                      'mercy': 2,
                      'sentences': 1,
                      'methods': 15,
                      'visited': 9,
                      'Manufacturers': 1,
                      'loaded': 3,
                      'busiest': 1,
                      'disposed': 4,
                      'Also': 6,
                      'radio': 4,
                      'myriad': 2,
                      'Thiot': 1,
                      'delivery': 2,
                      'engraved': 2,
                      'trees': 4,
                      "mother's": 5,
                      'Japanese': 17,
                      'Daer': 5,
                      'Frederick': 1,
                      'due': 5,
                      'flattening': 1,
                      'Sheldon': 1,
                      'seeming': 1,
                      'shrewdly': 1,
                      'plea': 2,
                      'warmed': 1,
                      'precious': 1,
                      'sixty': 1,
                      'checked': 2,
                      'racketeer': 1,
                      'midwestern': 2,
                      'rejoin': 1,
                      'problem': 25,
                      'freer': 1,
                      'winter': 17,
                      'distance': 10,
                      '5000': 1,
                      'crashing': 2,
                      'percentage': 5,
                      'ever-existent': 1,
                      'Hockett': 1,
                      'Portico': 1,
                      'Trustees': 2,
                      'sameness': 1,
                      'immensity': 1,
                      'reflect': 1,
                      'splattered': 1,
                      'looking': 6,
                      'fourth': 6,
                      'Rivers': 1,
                      'Journal': 1,
                      'Fall': 6,
                      'sixth': 1,
                      'sheik': 4,
                      'Semiramis': 1,
                      'British': 16,
                      'machine-gunned': 1,
                      'seismic': 1,
                      'trading': 10,
                      'moderately': 1,
                      'Time': 4,
                      'past': 24,
                      'humor': 3,
                      'periods': 4,
                      'describing': 3,
                      'fibers': 1,
                      'delegation': 1,
                      'sentimental': 4,
                      'Big': 6,
                      'fostering': 1,
                      'justice': 11,
                      'corroborated': 1,
                      'marketable': 4,
                      'Sibling': 1,
                      'remedy': 6,
                      'pointed': 7,
                      'wounded': 8,
                      'Usually': 5,
                      'boxed': 1,
                      'determination': 3,
                      'evacuation': 2,
                      'slowly': 6,
                      'Caring': 1,
                      'unlocked': 3,
                      'place': 70,
                      'piece': 3,
                      'automobiles': 2,
                      'rode': 5,
                      'truthfulness': 1,
                      'committees': 1,
                      'payroll': 1,
                      'Parks': 1,
                      'bombers': 4,
                      'Flint': 3,
                      'tasted': 1,
                      'Tombigbee': 2,
                      'first': 145,
                      'Steichen': 1,
                      'atmosphere': 5,
                      'teetering': 1,
                      'Bantus': 3,
                      'union': 9,
                      'Saratoga': 1,
                      'urgent': 1,
                      'land': 28,
                      'fade-in': 1,
                      "appearin'": 1,
                      'I.B.M.': 1,
                      'carreer': 1,
                      'blocking': 1,
                      'announcements': 1,
                      'files': 2,
                      'Natural': 1,
                      '1825': 1,
                      'instrument': 5,
                      'pork': 2,
                      'could': 141,
                      'Especially': 1,
                      'dissimilar': 1,
                      'rusty': 1,
                      'conform': 3,
                      'audience': 11,
                      'rich': 9,
                      'tap': 2,
                      'Wilson': 7,
                      "Holt's": 1,
                      'spaciousness': 1,
                      'Patton': 1,
                      'planting': 2,
                      'Gravesend': 1,
                      'Fife': 1,
                      'pride': 2,
                      'hailstorm': 1,
                      'Franklin': 6,
                      'commended': 1,
                      'sequence': 5,
                      'twisting': 1,
                      'Jean-Honore': 1,
                      'Maxentius': 1,
                      'rattle': 1,
                      'Dag': 1,
                      'brute': 2,
                      'mortally': 1,
                      'customs': 4,
                      'berated': 1,
                      'ran': 12,
                      'wagons': 2,
                      'meant': 13,
                      'Monroe': 1,
                      'felt': 17,
                      'cloudburst': 1,
                      'sentenced': 1,
                      'forbids': 1,
                      'runing': 1,
                      'inventory': 1,
                      'unsafe': 1,
                      'picture': 14,
                      'guarding': 2,
                      'sympathetic': 2,
                      'adapted': 5,
                      'Interpretation': 2,
                      'convention': 3,
                      'healthier': 1,
                      'buildings': 8,
                      'Immediately': 3,
                      'Allies': 1,
                      "weeks'": 1,
                      'woodwork': 1,
                      'Arenula': 1,
                      'exempt': 1,
                      'collapsing': 1,
                      '1953': 2,
                      'welding': 1,
                      'intimate': 2,
                      'corrosive': 1,
                      'beam': 1,
                      'cramps': 2,
                      'braces': 2,
                      'eighth': 4,
                      'counselor': 1,
                      'Loy': 1,
                      'modest': 2,
                      'presto': 1,
                      'Frank': 2,
                      'recognize': 7,
                      'improve': 4,
                      'cheetah': 1,
                      'unimpressed': 1,
                      'marked': 11,
                      'alginates': 1,
                      'companions': 1,
                      'higher': 21,
                      'constructively': 1,
                      'Dutch': 6,
                      'deplore': 1,
                      'cabinets': 1,
                      'Security': 1,
                      'Stanley': 1,
                      'nod': 2,
                      'Harvard': 7,
                      'hardships': 1,
                      'firmly': 3,
                      'Nutrition': 1,
                      'recollections': 1,
                      'backed': 2,
                      'skunks': 1,
                      'whizzing': 1,
                      'fruit': 8,
                      'Gunfire': 1,
                      'northerly': 1,
                      'planters': 2,
                      'yelps': 1,
                      'passivity': 1,
                      'triad': 1,
                      'wall': 9,
                      'told': 42,
                      'assassinated': 1,
                      'question': 25,
                      'life-like': 2,
                      'discovery': 8,
                      'civic': 1,
                      'waits': 1,
                      'The': 647,
                      'oratory': 1,
                      'promise': 2,
                      'D.': 9,
                      'non-college': 1,
                      'coincidence': 1,
                      'gases': 1,
                      'Ocean': 1,
                      'candles': 3,
                      'tuxedoed': 1,
                      'of': 3668,
                      'shores': 2,
                      'warms': 1,
                      'confiding': 1,
                      'brashness': 1,
                      '$10,000': 2,
                      'withdrawn': 1,
                      'law': 20,
                      'slugger': 1,
                      'untold': 1,
                      'police': 26,
                      'visiting': 2,
                      'inboard': 1,
                      'virility': 2,
                      'narrative': 7,
                      'rear': 6,
                      "Henry's": 1,
                      'Spokesmen': 1,
                      'metal': 2,
                      'corner': 6,
                      'sworn': 2,
                      'cups': 2,
                      'lived': 12,
                      'Ivies': 2,
                      'protest': 5,
                      'Information': 4,
                      'Anticipating': 1,
                      'self-confidence': 1,
                      'erupt': 2,
                      'resigned': 2,
                      "Knowlton's": 1,
                      'watching': 6,
                      'synagogue': 2,
                      'express': 3,
                      'maid': 3,
                      'Tapley': 1,
                      'numb': 1,
                      'conservationist': 1,
                      'investigating': 1,
                      'facts': 17,
                      'recreation': 2,
                      'Hudson': 34,
                      'unusually': 1,
                      'carts': 3,
                      'assembled': 3,
                      'plump': 1,
                      'disastrous': 4,
                      "suite's": 1,
                      'peculiar': 3,
                      'display': 4,
                      "Burlington's": 1,
                      'Dugan': 2,
                      'policing': 2,
                      'straightening': 2,
                      'linguists': 3,
                      'Outdoor': 1,
                      'homosexual': 1,
                      'factory': 1,
                      "America's": 2,
                      'poisonous': 3,
                      'hung': 2,
                      'arc': 1,
                      'relation': 9,
                      'book': 13,
                      'fragmentary': 1,
                      'Italo-American': 1,
                      'sympathetically': 1,
                      'egocentric': 1,
                      'unrest': 1,
                      'saps': 1,
                      'suggested': 11,
                      'linguistics': 3,
                      'minimal': 2,
                      'fame': 1,
                      'employees': 3,
                      'mentions': 1,
                      'Renaissance': 4,
                      'jacket': 1,
                      'nightly': 1,
                      'Interruptions': 1,
                      'bound': 7,
                      'sullen': 1,
                      'knowingly': 2,
                      'intend': 2,
                      'platted': 1,
                      'old-time': 2,
                      'presents': 3,
                      'post-Inaugural': 1,
                      'purpose': 17,
                      'give': 31,
                      'outgrow': 1,
                      'laundering': 1,
                      'puckering': 1,
                      'fluid': 2,
                      'membership': 6,
                      'surprises': 1,
                      'incompatible': 1,
                      'seismograph': 1,
                      'cheek': 1,
                      'horrors': 2,
                      'sensibilities': 2,
                      'tape': 4,
                      'bushels': 3,
                      'beauty': 9,
                      'question-and-answer': 1,
                      'coop': 1,
                      'trouser': 1,
                      'flatly': 1,
                      'idea': 14,
                      'dealers': 3,
                      "seizin'": 1,
                      'Richard': 2,
                      'directed': 6,
                      'waving': 2,
                      'graybeards': 1,
                      'premature': 1,
                      'aesthetic': 2,
                      'sanction': 1,
                      'Bey': 6,
                      'diverse': 4,
                      'strenuous': 2,
                      'Arrack': 1,
                      'crest': 7,
                      'hawker': 1,
                      'allow': 7,
                      'rag': 3,
                      'critical': 4,
                      'monumentally': 1,
                      'library': 3,
                      'diminishes': 2,
                      'soldier-masters': 1,
                      'cancer': 8,
                      'boost': 1,
                      'wander': 1,
                      'missing': 3,
                      'actually': 26,
                      'flank': 1,
                      'Several': 2,
                      'rocked': 1,
                      'strivings': 1,
                      'explanation': 3,
                      'replied': 9,
                      'juice': 1,
                      'vital': 5,
                      'after': 85,
                      'Thirdly': 1,
                      'questioning': 2,
                      'ward': 4,
                      'Kipling': 1,
                      'madstones': 1,
                      'corrugated': 1,
                      'cornmeal': 1,
                      'response': 5,
                      'barrel': 2,
                      'homemaker': 1,
                      'T.B.': 1,
                      'procurer': 2,
                      'authority': 10,
                      'Point': 1,
                      'contouring': 1,
                      '1844': 3,
                      'confused': 3,
                      'bricklaying': 1,
                      'missile': 2,
                      'schooner': 2,
                      'big': 15,
                      'ill-conceived': 2,
                      'long-term': 1,
                      'quantities': 1,
                      'armament': 1,
                      'inspect': 2,
                      'E': 1,
                      'grenades': 2,
                      'attempted': 1,
                      'completely': 16,
                      'C': 1,
                      'wooden': 4,
                      'cold': 18,
                      'opened': 6,
                      'Valley': 2,
                      'rookies': 1,
                      'stain': 1,
                      'Opportunities': 1,
                      'wary': 1,
                      'twenty-three': 1,
                      'Nathan': 1,
                      'fate': 1,
                      'Holy': 2,
                      'blast': 1,
                      'negociant': 2,
                      'unfortunately': 2,
                      'deck': 2,
                      'lend': 1,
                      'reminding': 1,
                      'Korea': 4,
                      'street': 15,
                      'regarding': 2,
                      'passable': 1,
                      'long': 71,
                      'utmost': 1,
                      'tooth': 12,
                      'current': 4,
                      'page': 5,
                      'brand-new': 1,
                      'locally': 1,
                      'cops': 2,
                      'hookup': 1,
                      'priest': 7,
                      'expectations': 1,
                      'cares': 1,
                      'dislike': 2,
                      'accuse': 2,
                      'town': 18,
                      'fulfilled': 1,
                      'dormitories': 1,
                      'compresses': 1,
                      'now': 84,
                      'peeked': 1,
                      'dramatic': 3,
                      'depositions': 3,
                      'plain-spoken': 1,
                      'nationalism': 7,
                      'commonplace': 3,
                      'rival': 4,
                      'suddenly': 11,
                      'heterogeneous': 3,
                      'De': 4,
                      '694': 1,
                      'gets': 8,
                      "we'll": 1,
                      'flat-bottomed': 1,
                      'inaudible': 1,
                      'casino': 1,
                      'loss': 13,
                      'boat': 5,
                      'three': 57,
                      'geologists': 2,
                      'automatic': 1,
                      'smolders': 1,
                      'cage': 4,
                      'demand': 8,
                      'stock': 12,
                      'Inauguration': 1,
                      'Spring': 3,
                      'Humor': 1,
                      'downed': 1,
                      'half-acceptance': 1,
                      'San': 3,
                      'drafty': 1,
                      'wage': 3,
                      'industrial': 15,
                      'virtue': 2,
                      'organizational': 2,
                      'pure': 3,
                      'organize': 1,
                      'extended': 3,
                      'twenty-nine-foot-wide': 1,
                      'patiently': 1,
                      'College': 21,
                      'Coffee': 1,
                      'odd': 1,
                      'twins': 1,
                      'frowns': 1,
                      'intrigues': 1,
                      'motives': 4,
                      'detective': 3,
                      'continuity': 6,
                      'gag': 1,
                      'forceful': 3,
                      'car': 7,
                      'murders': 9,
                      'rehearsed': 3,
                      'manipulated': 1,
                      'monastic': 1,
                      'shine': 1,
                      'cap': 1,
                      'smokescreen': 1,
                      'A.': 12,
                      'jibes': 1,
                      'Psychotherapy': 1,
                      'faker': 1,
                      'present': 21,
                      'describes': 1,
                      'sonofabitch': 1,
                      'science': 8,
                      'interdependence': 5,
                      'ranch': 4,
                      'carrots': 2,
                      'receives': 4,
                      'be': 570,
                      'wealth': 1,
                      'alternative': 2,
                      'racing': 1,
                      'device': 16,
                      'frolic': 1,
                      'ready': 16,
                      'corpses': 1,
                      'speckles': 1,
                      'pistol-whipped': 1,
                      'Ought': 1,
                      'culture': 13,
                      'rutabagas': 1,
                      'abundance': 2,
                      'shares': 1,
                      'heap': 3,
                      'engaged': 2,
                      'Plastics': 1,
                      'conclusion': 5,
                      'jail': 2,
                      ...}),
             'mystery': Counter({'Without': 3,
                      'vacuum': 1,
                      'lighted': 11,
                      'contented': 1,
                      'laden': 1,
                      'read': 6,
                      'verve': 1,
                      'dig': 1,
                      'high-pitched': 1,
                      'Anthony': 3,
                      'aboard': 1,
                      'anyhow': 4,
                      'speak': 2,
                      'unaware': 1,
                      'located': 2,
                      'plush': 1,
                      'avoiding': 1,
                      'education': 1,
                      'house': 41,
                      'desolate': 2,
                      'promises': 1,
                      'muscles': 3,
                      "She'd": 7,
                      'darling': 1,
                      'burrowing': 1,
                      'registrar': 1,
                      'building': 21,
                      'Adams': 1,
                      'helpless': 3,
                      'forearm': 2,
                      'hoping': 6,
                      'smell': 2,
                      'summoned': 1,
                      'crib': 2,
                      'walked': 31,
                      'hemorrhages': 1,
                      'Seaton': 1,
                      'dock': 2,
                      'use': 13,
                      'worse': 4,
                      'Carruthers': 3,
                      'killed': 12,
                      'experience': 2,
                      'Escape': 1,
                      'blood': 6,
                      'jarred': 1,
                      'sports': 1,
                      'outsmarted': 1,
                      'trip': 2,
                      'typewriter': 3,
                      'has': 17,
                      'Pretty': 5,
                      'thoroughly': 3,
                      'transposition': 1,
                      'pleading': 1,
                      'maintenance': 1,
                      'brushfire': 1,
                      'our': 14,
                      'along': 24,
                      'beau': 1,
                      'suspiciously': 2,
                      'hamburgers': 3,
                      'certain': 9,
                      'wages': 2,
                      'heaved': 1,
                      'sticky': 1,
                      'Gardner': 2,
                      'Beach': 10,
                      'sped': 3,
                      'demanded': 2,
                      'shade': 2,
                      'inspector': 3,
                      'others': 10,
                      'moulding': 1,
                      'half-hour': 2,
                      'shaved': 1,
                      'anyone': 9,
                      'mercy': 1,
                      'grew': 3,
                      'Jail': 1,
                      'groves': 1,
                      'taut': 1,
                      'bellboy': 4,
                      'catch': 3,
                      'sleeve': 1,
                      'nauseated': 1,
                      'beautiful': 3,
                      'Twenty-four': 1,
                      'marriages': 1,
                      'rafters': 1,
                      "Glendora's": 1,
                      'first': 37,
                      'candle': 6,
                      'hallucinating': 1,
                      'refuse': 2,
                      'Forties': 1,
                      'dilapidated': 1,
                      'developed': 1,
                      'Frisco': 2,
                      'lapses': 1,
                      'grease': 1,
                      'brushing': 1,
                      'Tracking': 1,
                      'taverns': 1,
                      'releasing': 1,
                      "supervisor's": 1,
                      'company': 5,
                      'wall': 19,
                      'convey': 1,
                      'chicken': 2,
                      'necktie': 1,
                      'under': 27,
                      'Quebec': 1,
                      'Pontchartrain': 1,
                      'correct': 3,
                      'bitch': 2,
                      'glance': 9,
                      "Joan's": 1,
                      'kitchen': 14,
                      'amount': 3,
                      'Maude': 21,
                      'blunt': 1,
                      'orders': 1,
                      'whorls': 1,
                      'parked': 19,
                      'Timothy': 3,
                      'schoolgirl': 1,
                      'omitted': 1,
                      'Aunt': 1,
                      'pressure': 3,
                      'overrun': 1,
                      'cheek': 2,
                      "you'll": 7,
                      'skylights': 2,
                      'busily': 1,
                      'directed': 1,
                      'anagram': 1,
                      'hear': 14,
                      'way': 61,
                      'long-stemmed': 1,
                      'man': 106,
                      "Ingleside's": 1,
                      'martinis': 1,
                      'unbelievable': 1,
                      "wouldn't": 32,
                      'draft': 1,
                      'hurry': 9,
                      'cheated': 1,
                      'provision': 1,
                      'bang': 2,
                      'through': 57,
                      'Following': 1,
                      'panes': 1,
                      'Ball': 1,
                      'No-o-o': 1,
                      'Just': 8,
                      'precaution': 1,
                      "we'll": 8,
                      'each': 18,
                      'Because': 4,
                      'battle': 1,
                      'That-a-way': 1,
                      'Rossi': 1,
                      'wastrel': 1,
                      'pavement': 2,
                      'Pamela': 1,
                      'stakes': 1,
                      'plain': 1,
                      'forget': 4,
                      'apparently': 7,
                      'burden': 2,
                      'ye': 1,
                      'cottage': 2,
                      'shy': 1,
                      'raised': 9,
                      'fingerprints': 4,
                      'stray': 2,
                      'love': 7,
                      'embraces': 1,
                      'ate': 1,
                      'grave': 1,
                      'mileage': 1,
                      'repeated': 5,
                      'eventually': 2,
                      'apron': 3,
                      'believing': 1,
                      'Trinidad': 1,
                      'suburbs': 1,
                      'situations': 1,
                      "Pam's": 1,
                      'service': 4,
                      'depressed': 2,
                      'Doris': 1,
                      'hard': 15,
                      'fences': 2,
                      'At': 28,
                      "I'm": 53,
                      'focused': 2,
                      'patrolman': 2,
                      'Station': 2,
                      'blustery': 1,
                      'prune': 1,
                      'Take': 5,
                      'None': 3,
                      "moraine's": 1,
                      'hurtled': 1,
                      'baby': 3,
                      'Those': 4,
                      'onlookers': 1,
                      'Nelson': 6,
                      'negligent': 1,
                      'drop': 5,
                      'career': 2,
                      'idly': 1,
                      'husband': 11,
                      'friends': 9,
                      'Quickly': 1,
                      'pulling': 5,
                      'dully': 2,
                      'wiped': 3,
                      'scanned': 1,
                      'slip': 1,
                      'manage': 3,
                      'mud-caked': 1,
                      'Brian': 5,
                      'tenuous': 1,
                      'small': 28,
                      'Miss': 11,
                      'park': 1,
                      'Petey': 1,
                      'red': 7,
                      'reverberated': 1,
                      'expressed': 1,
                      'powerful': 3,
                      'doubtfully': 1,
                      'inferred': 1,
                      'sprawl': 1,
                      'appearance': 1,
                      'forward': 8,
                      'brim': 1,
                      'bins': 1,
                      'decent': 2,
                      'Andy': 29,
                      "Harris'": 1,
                      'undulating': 1,
                      'dispel': 1,
                      'thunder': 3,
                      'peas': 1,
                      'snapped': 1,
                      'next': 29,
                      'Palmer': 1,
                      'p.': 2,
                      'reared': 1,
                      'strike': 1,
                      'screens': 2,
                      'four': 13,
                      'badly': 2,
                      'Dorado': 3,
                      'chomp': 1,
                      'waist': 2,
                      'examined': 3,
                      'advance': 1,
                      'really': 21,
                      'Murder': 1,
                      'broad': 2,
                      'predictable': 1,
                      'external': 1,
                      'scent': 1,
                      'please': 4,
                      'grandfather-father-to-son': 1,
                      'Please': 2,
                      'comforting': 1,
                      'rushed': 1,
                      'efforts': 2,
                      'signals': 2,
                      'effect': 3,
                      'bars': 1,
                      'Pete': 9,
                      'Lanesville': 1,
                      'exact': 2,
                      'clothed': 1,
                      'Anything': 5,
                      'thin': 12,
                      'Dumpty': 1,
                      'returning': 4,
                      'conspiratorial': 1,
                      'King': 7,
                      'family': 7,
                      'swimming': 1,
                      'scrutinizing': 1,
                      'alarmed': 2,
                      'evening': 11,
                      'prisoners': 2,
                      'twelve': 3,
                      'pungent': 1,
                      'heater': 1,
                      'fear': 5,
                      'tip': 4,
                      'Bureau': 2,
                      'unperceived': 1,
                      'wives': 1,
                      'retrieved': 1,
                      'complying': 1,
                      'regretted': 2,
                      'dust': 6,
                      'blaring': 1,
                      'late': 5,
                      'a.m.': 3,
                      '``': 740,
                      'relief': 4,
                      'prettier': 2,
                      'grillework': 1,
                      'ask': 17,
                      'forms': 2,
                      'found': 42,
                      'Reverse': 1,
                      'Forgot': 1,
                      'doubted': 2,
                      'pedals': 1,
                      'adventure': 1,
                      "one's": 1,
                      ':': 27,
                      'pretty': 8,
                      'telescopic': 1,
                      'bell': 1,
                      'mother': 11,
                      'things': 26,
                      'Now': 31,
                      'N-no': 1,
                      'thank': 2,
                      'can': 42,
                      'mailboxes': 4,
                      'negligence': 1,
                      'muddy': 1,
                      'dismal': 1,
                      'divided': 2,
                      'suppose': 9,
                      'reconstruction': 1,
                      'poured': 2,
                      'storehouse': 3,
                      'brown': 4,
                      'cocked': 1,
                      'drawn': 3,
                      'spur': 2,
                      'Reporters': 1,
                      'mauve': 1,
                      'sir': 8,
                      'unscrewed': 1,
                      'commences': 1,
                      'raked': 1,
                      'dialed': 2,
                      'upstairs': 3,
                      "Pops's": 1,
                      'studied': 2,
                      'feelings': 1,
                      'activities': 2,
                      'Eliminating': 1,
                      'staginess': 1,
                      'leave': 26,
                      'York': 8,
                      'signal': 11,
                      'vehicles': 1,
                      'grabbed': 4,
                      'power': 2,
                      'paunchy': 2,
                      'hafta': 1,
                      'yielded': 1,
                      'till': 8,
                      'shoot': 1,
                      'shadow': 8,
                      'prints': 3,
                      'any': 64,
                      'stiffened': 2,
                      'Revere': 1,
                      'stake-out': 3,
                      'bore': 2,
                      'brake': 1,
                      'dying': 2,
                      'observe': 2,
                      'automatically': 3,
                      'season': 1,
                      'gross': 1,
                      'odors': 2,
                      'Road': 4,
                      'deeds': 1,
                      'Joan': 2,
                      'foot': 6,
                      'pausing': 1,
                      'curiously': 3,
                      'drink': 16,
                      'mouthing': 1,
                      'God': 13,
                      'run': 18,
                      'discussed': 3,
                      'poisoned': 1,
                      'merging': 1,
                      'stores': 1,
                      'dedicated': 1,
                      'cobra': 1,
                      'eyebrows': 4,
                      'Regulars': 1,
                      'Sir': 2,
                      'Gibby': 7,
                      'flat': 3,
                      'barest': 1,
                      'unheeding': 1,
                      'uniform': 2,
                      'uncomfortable': 1,
                      'Came': 1,
                      'thousand': 14,
                      'poised': 3,
                      'obsession': 1,
                      'themselves': 8,
                      'guest': 4,
                      'sketch': 1,
                      'send': 10,
                      'slab': 2,
                      'dome': 1,
                      'music': 5,
                      'visit': 1,
                      'fifty-four': 1,
                      'experiences': 1,
                      'moral': 1,
                      'fallen': 3,
                      'fine': 4,
                      'trustees': 1,
                      'positive': 2,
                      'sweetness': 1,
                      'Plymouth': 6,
                      'actuality': 2,
                      'Dammit': 2,
                      'Birdwood': 1,
                      'execution': 1,
                      'terminal': 1,
                      'disappears': 1,
                      'victim': 2,
                      'two-line': 1,
                      'Wait': 7,
                      'so-called': 1,
                      'roof': 8,
                      'progressing': 1,
                      'Prettier': 1,
                      'her': 296,
                      'scenes': 1,
                      'bottom': 5,
                      'pulled': 15,
                      'frizzled': 1,
                      'Carl': 1,
                      'awfully': 1,
                      'setup': 1,
                      'Past': 1,
                      'longer': 11,
                      'sing': 2,
                      'strange': 10,
                      'automobiles': 1,
                      'framed': 1,
                      'Though': 1,
                      'Kee-reist': 1,
                      'domestic': 3,
                      'visited': 1,
                      'scowled': 1,
                      'loaded': 1,
                      'terrified': 1,
                      'steam': 1,
                      'Also': 7,
                      'radio': 5,
                      'coat': 5,
                      'delivery': 1,
                      'ruse': 1,
                      'trees': 4,
                      "mother's": 1,
                      "they're": 6,
                      'Some': 8,
                      'water-filled': 1,
                      'Carmody': 2,
                      'Emotionally': 1,
                      'fastidious': 1,
                      'Oscar': 1,
                      'due': 1,
                      'attainment': 1,
                      'snowed': 2,
                      'Sheldon': 2,
                      'cut': 15,
                      'preamble': 1,
                      'crossed': 5,
                      'establishments': 1,
                      'warmed': 2,
                      'sixty': 2,
                      'checked': 9,
                      'proviso': 1,
                      'serve': 1,
                      'manpower': 1,
                      'broad-brimmed': 1,
                      'butane': 1,
                      'problem': 6,
                      'pulse': 1,
                      'winter': 1,
                      'distance': 7,
                      'Quintana': 1,
                      'continuously': 1,
                      'walking': 5,
                      'commanded': 1,
                      'receive': 1,
                      'floorboards': 2,
                      'informal': 1,
                      'mesmerized': 1,
                      "money's": 1,
                      'looking': 24,
                      'fourth': 2,
                      'denied': 1,
                      'sugar': 4,
                      "Harris's": 1,
                      'thanks': 2,
                      'striking': 1,
                      'British': 2,
                      'loft': 1,
                      'Time': 1,
                      'humor': 1,
                      'Bast': 1,
                      'grooms': 1,
                      'Norberg': 6,
                      'ribs': 4,
                      'hallway': 3,
                      "We're": 6,
                      'Big': 1,
                      'bottle': 20,
                      "lieutenant's": 3,
                      'eying': 1,
                      'Examiner': 1,
                      'overlapped': 1,
                      'scientist': 1,
                      'grunted': 2,
                      'pointed': 6,
                      'Usually': 1,
                      'Expecting': 1,
                      'same': 41,
                      'dragged': 2,
                      'slowly': 11,
                      'unlocked': 6,
                      'place': 33,
                      'piece': 5,
                      'eerily': 1,
                      'gasping': 2,
                      'rode': 1,
                      'Parks': 1,
                      'tact': 1,
                      'tasted': 3,
                      'Club': 3,
                      'aroma': 1,
                      "What's": 5,
                      'drive-in': 2,
                      'apprehension': 1,
                      'Alison': 1,
                      'coldly': 2,
                      'urgent': 1,
                      'land': 2,
                      'growled': 3,
                      'Much': 2,
                      'therefore': 3,
                      'kicking': 1,
                      'fifty-five': 1,
                      'files': 4,
                      'emptier': 1,
                      'wires': 1,
                      'swamp': 1,
                      'could': 141,
                      'enjoying': 2,
                      'turkey': 2,
                      'audience': 6,
                      '275': 1,
                      'slippers': 2,
                      'stopped': 16,
                      'by-product': 1,
                      'admit': 4,
                      'ribbing': 1,
                      'delusion': 1,
                      'armpit': 1,
                      'registrations': 1,
                      'sequence': 1,
                      'okay': 3,
                      'Lila': 2,
                      'ointment': 1,
                      'wise': 2,
                      'search': 4,
                      'delinquency': 1,
                      'optimism': 1,
                      'interested': 5,
                      'wagged': 1,
                      'meant': 9,
                      'reek': 1,
                      'intent': 1,
                      'tiny': 5,
                      'seepage': 1,
                      'Jerry': 4,
                      'Dumont': 8,
                      'picture': 14,
                      'doomed': 1,
                      "shouldn't": 1,
                      'bake-oven': 1,
                      'conceive': 1,
                      'full': 12,
                      'performance': 2,
                      'count': 2,
                      'stairway': 1,
                      'slipped': 5,
                      'equally': 1,
                      'buildings': 6,
                      'between': 26,
                      'cost': 4,
                      'blonde': 1,
                      'widowed': 1,
                      'Chicagoans': 1,
                      'Gold': 1,
                      'Kidnapper': 1,
                      'beam': 1,
                      'report': 8,
                      'ring': 1,
                      'co-workers': 1,
                      'Espanol': 1,
                      'appear': 1,
                      'metal': 7,
                      'fine-boned': 1,
                      'Roman': 1,
                      'recognize': 1,
                      'improve': 1,
                      'volley': 1,
                      'marked': 5,
                      'Anderson': 2,
                      'wagon': 10,
                      'highboy': 1,
                      'active': 3,
                      'moves': 2,
                      'higher': 2,
                      'rousing': 1,
                      'deviate': 1,
                      'cabinets': 1,
                      'eyes': 48,
                      'Security': 1,
                      'Chapter': 1,
                      'filled': 5,
                      'energy': 2,
                      'Stanley': 26,
                      'nod': 1,
                      'firmly': 2,
                      'Until': 1,
                      'idiosyncrasies': 1,
                      'Soak': 1,
                      'gallstone': 1,
                      'backed': 8,
                      'heredity': 1,
                      'bedsprings': 1,
                      'Domokous': 2,
                      'sorrows': 1,
                      'fruit': 1,
                      'cameras': 1,
                      'Acourse': 1,
                      'chill': 1,
                      'nowhere': 1,
                      'dictating': 1,
                      'brakes': 2,
                      'Diets': 1,
                      'they': 106,
                      'done': 11,
                      'distinctly': 2,
                      'wiping': 1,
                      'instructed': 2,
                      'crests': 1,
                      'banks': 1,
                      'may': 13,
                      'The': 244,
                      'casual': 2,
                      'promise': 1,
                      'downstairs': 3,
                      'coincidence': 2,
                      'unjustified': 1,
                      'Boston': 1,
                      'specialist': 1,
                      'of': 903,
                      'psychopathic': 2,
                      'noisily': 1,
                      'bowl': 1,
                      'excruciating': 1,
                      'chickens': 3,
                      'trays': 1,
                      'law': 4,
                      'trickling': 1,
                      'trapped': 2,
                      'autos': 1,
                      'nearing': 1,
                      'stained': 2,
                      'Larry': 2,
                      'defense': 1,
                      'quarter': 3,
                      'appearances': 1,
                      'superstitious': 1,
                      'guess': 11,
                      'punishes': 1,
                      'Beauclerk': 8,
                      'rear': 4,
                      'bloody': 1,
                      'Kinda': 1,
                      'Croydon': 1,
                      'possum': 1,
                      'noticeable': 1,
                      "landlord's": 1,
                      'felt': 40,
                      'awkwardly': 1,
                      'beaten': 1,
                      'separate': 2,
                      'lived': 15,
                      'climbed': 8,
                      'Deal': 1,
                      'disliking': 1,
                      'seized': 3,
                      'vet': 1,
                      'mingled': 1,
                      'performer': 1,
                      'lack': 4,
                      'reprehensible': 1,
                      'Alfredo': 2,
                      'mentioned': 3,
                      'malice': 1,
                      "ladies'": 1,
                      'enviously': 1,
                      'maid': 2,
                      'Hall-Mills': 1,
                      'hate': 2,
                      'landlord': 8,
                      'bride': 7,
                      'enjoyed': 2,
                      'musical': 1,
                      'cetera': 1,
                      'assembled': 1,
                      'L.': 1,
                      'impulse': 1,
                      'crashed': 1,
                      'peculiar': 3,
                      'display': 3,
                      'too': 72,
                      'advisability': 1,
                      'advancing': 1,
                      'Supplies': 1,
                      'Theater': 1,
                      "Burton's": 2,
                      'confirmed': 1,
                      'sneaking': 2,
                      'entry': 1,
                      'overhead': 1,
                      'velour': 1,
                      'outside': 19,
                      'more-than-average': 1,
                      'hung': 12,
                      'arc': 1,
                      'faces': 8,
                      'language': 1,
                      'book': 3,
                      'jaws': 1,
                      'stains': 1,
                      'somewhat': 3,
                      'slightly': 5,
                      'screwball': 1,
                      'following': 3,
                      'suggested': 4,
                      'crack': 4,
                      'sompin': 1,
                      'event': 2,
                      'unlacing': 1,
                      'Washington': 1,
                      'cohesion': 1,
                      'instantly': 3,
                      'recommendation': 1,
                      'jacket': 7,
                      'Seeing': 2,
                      'how': 37,
                      'sullen': 2,
                      'heartbeat': 2,
                      'occupants': 3,
                      'ticket': 3,
                      'above': 6,
                      'hyped-up': 1,
                      'equidistant': 1,
                      'volumes': 3,
                      'LaSalle': 1,
                      'give': 23,
                      'clumsy': 2,
                      'clamped': 2,
                      'implications': 1,
                      'Richards': 2,
                      'Mister': 1,
                      'upset': 2,
                      'seeing': 6,
                      'beauty': 2,
                      'odor': 3,
                      'aunts': 1,
                      'minute': 10,
                      'exuded': 1,
                      'Payne': 8,
                      'blared': 1,
                      'blushing': 1,
                      'minister': 12,
                      'glove': 1,
                      'tunnel': 6,
                      'bothering': 2,
                      'tire': 1,
                      'thermometer': 1,
                      'gradually': 1,
                      'waving': 3,
                      'nakedly': 1,
                      'overnighters': 3,
                      'convenience': 3,
                      'allow': 2,
                      'rag': 2,
                      'drinks': 6,
                      'library': 1,
                      'payments': 1,
                      'Aj': 2,
                      'identified': 2,
                      'staggeringly': 1,
                      'gauged': 1,
                      'racking': 1,
                      'wander': 1,
                      'ounce': 1,
                      'actually': 4,
                      'piteous': 1,
                      'advice': 3,
                      'tightly': 2,
                      'Angelo': 5,
                      "sheriff's": 2,
                      'explanation': 1,
                      'Jeep': 9,
                      'juice': 2,
                      'preserve': 2,
                      'after': 36,
                      'heightened': 1,
                      'questioning': 2,
                      'thought': 54,
                      'camera': 1,
                      'ward': 3,
                      'reached': 16,
                      'Very': 2,
                      'anyway': 9,
                      'Fox': 2,
                      'cornmeal': 1,
                      'on': 421,
                      'response': 1,
                      'lawyer': 3,
                      'conviction': 1,
                      'convincing': 1,
                      'cousin': 1,
                      'Point': 3,
                      'Our': 3,
                      'Perhaps': 5,
                      'confused': 2,
                      'morosely': 1,
                      'Shall': 3,
                      'schooner': 1,
                      'quarry': 2,
                      'big': 32,
                      'plant': 1,
                      'earnestly': 2,
                      'fetid': 1,
                      'salvaging': 1,
                      'clearly': 3,
                      'attempted': 1,
                      'completely': 7,
                      'interrupted': 5,
                      'wooden': 3,
                      'cold': 14,
                      'freak': 2,
                      'punk': 1,
                      'financial': 2,
                      'opened': 20,
                      'still': 66,
                      'left': 56,
                      'hats': 2,
                      'normally': 1,
                      'bellows': 1,
                      'chaise': 1,
                      'utterly': 3,
                      'hip': 2,
                      'one-way': 1,
                      'twenty-three': 2,
                      'mere': 1,
                      "Dronk's": 1,
                      'warm': 1,
                      'connecting': 1,
                      'proclaim': 1,
                      'stood': 41,
                      'worst': 2,
                      'electronic': 4,
                      'Ferrell': 3,
                      'once-over': 1,
                      'yuh': 1,
                      'Demanded': 1,
                      'drinking': 6,
                      'impact': 2,
                      'alias': 1,
                      'skull': 1,
                      'glasses': 2,
                      'street': 36,
                      'conscientious': 1,
                      'grip': 2,
                      'orbiting': 1,
                      'long': 39,
                      'hemmed': 1,
                      'utmost': 1,
                      'sweaty': 1,
                      'alimony': 1,
                      'icicle': 1,
                      'slob': 1,
                      'worth': 5,
                      'cops': 13,
                      'destination': 1,
                      'bounded': 1,
                      'easiest': 1,
                      'wary': 1,
                      'Service': 1,
                      'got': 77,
                      'regulation': 1,
                      'town': 8,
                      'now': 83,
                      'wrong': 9,
                      'Drink': 1,
                      'azaleas': 1,
                      'smart': 3,
                      'delicious': 1,
                      'lady': 3,
                      'blowing': 1,
                      'cab': 4,
                      'suddenly': 19,
                      'transoms': 1,
                      'gets': 6,
                      'McIver': 3,
                      'specific': 1,
                      'torn': 2,
                      'bandaged': 2,
                      'lay': 13,
                      'enclosed': 1,
                      'three': 33,
                      'Anyone': 1,
                      "Day's": 1,
                      'figured': 5,
                      'shut': 10,
                      'consulted': 2,
                      'demand': 1,
                      'powder': 3,
                      'stock': 3,
                      'Reckon': 1,
                      'Lindsay': 1,
                      'Kimball': 1,
                      'rooms': 8,
                      'serpents': 1,
                      'San': 5,
                      'Concord': 1,
                      'clearer': 1,
                      'Las': 3,
                      'Scholarship': 1,
                      'extended': 3,
                      'taunt': 1,
                      'emphatically': 1,
                      'Added': 1,
                      'choice': 1,
                      'questioned': 1,
                      'one-thirty': 2,
                      'odd': 3,
                      'gunning': 1,
                      'stretched': 8,
                      'suave': 1,
                      'joy': 2,
                      'detective': 3,
                      'Comfortably': 1,
                      'hole': 1,
                      'tails': 3,
                      'zipper': 1,
                      'sidled': 1,
                      'fluttered': 1,
                      'car': 69,
                      'eerie': 1,
                      'west': 5,
                      'personal': 2,
                      'cap': 1,
                      'seriously': 3,
                      'discovered': 2,
                      'sigue': 1,
                      'afternoons': 1,
                      'salad': 1,
                      'present': 5,
                      'galley': 1,
                      'necessary': 3,
                      'hit': 16,
                      'treasure': 1,
                      'be': 233,
                      'depressing': 1,
                      'must': 30,
                      'racing': 2,
                      'names': 9,
                      'helpful': 1,
                      'ready': 9,
                      'faintest': 2,
                      'corpses': 1,
                      ...}),
             'news': Counter({'Without': 4,
                      'festival': 2,
                      '5-run': 2,
                      'Motorists': 1,
                      'builders': 7,
                      'taxing': 1,
                      'More': 12,
                      'tribunals': 1,
                      '23': 9,
                      'Anthony': 1,
                      'assent': 1,
                      'speak': 3,
                      'located': 6,
                      'economics': 3,
                      'contented': 1,
                      'education': 31,
                      'house': 26,
                      'promises': 3,
                      'skilled': 1,
                      'conspirators': 2,
                      'officiating': 1,
                      'overdeveloped': 1,
                      'Gerald': 3,
                      'tube': 1,
                      'Wilmette': 3,
                      'building': 15,
                      'Adams': 1,
                      'Exploratory': 1,
                      'waters': 4,
                      'Arvey': 1,
                      'ex-National': 1,
                      'Minnie': 2,
                      'Between': 1,
                      'shouldda': 1,
                      'threshold': 3,
                      'junction': 1,
                      'restrain': 1,
                      'operating': 4,
                      'drafts': 2,
                      'LSU': 1,
                      'use': 31,
                      "bridegroom's": 2,
                      'authoritative': 1,
                      'thoroughfare': 1,
                      'Olivia': 1,
                      'experience': 6,
                      'endure': 2,
                      'served': 17,
                      'committee': 38,
                      'Norm': 2,
                      'bars': 1,
                      'sports': 6,
                      'Feis': 1,
                      'population': 13,
                      'Sr.': 1,
                      "Baltimore's": 2,
                      'Mickey': 12,
                      'thoroughly': 1,
                      'nomination': 6,
                      'Omsk': 1,
                      'views': 7,
                      'Much': 2,
                      'plunder': 1,
                      'forum': 1,
                      "Mayor's": 2,
                      'abrupt': 1,
                      'certain': 18,
                      'season': 43,
                      'Gardner': 1,
                      'Coopers': 1,
                      'Melcher': 1,
                      'glutted': 1,
                      'Continental': 7,
                      'shade': 1,
                      'others': 15,
                      'budgets': 2,
                      'Durwood': 1,
                      'vicinity': 1,
                      'manufacturers': 10,
                      'Patrolmen': 1,
                      'Polls': 1,
                      '6.5': 1,
                      'Callan': 1,
                      'Zoe': 1,
                      'McN.': 1,
                      'Milwaukee': 4,
                      'beautiful': 5,
                      'gesture': 2,
                      'intimidation': 1,
                      'sixth': 10,
                      'crypt': 1,
                      'august': 1,
                      'marriages': 4,
                      'head-and-shoulders': 1,
                      'all-important': 1,
                      'self-respect': 1,
                      'Couturier': 1,
                      'denying': 1,
                      'Schrunk': 1,
                      'handy': 1,
                      'Reputedly': 1,
                      'cheer': 2,
                      'retains': 1,
                      "countin'": 1,
                      'imposition': 1,
                      'believes': 8,
                      'Stevens': 4,
                      'constituents': 1,
                      'chic': 4,
                      'chicken': 4,
                      'determine': 3,
                      'drab': 1,
                      'barrage': 1,
                      'Loen': 1,
                      'goaded': 1,
                      'pitchers': 4,
                      'correct': 3,
                      'corporation': 7,
                      'kitchen': 5,
                      'amount': 18,
                      '$7,500,000': 1,
                      'crack': 2,
                      'omitted': 1,
                      'Called': 1,
                      'pressure': 12,
                      'Md.': 2,
                      'Chinese': 2,
                      '2-score-year': 1,
                      'darling': 1,
                      '22-acre': 1,
                      'amended': 1,
                      'adviser': 3,
                      'perturbed': 1,
                      'won-lost': 1,
                      "wouldn't": 2,
                      'scholastic': 2,
                      'hard-hit': 1,
                      'scratching': 1,
                      'through': 54,
                      'classical': 3,
                      'Dallas-based': 1,
                      'halftime': 1,
                      'nationalism': 2,
                      'Just': 7,
                      'specific': 4,
                      'each': 61,
                      'Because': 4,
                      'Plus': 2,
                      'Ah': 2,
                      'Justice': 1,
                      '13-1/2': 1,
                      'tons': 1,
                      'Dennis': 1,
                      'stakes': 1,
                      'menaced': 1,
                      'Walters': 1,
                      'France': 7,
                      'inviolate': 1,
                      'ringsiders': 1,
                      'ceased': 1,
                      "yesterday's": 1,
                      'Democrats': 11,
                      'Pitcher': 1,
                      'cottage': 1,
                      'raised': 8,
                      'Todd': 1,
                      'place-kicker': 1,
                      'stream': 3,
                      'revealed': 1,
                      'Ernst': 1,
                      'enunciate': 1,
                      'grave': 2,
                      'Sunnyvale': 3,
                      'believing': 2,
                      'keyhole': 1,
                      'second-degree': 2,
                      'Hinsdale': 1,
                      'situations': 2,
                      'summoned': 4,
                      'blackout': 1,
                      'Doris': 1,
                      'At': 38,
                      'roundup': 1,
                      "I'm": 18,
                      'deodorant': 2,
                      'focused': 1,
                      'patrolman': 1,
                      '$17': 1,
                      'colleagues': 1,
                      'balconies': 1,
                      'acquittal': 2,
                      'scoop': 1,
                      'Those': 8,
                      'adoption': 2,
                      'rejection': 3,
                      'CBS': 1,
                      'Buddy': 1,
                      'Nelson': 4,
                      'Manville': 1,
                      'career': 10,
                      'baggage': 2,
                      'husband': 13,
                      'friends': 11,
                      'bet': 2,
                      'shouting': 1,
                      'worse': 6,
                      "Hallowell's": 1,
                      'slip': 1,
                      'negative': 2,
                      'Suddenly': 1,
                      'Afterwards': 1,
                      'killed': 5,
                      'small': 27,
                      'Television-Electronics': 1,
                      'bull': 1,
                      'Ruling': 1,
                      'enemy': 3,
                      'continue': 11,
                      'Equator': 1,
                      'understatement': 1,
                      'quo': 1,
                      'Peking': 1,
                      'Hamey': 1,
                      'neckline': 2,
                      'recession': 3,
                      'Worth': 3,
                      'missiles': 3,
                      'recently': 11,
                      'tournament': 13,
                      'enforced': 3,
                      'steamship': 1,
                      'legislation': 14,
                      'next': 40,
                      'Ghormley': 1,
                      'propeller': 1,
                      'sly': 1,
                      'reared': 1,
                      'disagreed': 2,
                      'strike': 12,
                      'four': 73,
                      'Larson': 5,
                      'Marines': 1,
                      'really': 13,
                      'peoples': 1,
                      'singled': 6,
                      'Foliage': 1,
                      'inflicted': 1,
                      'ideology': 1,
                      'comforting': 1,
                      'rushed': 2,
                      'borrowed': 2,
                      'uncorked': 1,
                      'signals': 1,
                      'effect': 17,
                      'senator': 3,
                      'Pete': 2,
                      'exact': 1,
                      'NBC': 2,
                      'tulle': 1,
                      'whites': 2,
                      "Pirates'": 1,
                      "Lenobel's": 1,
                      '187.5': 1,
                      'smoother': 1,
                      'cell': 2,
                      'accounted': 2,
                      'King': 3,
                      'vice-chairman': 1,
                      'artists': 5,
                      'brokers': 2,
                      'lag': 1,
                      '17,000': 1,
                      "Berry's": 1,
                      'swearing-in': 1,
                      'spending': 9,
                      'confronting': 2,
                      'Amateur': 1,
                      'prisoners': 1,
                      'probation': 6,
                      'twelve': 1,
                      'Parmer': 1,
                      'Joseph': 18,
                      'Culbertson': 1,
                      'tip': 1,
                      'Bureau': 2,
                      'daylight': 1,
                      'Carmichael': 1,
                      'colts': 1,
                      'Camilla': 1,
                      'clerk': 4,
                      'Nori': 1,
                      'regretted': 1,
                      '675': 1,
                      'Michaels': 1,
                      '687.87': 1,
                      'assets': 1,
                      'intends': 1,
                      'ask': 7,
                      'found': 30,
                      'hope': 11,
                      'writes': 1,
                      'win': 10,
                      'statutory': 2,
                      'schedule': 9,
                      'dug': 2,
                      'scholarships': 4,
                      'Haynes': 1,
                      'mother': 15,
                      'Galt': 1,
                      'Jorge': 1,
                      'things': 10,
                      'Now': 11,
                      'thank': 1,
                      'Huntingtons': 1,
                      'shortcuts': 1,
                      'alienated': 1,
                      'Ponce': 1,
                      'department': 12,
                      'colorful': 1,
                      'independent': 3,
                      'February': 10,
                      'chow': 1,
                      '3-0': 1,
                      'exported': 2,
                      '4:05': 1,
                      'yardstick': 1,
                      'permission': 2,
                      'divided': 4,
                      'reconstruction': 1,
                      'Ky.': 2,
                      '18': 20,
                      'Grove': 4,
                      'inaugural': 1,
                      'stops': 2,
                      "fund's": 3,
                      'sir': 1,
                      'bat': 6,
                      'boosting': 1,
                      'Fra': 1,
                      'investigations': 1,
                      'advisers': 7,
                      'one-third': 1,
                      'brethren': 1,
                      'pricing': 3,
                      'snack': 1,
                      'fairway': 5,
                      'constituted': 4,
                      'our': 55,
                      'Wright': 1,
                      'German': 8,
                      'Molly': 2,
                      'Nairne': 2,
                      'Bassi': 1,
                      'Welfare': 3,
                      'neighbor': 4,
                      'submarines': 5,
                      'signal': 1,
                      'vehicles': 2,
                      'battalions': 1,
                      '1959': 21,
                      'grabbed': 2,
                      'musicians': 3,
                      'managing': 2,
                      'McEachern': 1,
                      "county's": 1,
                      'taper': 1,
                      'Owls': 1,
                      'Attendance': 1,
                      'deductible': 2,
                      'ordinarily': 1,
                      'Latinovich': 1,
                      'Cox': 4,
                      'outset': 1,
                      'Gomez': 1,
                      'required': 11,
                      'Cadillac': 1,
                      'milestone': 2,
                      'Crumlish': 1,
                      'wages': 11,
                      'surpassed': 1,
                      "players'": 1,
                      'candid': 1,
                      'Lord': 2,
                      'Division': 5,
                      'chicanery': 1,
                      'crews': 1,
                      '1961': 42,
                      'strip': 4,
                      '6th': 1,
                      'hunting': 2,
                      'Joplin': 1,
                      'resource': 1,
                      'run': 35,
                      'discussed': 6,
                      'speedy': 1,
                      'stores': 4,
                      'I.': 7,
                      'Moments': 1,
                      'demanded': 3,
                      'breathes': 1,
                      'effected': 1,
                      'Evans': 1,
                      'grandchildren': 2,
                      'Cocktails': 2,
                      'Opposition': 1,
                      '72-hole': 1,
                      'Shiflett': 2,
                      'themselves': 19,
                      'developing': 4,
                      'academic': 10,
                      'Luette': 1,
                      'Deputy': 2,
                      "Wednesday's": 1,
                      'Supervisor': 1,
                      'Vernava': 3,
                      'eligible': 4,
                      'pecan': 1,
                      'capitalist': 1,
                      'positive': 2,
                      'Santa': 6,
                      'parole': 4,
                      'drunk': 1,
                      'Herridge': 1,
                      'Howard': 13,
                      'newsletter': 1,
                      'Ted': 4,
                      'elegant': 2,
                      'reader': 1,
                      'thorny': 1,
                      'Maris': 36,
                      'Second': 5,
                      'Stamford': 2,
                      'sophomore': 3,
                      'pulled': 3,
                      'blacked': 1,
                      'background': 3,
                      'continuous': 1,
                      'naughtier': 1,
                      'Taxation': 1,
                      'transition': 3,
                      'sentences': 1,
                      'methods': 10,
                      'visited': 4,
                      'loaded': 4,
                      '$5': 2,
                      'visa': 1,
                      'disposed': 1,
                      'Also': 9,
                      'radio': 6,
                      'coat': 2,
                      'Agencies': 1,
                      'delivery': 1,
                      'vindication': 2,
                      'trees': 2,
                      'melee': 2,
                      'Elmer': 1,
                      "they're": 5,
                      'Cotty': 1,
                      'brochures': 1,
                      'Oscar': 2,
                      'kickoff': 2,
                      'exchange': 5,
                      'Frederick': 3,
                      'due': 9,
                      'Fletcher': 1,
                      'congratulated': 2,
                      'Sheldon': 4,
                      'Turnpike': 6,
                      'plea': 3,
                      'warmed': 3,
                      'checked': 2,
                      'problem': 39,
                      'freer': 1,
                      'Morocco': 2,
                      'pulse': 1,
                      'modification': 1,
                      'Summer': 1,
                      'winter': 8,
                      'distance': 2,
                      '5000': 1,
                      'percentage': 4,
                      'receive': 15,
                      'Gauer': 2,
                      'Trustees': 2,
                      'Mortgage': 1,
                      'specialize': 1,
                      'notarized': 1,
                      'informal': 4,
                      'Collins': 4,
                      'looking': 7,
                      '37,679': 1,
                      'catchers': 1,
                      'picker': 1,
                      'British': 24,
                      '255': 1,
                      'Operating': 2,
                      'Time': 6,
                      'humor': 4,
                      "John's": 1,
                      'heyday': 1,
                      'Hat': 1,
                      'Achaeans': 1,
                      'delegation': 3,
                      'Briar': 1,
                      "We're": 3,
                      'collected': 6,
                      'Samoa': 1,
                      'Big': 4,
                      'attract': 2,
                      'Kimpton': 4,
                      'ranked': 2,
                      'Birgit': 1,
                      'pointed': 9,
                      'guiding': 1,
                      'Usually': 1,
                      'determination': 3,
                      'amendment': 5,
                      'evacuation': 1,
                      'slowly': 3,
                      'place': 25,
                      'piece': 3,
                      'Korman': 1,
                      'garrulous': 1,
                      'moldboard': 1,
                      'Kelly': 1,
                      "Princes'": 1,
                      'committees': 4,
                      'payroll': 13,
                      'bombers': 1,
                      'underestimate': 1,
                      'first': 143,
                      'Stetsons': 1,
                      'atmosphere': 5,
                      'union': 21,
                      'urgent': 3,
                      'track': 10,
                      'ex-gambler': 1,
                      'Salvatore': 1,
                      'gulf': 1,
                      'inflamed': 1,
                      '1859': 1,
                      'textile-producing': 2,
                      'announcements': 2,
                      'Natural': 1,
                      '1825': 1,
                      'instrument': 1,
                      'pork': 1,
                      'could': 86,
                      'Lawrence': 12,
                      'teenagers': 3,
                      'Federal': 17,
                      'gruonded': 1,
                      'slippers': 1,
                      'stopped': 6,
                      'Wilson': 6,
                      'aloud': 1,
                      'elaborate': 3,
                      'Revolutionary': 1,
                      'Along': 3,
                      'pride': 3,
                      'Soule': 1,
                      'Franklin': 8,
                      'twisting': 1,
                      'Neiman-Marcus': 4,
                      'Kanin': 1,
                      '13th': 5,
                      '111': 1,
                      'increasingly': 2,
                      'inflexible': 1,
                      'Ind.': 1,
                      'search': 3,
                      'infiltrating': 1,
                      'reflexes': 2,
                      'delinquency': 1,
                      'wagons': 1,
                      'optimism': 2,
                      'meant': 3,
                      'Monroe': 5,
                      'morals': 1,
                      'ivory': 1,
                      'luncheons': 1,
                      'forbids': 1,
                      'Successful': 1,
                      'businesses': 2,
                      'Dumont': 2,
                      'Petipa-Tschaikowsky': 1,
                      'picture': 7,
                      'cuisine': 1,
                      'guarding': 1,
                      'A.M.': 2,
                      'sympathetic': 1,
                      'unemployment': 1,
                      'Vital': 1,
                      'laurels': 1,
                      'Yogi': 2,
                      'convention': 2,
                      'Bourguiba': 1,
                      'buildings': 6,
                      'between': 47,
                      "Gannon's": 1,
                      '2269': 1,
                      'cost': 15,
                      'exempt': 2,
                      '1953': 5,
                      'R-Warren': 1,
                      'nailed': 1,
                      'Inter-American': 1,
                      'Retail': 1,
                      'outmoded': 1,
                      'eighth': 2,
                      'nine-game': 1,
                      'costlier': 1,
                      'masterpiece': 2,
                      'recognize': 1,
                      'improve': 4,
                      'lefthander': 2,
                      'feeds': 1,
                      'marked': 4,
                      'Anderson': 8,
                      '$15': 1,
                      'Pye': 1,
                      'higher': 26,
                      'lovely': 2,
                      'Dutch': 1,
                      'Boonton': 1,
                      'fast-grossing': 1,
                      'Cezannes': 1,
                      'Menderes': 2,
                      'golfing': 1,
                      'Security': 4,
                      'foliage': 1,
                      'Alpha': 1,
                      'Stanley': 2,
                      'residences': 1,
                      'penalized': 1,
                      'firmly': 5,
                      'Until': 5,
                      'marketing': 3,
                      'backed': 1,
                      'handicapped': 1,
                      'Achievement': 7,
                      'bateau': 2,
                      'Bay-front': 1,
                      'Neuberger': 1,
                      'Harveys': 1,
                      'Slate': 5,
                      'Golden': 3,
                      "carpenters'": 1,
                      'endowments': 2,
                      "union's": 2,
                      'hospitable': 1,
                      'dismissed': 1,
                      "writers'": 1,
                      'Campagnoli': 1,
                      'told': 53,
                      'Alice': 3,
                      'distinctly': 1,
                      'wiping': 1,
                      'Transportation': 1,
                      'monopolistic': 2,
                      'The': 806,
                      'promise': 3,
                      'D.': 24,
                      '5155': 1,
                      'Hodges': 7,
                      'sportsman': 1,
                      'Boy': 1,
                      'Ocean': 1,
                      'merited': 1,
                      'of': 2849,
                      'under': 83,
                      'Gateway': 1,
                      'court-appointed': 1,
                      '7.5': 1,
                      'celebrated': 2,
                      '47': 4,
                      '$10,000': 5,
                      'withdrawn': 1,
                      'law': 38,
                      'spiritual': 1,
                      'slugger': 3,
                      'bankers': 8,
                      'Innumerable': 1,
                      '1920s': 1,
                      'untold': 1,
                      'Stalling': 1,
                      'nearing': 4,
                      'forgetting': 1,
                      '553': 1,
                      'police': 31,
                      'Snodgrass': 1,
                      'inboard': 1,
                      "Angeles'": 2,
                      '155-yarder': 1,
                      'rear': 4,
                      'ova': 1,
                      'bustling': 1,
                      'metal': 1,
                      "Brooks's": 1,
                      'refocusing': 1,
                      'industrialist': 1,
                      'lived': 3,
                      'success': 4,
                      'wars': 2,
                      'resisting': 1,
                      'Sanger-Harris': 1,
                      'Eastland': 1,
                      'SMU': 5,
                      'murdered': 1,
                      'Sheets': 4,
                      'Brod': 1,
                      'Rylie': 1,
                      "ladies'": 1,
                      'maid': 3,
                      'low': 7,
                      "Beadles'": 1,
                      'Hudson': 2,
                      'unusually': 2,
                      'retaliating': 1,
                      'assembled': 1,
                      'announcing': 2,
                      'crabapple': 1,
                      'recovery': 9,
                      'peculiar': 2,
                      'display': 4,
                      'Cover': 1,
                      'Dearborn': 1,
                      'Fatima': 1,
                      'well': 43,
                      'Siberia': 2,
                      'orders': 7,
                      'bulwark': 1,
                      'Baltimorean': 1,
                      "Cotten's": 1,
                      'lucky': 1,
                      'factory': 3,
                      'entry': 2,
                      'overhead': 2,
                      'Dame': 5,
                      'outside': 10,
                      'Berger': 5,
                      '1630': 1,
                      'arc': 1,
                      '355': 1,
                      'book': 13,
                      'subscribe': 1,
                      'recommends': 1,
                      '101b': 1,
                      'Eire': 1,
                      'unrest': 1,
                      'planes': 2,
                      'suggested': 8,
                      'Basic': 1,
                      'begging': 1,
                      'Grover': 4,
                      "Grizzlies'": 1,
                      'employees': 11,
                      'mentions': 1,
                      'Notre': 5,
                      'Renaissance': 1,
                      'arrive': 5,
                      'jacket': 2,
                      'nightly': 1,
                      'bridesmaids': 1,
                      'intend': 3,
                      'pars': 3,
                      'presents': 1,
                      'Engaging': 2,
                      'purpose': 9,
                      'broadened': 1,
                      'give': 32,
                      'deferred': 1,
                      'absolute': 1,
                      'Hartman': 4,
                      'membership': 5,
                      'monopolies': 3,
                      'spotlights': 1,
                      'Richards': 1,
                      'Funeral': 5,
                      'dishes': 1,
                      'housing': 14,
                      'beauty': 4,
                      'colonialist': 1,
                      'battery': 1,
                      'Bellows': 5,
                      'Stetson': 1,
                      'Beatrice': 1,
                      'flatly': 1,
                      'glove': 4,
                      'blond': 2,
                      'Richard': 27,
                      'semester': 3,
                      'Branum': 1,
                      '18,792': 1,
                      '1966': 3,
                      'dynamic': 4,
                      'Decries': 1,
                      'applying': 1,
                      'allow': 5,
                      'critical': 6,
                      'Superior': 5,
                      'correctness': 1,
                      'Werner': 5,
                      '3505o': 1,
                      'Aj': 14,
                      'hoodlums': 2,
                      'boost': 6,
                      "Dresbachs'": 1,
                      'missing': 3,
                      'actually': 10,
                      'Several': 5,
                      'economist': 3,
                      'explanation': 3,
                      'replied': 5,
                      'juice': 1,
                      'Billy': 3,
                      'after': 127,
                      'Sees': 1,
                      'homers': 3,
                      'Hewlett-Woodmere': 1,
                      'questioning': 1,
                      'assailant': 1,
                      'ward': 7,
                      'reached': 15,
                      'anyway': 3,
                      'Balkanizing': 1,
                      '469': 1,
                      'response': 5,
                      'Japan': 3,
                      'bargain': 1,
                      'compared': 6,
                      'authority': 8,
                      'Point': 3,
                      'Perhaps': 3,
                      'grace': 2,
                      'jury-tampering': 1,
                      'Audubon': 3,
                      'troopships': 1,
                      'football': 14,
                      'strikingly': 1,
                      'big': 38,
                      'Juvenile': 3,
                      'Sidney': 2,
                      '180-degrees': 1,
                      'services': 18,
                      'nourished': 1,
                      'Cod': 2,
                      'attempted': 5,
                      'completely': 6,
                      'journey': 2,
                      'C': 4,
                      'wooden': 1,
                      'cold': 6,
                      'opened': 9,
                      'resumption': 2,
                      'Elliott': 1,
                      'rookies': 2,
                      'radioactive': 1,
                      'blabbed': 1,
                      'Baker': 6,
                      'fate': 1,
                      'Holy': 2,
                      "Tuesday's": 1,
                      'satin': 4,
                      'stood': 4,
                      'worst': 2,
                      'Boxwood': 1,
                      'lend': 1,
                      'instrumentals': 1,
                      'Milenoff': 1,
                      'Washington-Alexandria': 1,
                      'street': 5,
                      'Horstman': 1,
                      'Gets': 1,
                      '158-pounder': 1,
                      'gas-fired': 1,
                      'Parker': 5,
                      'Belleville': 1,
                      '11,744': 1,
                      'Deloris': 1,
                      'inlaid': 1,
                      'current': 13,
                      'View': 2,
                      'senseless': 1,
                      'Grayson': 1,
                      'fill': 3,
                      'Redevelopment': 3,
                      'Section': 4,
                      'expectations': 1,
                      'Sybert': 1,
                      'Allegretti': 1,
                      'modernized': 1,
                      'accuse': 1,
                      'town': 18,
                      'fulfilled': 3,
                      '$700': 2,
                      'Economically': 1,
                      'settling': 1,
                      'now': 76,
                      'Zinman': 1,
                      'solos': 1,
                      'Bonanza': 1,
                      'full-fledged': 1,
                      'dramatic': 8,
                      'erupts': 1,
                      'bestowal': 1,
                      'Cooperation': 1,
                      'cab': 3,
                      'domination': 1,
                      'De': 11,
                      'Yankee': 6,
                      '4-year-old': 1,
                      "we'll": 4,
                      'lay': 5,
                      'Belanger': 1,
                      'Carson': 1,
                      'newcomers': 1,
                      'loss': 11,
                      'three': 97,
                      "Atlanta's": 4,
                      'automatic': 5,
                      'consulted': 4,
                      'demand': 17,
                      'batch': 1,
                      'stock': 21,
                      'Garza': 1,
                      'Inauguration': 3,
                      'Spring': 1,
                      'By-passing': 1,
                      'alcoholics': 2,
                      'downed': 1,
                      '$28': 4,
                      'San': 22,
                      'furnish': 2,
                      'presidential': 5,
                      'wage': 3,
                      'split-level': 1,
                      'industrial': 5,
                      'Kunkel': 2,
                      'Convenience': 2,
                      'pure': 2,
                      'organize': 2,
                      'extended': 5,
                      'Bernhard': 1,
                      "Louis's": 1,
                      'Screw': 1,
                      'Giacomo': 1,
                      'FBI': 5,
                      'Actor-Crooner': 1,
                      'Slow': 1,
                      "Wouldn't": 1,
                      'ineffectual': 1,
                      'motion': 2,
                      'onetime': 1,
                      'car': 50,
                      'Mansion': 1,
                      'lucrative': 1,
                      'cap': 1,
                      'completions': 3,
                      '$30': 1,
                      'A.': 38,
                      '3-month': 1,
                      'Hyannis': 2,
                      'NLRB': 1,
                      "Hyde's": 1,
                      'present': 30,
                      'describes': 1,
                      'throws': 2,
                      'well-played': 1,
                      '$58,918': 1,
                      'science': 7,
                      'Merit': 1,
                      'officiated': 3,
                      'receives': 1,
                      'be': 526,
                      'wealth': 2,
                      'alternative': 4,
                      'racing': 2,
                      'names': 6,
                      'ready': 12,
                      'Exhibition': 1,
                      'baseballs': 1,
                      'Crawford': 2,
                      'cotton-growing': 1,
                      'bonding': 1,
                      'wed': 2,
                      'adjoining': 2,
                      'shares': 19,
                      'engaged': 3,
                      "Musician's": 1,
                      '220-yard': 1,
                      'Journal-American': 1,
                      'conclusion': 3,
                      'ant': 3,
                      'reasons': 12,
                      'auto': 3,
                      'janitor': 1,
                      'minutes': 25,
                      'Baseball': 4,
                      'Missiles': 1,
                      ...}),
             'religion': Counter({'resented': 1,
                      'injustice': 1,
                      'argument': 4,
                      'Seminary': 2,
                      'read': 7,
                      '23': 4,
                      'entails': 2,
                      'speak': 9,
                      'Wonderland': 1,
                      'dost': 1,
                      'grow': 4,
                      'entities': 4,
                      'vein': 1,
                      'education': 2,
                      'house': 3,
                      'believes': 5,
                      'note': 2,
                      'adultery': 1,
                      "B.B.C.'s": 1,
                      'garb': 1,
                      'building': 1,
                      'Adams': 2,
                      'Boards': 3,
                      'generation': 6,
                      'promptly': 1,
                      'pastoral': 1,
                      'morally': 1,
                      'tracks': 1,
                      'restrain': 1,
                      'operating': 1,
                      'use': 16,
                      'than': 69,
                      'wiped': 1,
                      'killed': 1,
                      'shifts': 1,
                      'recurrence': 1,
                      'experience': 27,
                      'applies': 1,
                      'endure': 1,
                      'served': 4,
                      'committee': 1,
                      'ailments': 3,
                      'Norm': 1,
                      'bars': 1,
                      'population': 10,
                      'rents': 1,
                      'Lucifer': 1,
                      'trip': 4,
                      'typewriter': 1,
                      'anyhow': 1,
                      'has': 111,
                      'letter': 5,
                      'surrendering': 1,
                      'pleading': 3,
                      'pragmatic': 1,
                      'maintenance': 2,
                      'our': 77,
                      'along': 5,
                      'certain': 17,
                      'wages': 3,
                      'zeal': 1,
                      'God': 131,
                      'inward': 1,
                      'demanded': 1,
                      'others': 13,
                      'stronghold': 1,
                      'editorial': 4,
                      'torrent': 1,
                      'strains': 1,
                      'transition': 1,
                      'grew': 2,
                      'Islamic': 2,
                      'contributes': 1,
                      'Diario': 2,
                      'beautiful': 5,
                      'gesture': 2,
                      'Ruth': 1,
                      'sixth': 1,
                      'mailing': 1,
                      'Greenleaf': 1,
                      'marriages': 1,
                      'first': 25,
                      'self-respect': 1,
                      'refuse': 3,
                      'Eternal': 1,
                      'admit': 4,
                      'developed': 5,
                      'ratiocinating': 1,
                      'Shakya': 1,
                      'seed': 1,
                      'promises': 3,
                      'hell-bound': 1,
                      'dominating': 1,
                      'Pythagoreans': 1,
                      'wall': 4,
                      'page': 3,
                      'instructed': 2,
                      'determine': 4,
                      'under': 19,
                      'correct': 3,
                      'reborn': 1,
                      'proportion': 3,
                      'Peace': 1,
                      'metal': 1,
                      'kitchen': 1,
                      "''": 266,
                      'amount': 2,
                      'orders': 2,
                      'ripples': 3,
                      'pressure': 4,
                      'Creator': 3,
                      'rejections': 1,
                      'reciting': 1,
                      'directed': 1,
                      'Berlin': 3,
                      'dynamic': 2,
                      'hear': 5,
                      'man': 64,
                      'redeemed': 1,
                      'evils': 3,
                      'Loyalty': 1,
                      'erased': 1,
                      'awe': 1,
                      'through': 48,
                      'Section': 1,
                      'Following': 1,
                      'never-to-be-forgotten': 1,
                      'anew': 1,
                      '1850': 1,
                      'nationalism': 3,
                      'Just': 2,
                      'specific': 3,
                      'each': 30,
                      'Because': 6,
                      'battle': 1,
                      'Delaware': 2,
                      'proclamation': 1,
                      'Sponsors': 1,
                      'motion': 2,
                      'tons': 2,
                      'arrogance': 1,
                      'France': 1,
                      'it': 264,
                      'burden': 1,
                      'confident': 1,
                      'urns': 1,
                      'balanced': 2,
                      'freely': 2,
                      'raised': 4,
                      'alternation': 1,
                      'love': 13,
                      'meanings': 4,
                      'rise': 2,
                      'ate': 1,
                      'Schleiermacher': 1,
                      'grave': 4,
                      'repeated': 2,
                      'eventually': 3,
                      'astonishingly': 1,
                      'believing': 1,
                      'Center': 4,
                      'eternal': 7,
                      'features': 3,
                      'epitomized': 1,
                      'distrust': 2,
                      'service': 12,
                      'hard': 3,
                      'At': 7,
                      'walked': 2,
                      ')': 127,
                      'Sky-god': 1,
                      'nonchurchgoing': 1,
                      'mastery': 1,
                      'endures': 1,
                      'colleagues': 2,
                      'Take': 1,
                      'loudspeakers': 1,
                      'baby': 1,
                      'Those': 4,
                      'doctrines': 1,
                      'rejection': 2,
                      '1776': 1,
                      'inculcation': 1,
                      'drop': 2,
                      '$2,000': 1,
                      'calculated': 3,
                      'cause': 9,
                      'friends': 5,
                      'integral': 1,
                      'subtly': 2,
                      'shouting': 1,
                      'worse': 3,
                      'Teresa': 1,
                      'anti-slavery': 11,
                      'negative': 3,
                      'truthfully': 1,
                      'thoroughfare': 1,
                      'small': 13,
                      'literal': 1,
                      'create': 2,
                      'enemy': 4,
                      'wept': 1,
                      'recruits': 1,
                      'prophesied': 1,
                      'sect': 1,
                      'Cardinal': 3,
                      'revolutions': 1,
                      'detrimental': 2,
                      'forward': 2,
                      'recently': 4,
                      'training': 3,
                      'distinct': 1,
                      'thunder': 1,
                      'Heenan': 1,
                      'peas': 17,
                      'legislation': 2,
                      'next': 9,
                      'accelerating': 1,
                      'four': 10,
                      'badly': 1,
                      'searching': 1,
                      'Africans': 1,
                      'Len': 2,
                      'bronze': 1,
                      'broad': 1,
                      'stigma': 1,
                      'ball': 1,
                      'passed': 5,
                      'external': 2,
                      'castigation': 1,
                      'please': 1,
                      'ideology': 1,
                      'frivolity': 1,
                      'mountains': 3,
                      'rushed': 1,
                      'female': 2,
                      'efforts': 12,
                      'effect': 13,
                      'lived': 5,
                      'exact': 1,
                      'clothed': 1,
                      'thin': 1,
                      'sciences': 1,
                      'returning': 2,
                      'King': 1,
                      'Interfaith': 2,
                      'sputniks': 1,
                      'brokers': 2,
                      'recommending': 1,
                      'sanctimonious': 1,
                      'family': 20,
                      'ever-changing': 1,
                      'spending': 1,
                      'modernists': 1,
                      'realities': 3,
                      'twelve': 1,
                      'demoted': 1,
                      'paralleling': 1,
                      'contradiction': 1,
                      'fear': 24,
                      'hymn': 3,
                      'merits': 1,
                      'Communists': 1,
                      'dowry': 1,
                      'rice': 1,
                      'P': 3,
                      'theses': 1,
                      'pulpits': 1,
                      'destructive': 1,
                      'needed': 1,
                      'late': 4,
                      '``': 270,
                      'Communism': 2,
                      'systematically': 1,
                      'conversely': 1,
                      'personalities': 1,
                      'marveled': 1,
                      'ask': 9,
                      'urging': 1,
                      'found': 13,
                      'Realtor': 1,
                      'triumph': 1,
                      'unction': 1,
                      'included': 1,
                      'Nevertheless': 4,
                      "one's": 4,
                      'warmly': 1,
                      'mistakes': 1,
                      'wonders': 2,
                      'ethos': 1,
                      'mother': 3,
                      'things': 23,
                      'Now': 12,
                      'Nonconformists': 1,
                      'alienated': 1,
                      'can': 82,
                      'participants': 1,
                      'independent': 2,
                      'dismal': 1,
                      'sees': 3,
                      'debunking': 1,
                      'Mountain': 1,
                      'permission': 1,
                      'suppose': 1,
                      'carnal': 1,
                      'nation-states': 1,
                      'bits': 1,
                      'are': 193,
                      '18': 3,
                      'imperial': 1,
                      'treatment': 1,
                      'drawn': 3,
                      'ecclesiastical': 2,
                      'reconcile': 1,
                      'advisers': 1,
                      'instruction': 2,
                      'one-third': 2,
                      'brethren': 4,
                      'hatred': 1,
                      'reflects': 1,
                      'feelings': 3,
                      'constituted': 1,
                      'Buddhist': 1,
                      'giveth': 2,
                      'activities': 4,
                      'Early': 2,
                      'voting': 1,
                      'leave': 8,
                      'York': 5,
                      'exquisitely': 1,
                      'importance': 3,
                      'assert': 1,
                      'power': 49,
                      'yielded': 1,
                      'till': 1,
                      'shoot': 1,
                      'shadow': 2,
                      'classes': 4,
                      'any': 60,
                      'Buddhists': 2,
                      'bore': 1,
                      'Hence': 2,
                      'dying': 3,
                      'observe': 1,
                      'automatically': 1,
                      'insurgence': 1,
                      'talent': 3,
                      'front': 2,
                      'deceived': 1,
                      'absolutes': 1,
                      'Ibn': 1,
                      'dispute': 1,
                      'gladness': 1,
                      'Odors': 1,
                      'severely': 1,
                      'institutionalized': 1,
                      '1961': 5,
                      'Beautiful': 1,
                      'Pragmatism': 1,
                      'drink': 1,
                      'Beach': 1,
                      'run': 8,
                      'discussed': 3,
                      'persons': 10,
                      'equity': 1,
                      'Continental': 1,
                      'addresses': 1,
                      'tribulation': 1,
                      'effected': 3,
                      'indelible': 1,
                      'unconcerned': 1,
                      'visitor': 1,
                      'intellectus': 1,
                      'thousand': 6,
                      '70,000,000': 1,
                      'themselves': 15,
                      'academic': 1,
                      'music': 6,
                      'visit': 3,
                      'experiences': 4,
                      'recounted': 1,
                      'fallen': 1,
                      'eighteenth': 1,
                      'Church': 43,
                      'himself': 22,
                      'metaphysic': 3,
                      'dropped': 1,
                      'parties': 4,
                      'provoke': 1,
                      'divinely': 1,
                      'speaks': 1,
                      'anyone': 1,
                      'reader': 1,
                      'unnecessary': 1,
                      'her': 8,
                      'correspond': 1,
                      'bottom': 1,
                      'spring': 1,
                      'eradicate': 1,
                      'continuous': 1,
                      'role': 5,
                      'longer': 14,
                      'sing': 1,
                      'strange': 2,
                      'Missions': 1,
                      'mercy': 4,
                      'Manchester': 2,
                      'methods': 1,
                      'band': 1,
                      'trends': 3,
                      'visited': 2,
                      'liberation': 1,
                      'hyperbole': 2,
                      'confessional': 2,
                      'disposed': 2,
                      'Also': 1,
                      'radio': 3,
                      'societies': 1,
                      'trees': 1,
                      'Some': 19,
                      'squarely': 1,
                      'pilgrim': 1,
                      'possessing': 2,
                      'Japanese': 1,
                      'exchange': 2,
                      'due': 4,
                      'considering': 3,
                      'attainment': 4,
                      'selection': 1,
                      'seeming': 1,
                      'Scotian': 1,
                      'Arhat': 1,
                      'appearing': 1,
                      'compressed': 1,
                      'sixty': 1,
                      'Virgin': 4,
                      'proviso': 1,
                      'serve': 3,
                      'overcomes': 2,
                      'problem': 12,
                      'walking': 1,
                      'percentage': 4,
                      'stage': 1,
                      'Catherine': 1,
                      'Outline': 1,
                      'enlightened': 2,
                      'reflect': 1,
                      'looking': 2,
                      'commission': 1,
                      'Bushnell': 1,
                      'nuns': 1,
                      'shorter': 1,
                      'unavoidably': 2,
                      'striking': 1,
                      'British': 1,
                      'monk': 7,
                      'Time': 1,
                      'prudential': 1,
                      'humor': 1,
                      'periods': 1,
                      'resent': 2,
                      "John's": 2,
                      'astounding': 1,
                      'exclusive': 2,
                      'descending': 1,
                      'paredon': 2,
                      'anti-Christian': 1,
                      'teacher': 4,
                      'pointed': 2,
                      'realized': 1,
                      'determination': 2,
                      'same': 28,
                      'possess': 3,
                      'place': 12,
                      'piece': 3,
                      'apprehend': 1,
                      'hardly': 2,
                      'ethicist': 1,
                      'amusement': 2,
                      '1861': 2,
                      'reform': 4,
                      'atmosphere': 5,
                      'apprehension': 2,
                      'union': 3,
                      'melting': 1,
                      "Batista's": 1,
                      'physician': 1,
                      'dictator': 2,
                      'greater': 11,
                      'gulf': 1,
                      'selflessness': 1,
                      'therefore': 9,
                      'remain': 3,
                      'equal': 7,
                      'factual': 1,
                      'think': 17,
                      'could': 59,
                      'Especially': 1,
                      'Smith': 1,
                      'Federal': 1,
                      'organizes': 1,
                      'fondness': 1,
                      'stopped': 1,
                      'alleviation': 2,
                      'unmixed': 1,
                      'related': 2,
                      'structurally': 1,
                      'circumcision': 1,
                      'frank': 2,
                      'Along': 1,
                      'pride': 1,
                      'shore': 1,
                      'remarkably': 1,
                      '13th': 1,
                      'wise': 1,
                      'customs': 1,
                      'all-inclusive': 1,
                      'Buddhism': 6,
                      'unity': 21,
                      'proximate': 1,
                      'Dr.': 7,
                      'tagged': 1,
                      'defend': 1,
                      'appetites': 1,
                      'meant': 5,
                      'shield': 1,
                      'old': 5,
                      'attested': 1,
                      'morals': 2,
                      'tiny': 1,
                      'forbids': 1,
                      'proportionate': 3,
                      'picture': 15,
                      'contain': 2,
                      'includes': 1,
                      'mystics': 1,
                      'for': 283,
                      'slavery': 11,
                      'equally': 1,
                      'buildings': 3,
                      'between': 31,
                      'excellent': 2,
                      'cost': 5,
                      'exempt': 1,
                      'host': 3,
                      'Nationalism': 1,
                      '6': 6,
                      'oceans': 1,
                      'report': 3,
                      'converts': 2,
                      'braces': 1,
                      'appear': 2,
                      'diagrams': 1,
                      'Roman': 13,
                      'irritations': 1,
                      'recognize': 5,
                      'improve': 1,
                      'World': 4,
                      'marked': 4,
                      'higher': 3,
                      'diplomatic': 2,
                      'eyes': 4,
                      'filled': 5,
                      'simple': 4,
                      'racially': 1,
                      'firmly': 1,
                      'Until': 1,
                      'hand': 10,
                      'dear': 2,
                      'fruit': 2,
                      'Taoists': 2,
                      'covenant': 1,
                      'remind': 3,
                      'Varner': 1,
                      'contingencies': 2,
                      'favorite': 1,
                      'universality': 1,
                      'brightens': 1,
                      'told': 9,
                      'they': 115,
                      'Alice': 1,
                      'done': 13,
                      'Communist': 3,
                      'discovery': 1,
                      'evangelism': 4,
                      '37': 1,
                      'may': 78,
                      'The': 185,
                      'heat': 1,
                      'benefit': 2,
                      'uselessly': 1,
                      'objectively': 1,
                      'Boston': 7,
                      'reports': 3,
                      'undermined': 1,
                      'portion': 1,
                      'emphasize': 2,
                      'rained': 1,
                      'celebrated': 1,
                      'law': 10,
                      'boot': 1,
                      'Lloyd': 1,
                      'Slough': 1,
                      'clues': 1,
                      'Buri': 3,
                      'defense': 3,
                      'quarter': 2,
                      'police': 1,
                      'world': 90,
                      'acting': 1,
                      'observed': 2,
                      'state': 14,
                      're-evaluation': 1,
                      'trend': 4,
                      'felt': 6,
                      'private': 3,
                      'conception': 3,
                      'contagion': 1,
                      'variant': 1,
                      'confuse': 1,
                      'precisely': 4,
                      'Pledge': 1,
                      'nun': 1,
                      'fighting': 1,
                      'protest': 1,
                      '1845': 1,
                      'self-confidence': 1,
                      'relaxation': 1,
                      'Confucian': 3,
                      'brutally': 1,
                      'spell': 4,
                      'relativity': 1,
                      'hate': 3,
                      'facts': 2,
                      'recreation': 1,
                      'enjoyed': 2,
                      'doctrinal': 1,
                      'assembled': 2,
                      'disastrous': 1,
                      'taking': 4,
                      'L.': 1,
                      'fascination': 1,
                      'dependable': 1,
                      'too': 28,
                      'agreed-upon': 1,
                      'exciting': 1,
                      'aspect': 2,
                      'confirmed': 2,
                      'ruler': 1,
                      'outside': 5,
                      'hung': 2,
                      'relation': 1,
                      '355': 1,
                      'book': 4,
                      'aids': 1,
                      'sets': 2,
                      'following': 11,
                      'opiates': 1,
                      'Basic': 1,
                      'event': 8,
                      'diametrically': 1,
                      'Devonshire': 1,
                      'employees': 1,
                      'specialization': 2,
                      'particularistic': 1,
                      'bound': 2,
                      'Musical': 1,
                      'intend': 1,
                      'presents': 3,
                      "Paul's": 1,
                      'above': 2,
                      'endlessly': 1,
                      'celebrating': 1,
                      'purpose': 4,
                      'volumes': 1,
                      'unpublished': 1,
                      'give': 17,
                      'catechism': 1,
                      'foundations': 2,
                      'Thus': 11,
                      'membership': 33,
                      'assemble': 1,
                      'successfully': 1,
                      'medieval': 2,
                      'implications': 3,
                      'upset': 2,
                      'scientists': 3,
                      'treason': 1,
                      'housing': 13,
                      'infatuation': 2,
                      'odor': 1,
                      'convenient': 1,
                      'infraction': 1,
                      'rapidly': 1,
                      'waving': 1,
                      'mapping': 1,
                      'mutual': 3,
                      'diverse': 1,
                      'statistics': 3,
                      'strenuous': 3,
                      'redactions': 1,
                      'allow': 3,
                      'critical': 2,
                      'circulate': 1,
                      'correctness': 1,
                      'anticipations': 1,
                      'identified': 2,
                      'grasped': 1,
                      'cancer': 2,
                      'Pope': 3,
                      'inadequacy': 1,
                      'hermeneutics': 1,
                      'actually': 8,
                      'limit': 1,
                      'advice': 1,
                      'Several': 4,
                      'explanation': 2,
                      'replied': 1,
                      'preserve': 4,
                      'Machado': 1,
                      'sea': 5,
                      'Behold': 1,
                      'thought': 12,
                      'employed': 4,
                      'earthly': 4,
                      'on': 160,
                      'response': 3,
                      'conviction': 5,
                      'compared': 2,
                      'authority': 2,
                      'Queen': 1,
                      'Our': 7,
                      'Perhaps': 5,
                      'maturation': 1,
                      'confused': 1,
                      'football': 3,
                      'big': 5,
                      'earnestly': 1,
                      'Tsou': 1,
                      'dislikes': 1,
                      'comply': 1,
                      'Sidney': 1,
                      'clearly': 9,
                      'nourished': 1,
                      'spiral': 6,
                      'attempted': 1,
                      'completely': 10,
                      'abides': 1,
                      'C': 1,
                      'wei': 2,
                      'civilization': 1,
                      'cold': 2,
                      'Reproach': 1,
                      'financial': 2,
                      'coverage': 2,
                      'apostolic': 1,
                      'excited': 1,
                      'regeneration': 1,
                      'Tai': 1,
                      'idle': 1,
                      'utterly': 4,
                      'Opportunities': 2,
                      'mere': 4,
                      'mend': 1,
                      'fate': 2,
                      'insults': 1,
                      'Heavenly': 1,
                      'clergyman': 4,
                      'intellectually': 2,
                      'worst': 2,
                      'reminding': 1,
                      'contracted': 1,
                      'impact': 1,
                      'Psalm': 3,
                      'glasses': 1,
                      'street': 1,
                      'fortiori': 1,
                      'rid': 1,
                      'self-deprecation': 1,
                      'orbiting': 1,
                      'Parker': 28,
                      'Bible-emancipated': 1,
                      'sins': 5,
                      'utmost': 1,
                      'Braille': 1,
                      'current': 2,
                      'worth': 1,
                      'thousands': 1,
                      'tradition': 8,
                      'priest': 4,
                      'expectations': 1,
                      'down-and-outers': 1,
                      '22': 1,
                      'Alfred': 1,
                      'intended': 6,
                      'town': 2,
                      'opinions': 2,
                      'capitalists': 1,
                      'now': 33,
                      'lies': 3,
                      'reign': 1,
                      'dramatic': 4,
                      'radiant': 1,
                      'expects': 1,
                      'faint': 1,
                      'demolition': 1,
                      'suddenly': 3,
                      'De': 2,
                      '260': 1,
                      'gets': 3,
                      'respond': 1,
                      'advocate': 1,
                      'individuality': 1,
                      'division': 3,
                      'loss': 2,
                      'Will': 1,
                      'bigotry': 1,
                      'three': 20,
                      'shut': 1,
                      'proclaims': 2,
                      'demand': 11,
                      'presumes': 1,
                      'stock': 5,
                      'restrictive': 3,
                      'centenary': 1,
                      'sympathy': 2,
                      'San': 1,
                      'presidential': 1,
                      'Unitarians': 2,
                      'bulletin': 1,
                      'clearer': 1,
                      'industrial': 1,
                      'virtue': 5,
                      'pure': 3,
                      'organize': 1,
                      'extended': 1,
                      'disciples': 1,
                      'alumni': 1,
                      'emphatically': 1,
                      'choice': 12,
                      'desires': 1,
                      'joy': 3,
                      '1789-1839': 1,
                      "community's": 1,
                      'hole': 1,
                      'provided': 4,
                      'Taoist': 3,
                      'forceful': 1,
                      'nation': 12,
                      'heroic': 1,
                      'narrow-minded': 1,
                      'personal': 14,
                      'difficulty': 3,
                      'Arhats': 1,
                      'monastic': 1,
                      'secularists': 1,
                      'West': 8,
                      'present': 25,
                      'describes': 1,
                      'dealing': 2,
                      'science': 8,
                      'necessary': 8,
                      'unleashing': 1,
                      'hit': 1,
                      'receives': 3,
                      'be': 243,
                      'must': 54,
                      'racing': 1,
                      'Scholars': 1,
                      'helpful': 3,
                      'ready': 1,
                      'reconsidered': 3,
                      'Mary': 4,
                      'an': 119,
                      'Troeltsch': 1,
                      'suspicion': 3,
                      'presented': 2,
                      'culture': 4,
                      'witches': 3,
                      'abundance': 1,
                      'shares': 1,
                      'sincerity': 3,
                      'Rauschenbusch': 1,
                      'engaged': 2,
                      'Movement': 1,
                      'findings': 2,
                      'designate': 1,
                      'conclusion': 3,
                      'reasons': 5,
                      'traditionally': 1,
                      'Ti': 1,
                      'prudentially': 1,
                      "T'ien": 1,
                      'remote': 1,
                      'oxygen': 1,
                      'Head': 1,
                      'joined': 2,
                      'criticize': 1,
                      'transience': 1,
                      'friendly': 2,
                      'Garrison': 3,
                      'war-dirty': 1,
                      'refusal': 2,
                      'Word': 7,
                      'regular': 1,
                      'shivering': 1,
                      'covering': 1,
                      'numerous': 1,
                      'contended': 1,
                      'status': 2,
                      'Ferris': 1,
                      'central': 3,
                      'carefully': 4,
                      'colonial': 1,
                      'twenty-eighth': 1,
                      'folly': 2,
                      'fail': 1,
                      'commands': 1,
                      'burial': 1,
                      'residence': 1,
                      'Glenn': 1,
                      'dignity': 1,
                      'how': 23,
                      'force': 9,
                      '1803-1895': 1,
                      'employ': 1,
                      'establishment': 4,
                      'weeping': 2,
                      'preparation': 8,
                      'prevented': 1,
                      'popularly': 1,
                      'traditional': 13,
                      'Bay': 3,
                      'throw': 2,
                      'key': 1,
                      'embraces': 1,
                      'obligations': 1,
                      'repressions': 1,
                      'Parris': 2,
                      'paid': 5,
                      'trail': 1,
                      'Had': 1,
                      'strongest': 2,
                      'Certainly': 1,
                      '1924': 1,
                      'revealed': 1,
                      'alone': 8,
                      'rightly': 2,
                      'humans': 1,
                      'statutes': 1,
                      'deemed': 1,
                      'cherubim': 1,
                      'Bultmann': 4,
                      '1815': 3,
                      'counseling': 1,
                      'into': 82,
                      'precepts': 1,
                      'strife': 1,
                      'question': 10,
                      'else': 3,
                      'cult': 3,
                      'local': 10,
                      'justify': 3,
                      'let': 11,
                      'promote': 1,
                      'religions': 6,
                      'libertarians': 1,
                      'devout': 2,
                      'gave': 11,
                      'becoming': 2,
                      'May': 1,
                      'Vineyard': 1,
                      'facing': 1,
                      'necessity': 3,
                      'seeks': 1,
                      'grown': 2,
                      'dare': 3,
                      'separating': 1,
                      '12': 4,
                      'render': 1,
                      'Jewish': 1,
                      'majesty': 1,
                      'octillion': 3,
                      'relationship': 3,
                      'anchorite': 1,
                      'existed': 4,
                      'because': 39,
                      'doubting': 1,
                      'chaplains': 1,
                      '2': 20,
                      'decides': 1,
                      'persisted': 2,
                      'hours': 7,
                      'hunger': 2,
                      'preconditioned': 2,
                      'variety': 2,
                      'Puritan': 2,
                      '220': 1,
                      'Fortunately': 2,
                      'technological': 1,
                      ...}),
             'reviews': Counter({'Without': 1,
                      'festival': 5,
                      'contented': 2,
                      'Wallenstein': 2,
                      'deft': 1,
                      'More': 4,
                      '23': 11,
                      'fiesta': 1,
                      'Anthony': 3,
                      'located': 1,
                      'plush': 1,
                      'infectious': 1,
                      'future-day': 1,
                      "Hendricks'": 1,
                      'education': 1,
                      'house': 6,
                      'Hewett': 1,
                      'barrage': 1,
                      "choir's": 1,
                      'outmaneuvered': 1,
                      'waters': 3,
                      'bespeak': 1,
                      'Between': 1,
                      'restrain': 1,
                      'operating': 1,
                      'use': 8,
                      'Lower': 2,
                      'killed': 4,
                      'vitally': 2,
                      'recruits': 1,
                      'endure': 1,
                      'sly': 1,
                      'bars': 1,
                      'sports': 2,
                      "Charley's": 1,
                      'Viscount': 2,
                      'thoroughly': 2,
                      'February': 1,
                      'Imaginary': 1,
                      'sextet': 2,
                      'well-received': 1,
                      'abrupt': 1,
                      'certain': 5,
                      'season': 24,
                      'Nazi': 2,
                      'K': 1,
                      'shade': 2,
                      'others': 14,
                      'Nikolai': 2,
                      'transition': 1,
                      'Laboratory': 1,
                      'catch': 2,
                      'beautiful': 11,
                      'averaged': 1,
                      'Lewisohn': 4,
                      'denying': 1,
                      'Toobin': 2,
                      'cadenza': 2,
                      'Kornevey': 1,
                      'company': 15,
                      'Stevens': 1,
                      'cliches': 2,
                      'discovery': 3,
                      'Vicenza': 2,
                      'materially': 1,
                      'Shrewd': 1,
                      'Pontchartrain': 1,
                      'correct': 1,
                      "Hampton's": 1,
                      'lumbering': 1,
                      'kitchen': 1,
                      'Gilels': 2,
                      'amount': 5,
                      'Mazowsze': 1,
                      'few': 15,
                      'omitted': 1,
                      'Renaissance': 1,
                      'pressure': 1,
                      'scientists': 4,
                      'gorgeous': 1,
                      "wouldn't": 1,
                      'Grigory': 1,
                      'through': 26,
                      'Gospel-singer': 1,
                      'polonaise': 1,
                      'Just': 2,
                      'specific': 4,
                      'each': 22,
                      'Because': 2,
                      'Boulez': 1,
                      '29-Oct.': 1,
                      'Xydis': 5,
                      'Lanza': 2,
                      'tons': 1,
                      'timid': 1,
                      'stakes': 1,
                      'decaying': 1,
                      'forget': 2,
                      'doltish': 1,
                      'ceased': 1,
                      "yesterday's": 1,
                      'Democrats': 2,
                      'cottage': 1,
                      'attractively': 1,
                      'shy': 1,
                      'raised': 1,
                      'meanings': 1,
                      'ex-prize': 1,
                      'revealed': 3,
                      'Ernst': 1,
                      'grave': 1,
                      'distaste': 1,
                      '19,000,000': 1,
                      'sweet-sounding': 1,
                      'repressive': 1,
                      'situations': 2,
                      "Shakespeare's": 2,
                      'At': 12,
                      ')': 101,
                      'mastery': 1,
                      'colleagues': 1,
                      'novelized': 1,
                      'Those': 2,
                      'Raine': 1,
                      'Nelson': 1,
                      'career': 1,
                      'Splendid': 1,
                      'friends': 2,
                      'twists': 2,
                      'Tannhaeuser': 2,
                      'subtly': 1,
                      'shouting': 1,
                      'small': 10,
                      'bull': 1,
                      'needle-sharp': 1,
                      'jam': 1,
                      'Skolovsky': 4,
                      'enemy': 3,
                      'competition': 1,
                      'experience': 6,
                      'revolutions': 1,
                      'also': 31,
                      'recently': 10,
                      'earthy': 2,
                      'tournament': 2,
                      'distinct': 2,
                      'diehards': 1,
                      'next': 10,
                      'accelerating': 1,
                      'armchair': 1,
                      'Poindexter': 1,
                      'four': 6,
                      'Serafin': 1,
                      'e.g.': 1,
                      'really': 19,
                      'peoples': 1,
                      'broad': 6,
                      'Foliage': 2,
                      'understanding': 2,
                      'drinker': 2,
                      'messy': 1,
                      'striving': 1,
                      'rushed': 1,
                      'borrowed': 1,
                      'Getz': 1,
                      'signals': 1,
                      'effect': 6,
                      'bookers': 1,
                      'Girl': 3,
                      'NBC': 3,
                      'Cologne': 1,
                      'Kingwood': 1,
                      "brain's": 1,
                      'encores': 1,
                      'on-stage': 1,
                      'population': 6,
                      'King': 3,
                      'artists': 4,
                      'physiological': 2,
                      'confronting': 1,
                      'evening': 18,
                      'Replied': 1,
                      'Joseph': 4,
                      'keen': 1,
                      'eyewitness': 1,
                      'tragi-comic': 1,
                      'retrieved': 1,
                      'regretted': 1,
                      'tempted': 1,
                      'formidably': 1,
                      'ask': 5,
                      'pleasures': 1,
                      'found': 5,
                      'Bastianini': 1,
                      'Yurochka': 1,
                      'guitarist': 1,
                      "one's": 5,
                      'domes': 1,
                      'pretty': 6,
                      "afternoon's": 1,
                      'barges': 1,
                      'mother': 3,
                      'Garry': 1,
                      'things': 10,
                      'Now': 7,
                      'thank': 1,
                      'department': 2,
                      'squealed': 1,
                      'divided': 1,
                      'suppose': 2,
                      'Hall': 14,
                      'transported': 1,
                      '18': 2,
                      'drawn': 2,
                      'spur': 1,
                      'stops': 1,
                      'monologist': 1,
                      'bat': 1,
                      'stags': 1,
                      'German': 4,
                      'obliterate': 1,
                      "Leningrad's": 1,
                      'compulsive': 1,
                      'chanted': 1,
                      'vehicles': 2,
                      'caloric': 3,
                      '1959': 2,
                      'musicians': 5,
                      "Cole's": 1,
                      'shoot': 3,
                      'prints': 2,
                      'marched': 1,
                      'Hutton': 1,
                      'flights': 1,
                      'Overture': 4,
                      'Resourceful': 1,
                      'outset': 1,
                      'Gomez': 1,
                      'cycles': 1,
                      'required': 3,
                      'Arrowhead': 1,
                      'candid': 1,
                      'Gaieties': 1,
                      'Taruffi': 1,
                      'throng': 1,
                      '1961': 1,
                      'strip': 1,
                      'Beach': 1,
                      'run': 1,
                      'discussed': 2,
                      'stores': 2,
                      'Projects': 2,
                      'semi-abstractions': 1,
                      'fronting': 1,
                      'Sonates': 1,
                      'dimension': 2,
                      'uncomfortable': 2,
                      'themselves': 9,
                      'roughness': 1,
                      'Yardumian': 1,
                      'academic': 1,
                      'experiences': 1,
                      'Pinkie': 1,
                      'Dec.': 7,
                      'speaks': 3,
                      'anyone': 8,
                      'reader': 9,
                      'pulled': 2,
                      'continuous': 1,
                      'cosmology': 1,
                      'Roxy': 1,
                      'Tenda': 1,
                      'mercy': 1,
                      'sentences': 1,
                      'domestic': 1,
                      'visited': 1,
                      "L'orchestre": 1,
                      'meticulously': 2,
                      'executing': 1,
                      'teens': 1,
                      'myriad': 1,
                      'encroaching': 1,
                      'arty': 1,
                      'delivery': 1,
                      'seacoast': 1,
                      'trees': 2,
                      "mother's": 1,
                      "Crosby's": 1,
                      'Oscar': 1,
                      'Japanese': 4,
                      'exchange': 1,
                      'Frederick': 1,
                      'due': 3,
                      'Sheldon': 1,
                      '1876': 1,
                      'devastating': 1,
                      'unabashed': 1,
                      'plea': 1,
                      'aural': 1,
                      'overcomes': 1,
                      'problem': 7,
                      'Summer': 1,
                      'winter': 1,
                      'distance': 1,
                      'goodies': 1,
                      'Nervousness': 1,
                      'specialize': 1,
                      'informal': 2,
                      'reflect': 2,
                      'looking': 3,
                      'Journal': 3,
                      'Belgium': 1,
                      'tasting': 1,
                      'British': 15,
                      'Bradley': 1,
                      'moderately': 1,
                      'clever': 5,
                      'humor': 9,
                      'periods': 1,
                      "John's": 1,
                      'Smiling': 1,
                      'sentimental': 4,
                      'Big': 2,
                      'descending': 1,
                      "Cibula's": 1,
                      'determination': 1,
                      'slowly': 4,
                      'Korneyev': 1,
                      'place': 19,
                      'piece': 7,
                      'Parks': 2,
                      'atmosphere': 5,
                      'blessed': 2,
                      'melting': 1,
                      'urgent': 1,
                      'inspiring': 2,
                      'race-driver': 3,
                      'folk-dance': 1,
                      'Eh': 1,
                      '110': 1,
                      'instrument': 1,
                      'think': 10,
                      'could': 40,
                      'dissimilar': 1,
                      'Lawrence': 1,
                      'conform': 1,
                      'teenagers': 1,
                      'audience': 32,
                      'rich': 6,
                      'Carnegie': 5,
                      'Wilson': 2,
                      'spaciousness': 1,
                      'elaborate': 2,
                      'planting': 1,
                      'Along': 1,
                      'pride': 3,
                      'Chorale': 1,
                      'Franklin': 1,
                      'Brooding': 1,
                      'compulsion': 1,
                      'Tallchief': 1,
                      'ran': 1,
                      'tiny': 1,
                      'Sonoma': 1,
                      'picture': 13,
                      'prevents': 1,
                      'sympathetic': 2,
                      'hypothalamus': 1,
                      'gaunt': 1,
                      'fan': 4,
                      'between': 20,
                      '1643': 2,
                      'host': 1,
                      'Hirsch': 2,
                      'intimate': 4,
                      'nailed': 1,
                      'gilded': 1,
                      'recognize': 1,
                      'emerge': 1,
                      'marked': 1,
                      'Toch': 1,
                      'unstilted': 1,
                      'higher': 3,
                      'Dutch': 1,
                      'Stephen': 1,
                      'foliage': 1,
                      "fan's": 1,
                      'firmly': 2,
                      'Tenderly': 1,
                      'oppression': 1,
                      'told': 7,
                      'composers': 5,
                      'velvet': 1,
                      'distinctly': 1,
                      'chicken': 1,
                      'thenceforth': 1,
                      'The': 322,
                      'collaborator': 1,
                      'benefit': 1,
                      'coincidence': 1,
                      'Hodges': 1,
                      'of': 1299,
                      'shores': 1,
                      'celebrated': 2,
                      '1920s': 1,
                      'Toast': 1,
                      'police': 1,
                      'visiting': 1,
                      "Angeles'": 1,
                      'narrative': 2,
                      'perky': 1,
                      'yearned': 1,
                      'corner': 1,
                      'bold': 3,
                      'Skating': 1,
                      'lived': 4,
                      'Sokolov': 2,
                      'wars': 3,
                      'Information': 1,
                      'Adultery': 1,
                      'Gibson': 3,
                      'Alfredo': 1,
                      'facts': 6,
                      'blockages': 1,
                      'assembled': 4,
                      'plump': 1,
                      'disastrous': 1,
                      'announcing': 2,
                      'taking': 5,
                      'recovery': 2,
                      'display': 5,
                      'preacher-singer': 1,
                      'eccentrics': 1,
                      'scale': 1,
                      'funny': 6,
                      "America's": 2,
                      'outside': 6,
                      'relation': 1,
                      'book': 26,
                      'sprue': 1,
                      'Eire': 1,
                      'unrest': 1,
                      'criticism': 3,
                      'Notre': 1,
                      'glances': 1,
                      'jacket': 2,
                      'blame': 3,
                      'conventions': 1,
                      'old-time': 1,
                      'presents': 7,
                      'purpose': 4,
                      'give': 14,
                      "Puttin'": 1,
                      'misfortune': 2,
                      'nos.': 1,
                      'horrors': 1,
                      'Terrible': 1,
                      'tape': 2,
                      'beauty': 5,
                      'flatly': 1,
                      'formalities': 1,
                      'slavish': 1,
                      'blond': 1,
                      'Richard': 6,
                      'Milstein': 4,
                      'diverse': 1,
                      'allow': 2,
                      'critical': 3,
                      'library': 1,
                      'cancer': 3,
                      'missing': 1,
                      'actually': 2,
                      'Several': 4,
                      'explanation': 2,
                      'connected': 2,
                      'juice': 1,
                      'Billy': 2,
                      'Holmes': 1,
                      'Roll': 1,
                      'bang-sashes': 1,
                      'reached': 4,
                      'anyway': 2,
                      'programed': 1,
                      'response': 2,
                      'authority': 2,
                      'Point': 1,
                      'Perhaps': 5,
                      'grace': 1,
                      'missile': 1,
                      'gallant': 1,
                      'big': 23,
                      'Zooey': 1,
                      'quantities': 1,
                      'arresting': 2,
                      'E': 1,
                      'Yea': 1,
                      'nourished': 1,
                      'attempted': 3,
                      'Mario': 1,
                      'C': 3,
                      'alike': 1,
                      'cold': 2,
                      'opened': 11,
                      'Elliott': 1,
                      'feared': 1,
                      'nurses': 1,
                      'Saturated': 1,
                      'fate': 2,
                      'clip': 1,
                      'storyline': 2,
                      'unfortunately': 3,
                      'deck': 2,
                      'lend': 3,
                      'mealie-meal': 1,
                      'street': 1,
                      'bow': 1,
                      'pessimistic': 1,
                      'Vadim': 1,
                      'current': 5,
                      'Rhythms': 1,
                      'brand-new': 1,
                      'bonus': 1,
                      'ludicrous': 1,
                      'cares': 2,
                      'glumly': 1,
                      'trumpeter': 1,
                      'metaphorical': 1,
                      'town': 1,
                      'fulfilled': 1,
                      'situated': 2,
                      'now': 20,
                      'solos': 1,
                      "Larkin's": 1,
                      'dramatic': 9,
                      'Elaine': 1,
                      'suddenly': 3,
                      'De': 12,
                      'gets': 6,
                      'Kerr': 1,
                      'lay': 4,
                      'Carson': 1,
                      'declamatory': 1,
                      'three': 13,
                      'demand': 2,
                      'stock': 1,
                      'words': 8,
                      'planks': 1,
                      'San': 6,
                      'romancing': 1,
                      'virtue': 2,
                      'pure': 1,
                      'Anna': 2,
                      'shed': 1,
                      'nervous': 1,
                      'College': 7,
                      'intrigues': 1,
                      'detective': 1,
                      'continuity': 2,
                      'forceful': 1,
                      'car': 2,
                      'Treasure': 1,
                      'rebut': 1,
                      'Quintet': 1,
                      'A.': 4,
                      'finely': 1,
                      'Triptych': 1,
                      'musicality': 1,
                      'present': 12,
                      'describes': 1,
                      'throws': 1,
                      'science': 3,
                      'influential': 1,
                      'interdependence': 1,
                      'mid-October': 2,
                      'be': 153,
                      'racing': 7,
                      'names': 4,
                      'frolic': 1,
                      'vegetable': 4,
                      'Bix': 1,
                      'Ten': 1,
                      'abundance': 1,
                      'France': 5,
                      'sincerity': 4,
                      'posing': 1,
                      'heap': 1,
                      'resolute': 1,
                      'conclusion': 2,
                      'reasons': 5,
                      'auto': 1,
                      'invent': 1,
                      'minutes': 5,
                      'Agriculture': 1,
                      'remote': 1,
                      'touching': 1,
                      'sweepings': 1,
                      'joined': 1,
                      "He's": 1,
                      'gallantry': 2,
                      'Hearts': 2,
                      'travelled': 1,
                      'docile': 1,
                      'Cesare': 1,
                      'afternoon': 6,
                      'Alvise': 2,
                      'Dream': 1,
                      'Forgotten': 1,
                      'status': 2,
                      'entirety': 1,
                      'commercial': 1,
                      'tarantara': 1,
                      'central': 3,
                      'abbreviated': 1,
                      'soloist': 6,
                      'dignity': 3,
                      'suitably': 1,
                      'force': 2,
                      'replaced': 1,
                      'sneaky': 1,
                      'traditional': 1,
                      'ballerina': 1,
                      'eastern': 2,
                      'RCA': 2,
                      'wildly': 1,
                      'Aspencades': 2,
                      'vistas': 2,
                      'encumbered': 1,
                      'lighting': 2,
                      "script's": 1,
                      'alone': 7,
                      'bracing': 1,
                      'cent': 8,
                      'breathe': 1,
                      'filagree': 1,
                      'into': 42,
                      'principally': 1,
                      'long-sought': 1,
                      'Psyche': 1,
                      'Mijbil': 3,
                      'Tintoretto': 1,
                      'affiliated': 1,
                      'quicksilver': 1,
                      'else': 10,
                      'let': 9,
                      'boatsmen': 1,
                      'Fifty-fifth': 2,
                      'genuinely': 2,
                      'becoming': 7,
                      'May': 2,
                      'promising': 1,
                      'necessity': 1,
                      'managed': 3,
                      'autobiography': 1,
                      'narrator': 1,
                      'Jewish': 1,
                      'method': 1,
                      'Mimieux': 1,
                      'murder': 1,
                      'shakes': 2,
                      'grown-up': 1,
                      'election': 2,
                      'sweeping': 1,
                      'landscapes': 2,
                      '2': 9,
                      'violin': 4,
                      'insiders': 1,
                      'ramps': 1,
                      'self-deceptions': 1,
                      'directors': 1,
                      'matchmaking': 1,
                      'revolved': 1,
                      "Russia's": 2,
                      'assimilation': 1,
                      'temperature': 1,
                      'embodiment': 1,
                      'curative': 1,
                      'entourage': 2,
                      'Iraq': 1,
                      'electric': 1,
                      'Instrumental': 1,
                      'hypothesis': 1,
                      'Bud': 1,
                      'Danish': 1,
                      'Garrick': 1,
                      'corners': 1,
                      'nightclub': 1,
                      'Countrymen': 1,
                      'Brass': 1,
                      'textures': 1,
                      'Otherwise': 1,
                      'S.': 3,
                      'Cervantes': 1,
                      'Adagio': 1,
                      'parades': 1,
                      'remarks': 3,
                      'Pavlovsky': 1,
                      'overly': 2,
                      'preoccupied': 1,
                      'commands': 2,
                      '1915': 1,
                      'standards': 1,
                      'clothes': 4,
                      'lukewarm': 1,
                      'qualities': 6,
                      'dispose': 1,
                      'converse': 1,
                      'Alger': 1,
                      'dignified': 1,
                      'monumental': 1,
                      'Plebian': 1,
                      'released': 1,
                      'ends': 1,
                      'ninth': 1,
                      'chronology': 1,
                      'letter': 4,
                      'treacherous': 1,
                      'Drifts': 1,
                      'plan': 2,
                      'infected': 1,
                      'parked': 2,
                      'startlingly': 1,
                      'substance': 4,
                      "rock'n'roll": 1,
                      'from': 138,
                      '500-mile': 1,
                      'Hot': 1,
                      'shredding': 1,
                      'appearance': 2,
                      'ball': 1,
                      'serfs': 1,
                      'reality': 1,
                      'hooting': 1,
                      'quintet': 1,
                      'studied': 3,
                      'twenty-five': 1,
                      'deeds': 1,
                      'years': 33,
                      'Nutcracker': 3,
                      'East': 5,
                      'probably': 8,
                      'frequently': 8,
                      'eagerness': 1,
                      'During': 3,
                      'corniest': 1,
                      'Cold': 1,
                      'Chorus': 1,
                      'wry': 1,
                      'Obviously': 1,
                      'trio': 1,
                      'what': 44,
                      'or': 97,
                      'states': 4,
                      'fifteen': 2,
                      'destruction': 2,
                      'Wind': 1,
                      '300th': 1,
                      'city': 7,
                      'attributes': 1,
                      'sex': 1,
                      "writers'": 1,
                      'related': 3,
                      'active': 1,
                      'Shelley': 1,
                      'easier': 1,
                      'Phil': 1,
                      'unbidden': 1,
                      'allowing': 2,
                      'general': 9,
                      'puerile': 1,
                      'presentations': 2,
                      'invite': 1,
                      'diplomat': 1,
                      'Jouvet': 4,
                      'primary': 3,
                      'national': 3,
                      'tomorrow': 2,
                      'formulae': 1,
                      'top-drawer': 1,
                      'Allied': 10,
                      'expanding': 2,
                      'notes': 5,
                      'skillful': 1,
                      'Naomi': 2,
                      'Bulba': 2,
                      'collections': 1,
                      'called': 20,
                      'opinions': 2,
                      'urges': 1,
                      'blues': 3,
                      'fictional': 1,
                      'Futhermore': 1,
                      'year': 17,
                      'reverse': 2,
                      'Rhenish': 1,
                      'Straits': 1,
                      'Coletta': 1,
                      'courage': 3,
                      'painted': 2,
                      'city-bred': 1,
                      'Radio': 2,
                      'G.': 2,
                      'stager': 1,
                      'deals': 1,
                      'scraps': 1,
                      'conspicuously': 1,
                      'ransack': 1,
                      'organized': 2,
                      'wonder': 6,
                      'partial': 1,
                      'objective': 1,
                      'coarsened': 1,
                      'three-week': 1,
                      'born': 2,
                      'sharply': 2,
                      'non-sentimental': 1,
                      'chance': 7,
                      'tarpon': 1,
                      "Shearing's": 1,
                      'spirits': 1,
                      'high-minded': 1,
                      'introspection': 1,
                      "composer's": 1,
                      'salty': 1,
                      'gaps': 1,
                      'compositions': 2,
                      'indomitable': 1,
                      'not': 143,
                      'Kennett': 1,
                      'sons': 1,
                      'unsympathetic': 1,
                      'headquarters': 2,
                      'chuckles': 1,
                      'epigrams': 1,
                      'imponderable': 1,
                      '19': 1,
                      'savory': 1,
                      'growth': 2,
                      'college': 2,
                      'legato': 1,
                      'Shearing': 3,
                      'placed': 3,
                      'enforced': 1,
                      'noble': 2,
                      'illustrated': 1,
                      'set': 16,
                      'Melodious': 1,
                      'hormones': 1,
                      'Married': 1,
                      'rationalist': 1,
                      'arrangement': 4,
                      'manages': 1,
                      'laced': 1,
                      'essential': 1,
                      'five': 6,
                      'breathtaking': 1,
                      'common': 7,
                      'less': 16,
                      'Mondays': 1,
                      'ballets': 1,
                      'thrusting': 1,
                      'trailers': 1,
                      'participate': 1,
                      'jocular': 1,
                      "Creston's": 1,
                      'application': 1,
                      'temperament': 1,
                      'chronologically': 1,
                      'calisthenics': 1,
                      'ceremonies': 1,
                      'Ancel': 1,
                      'dark': 2,
                      'Mi': 1,
                      'shall': 1,
                      '1,488': 1,
                      'word': 16,
                      'innocence': 1,
                      'magnificent': 5,
                      'showcase': 2,
                      'thinks': 3,
                      'benefactor': 1,
                      'shells': 1,
                      '211': 1,
                      'Sharing': 1,
                      'Ferguson': 1,
                      'Galina': 1,
                      'gardens': 1,
                      'sloppy': 1,
                      'Katherine': 1,
                      'Walter': 2,
                      'dam': 1,
                      'Anton': 2,
                      'bronze': 1,
                      'greenhouses': 1,
                      'Chicago-style': 1,
                      'business': 7,
                      'strength': 2,
                      'Toccata': 1,
                      'claims': 1,
                      'golfer': 1,
                      'Van': 2,
                      'conductor': 6,
                      'airdrops': 1,
                      'propaganda': 1,
                      '5-mile': 1,
                      'conclusions': 3,
                      'operatic': 3,
                      'gems': 2,
                      'publication': 3,
                      'multimillionaire': 1,
                      'troubles': 1,
                      'largely': 3,
                      'unusual': 1,
                      'Young': 1,
                      'vigilant': 1,
                      'Brother': 1,
                      'flow': 5,
                      'white': 8,
                      '1941': 1,
                      'consistency': 1,
                      'Gesangverein': 1,
                      'Brahms': 5,
                      'stares': 1,
                      'task': 3,
                      'Swedish': 1,
                      'stumbled': 1,
                      'pro-Communist': 1,
                      'warmed-over': 1,
                      'Dvorak': 1,
                      'sort': 6,
                      'temptation': 1,
                      'loudest': 1,
                      'couples': 1,
                      'LeClair': 1,
                      "Jenni's": 1,
                      'partially': 1,
                      'Preludes': 1,
                      'touring': 1,
                      'categorize': 1,
                      'artificial': 2,
                      'comedians': 1,
                      'enacting': 1,
                      'depending': 1,
                      'depression': 1,
                      'kind': 13,
                      'prolonged': 1,
                      'questionable': 1,
                      'distant': 2,
                      'bear': 3,
                      'wicked': 1,
                      'animals': 1,
                      'super-charged': 1,
                      'Picon': 1,
                      'exact': 1,
                      'experimenters': 1,
                      'recalled': 1,
                      'speedboat': 1,
                      'casually': 2,
                      'pleasantly': 1,
                      'reward': 1,
                      'visitors': 1,
                      'sneering': 1,
                      'rapists': 1,
                      'accurately': 1,
                      'forty-five': 1,
                      'marinas': 2,
                      'Malcolm': 1,
                      'mettlesome': 1,
                      'Dunn': 1,
                      'countryside': 1,
                      'chronicle': 3,
                      'ear': 3,
                      'tea': 2,
                      'dramatically': 1,
                      'best': 20,
                      'Rider': 1,
                      'James': 11,
                      'Hart': 9,
                      'lower': 1,
                      'ingredient': 1,
                      "Leonato's": 1,
                      'though': 18,
                      'buckwheat': 1,
                      'successful': 5,
                      'shift': 1,
                      'facetiously': 1,
                      'defeat': 1,
                      'motif': 1,
                      'nurse': 1,
                      'princess': 1,
                      "aren't": 1,
                      'spot': 3,
                      'crimson': 1,
                      'cells': 2,
                      'Department': 4,
                      'twilight': 1,
                      'extremes': 1,
                      'birches': 1,
                      '2454': 1,
                      'gobbledygook': 1,
                      'dogmatism': 1,
                      ...}),
             'romance': Counter({'Without': 2,
                      'vacuum': 2,
                      'festival': 1,
                      'contented': 2,
                      'laden': 1,
                      'More': 4,
                      'oystchers': 1,
                      'anyhow': 4,
                      'speak': 11,
                      'plain-out': 1,
                      'house': 36,
                      'promises': 1,
                      'skilled': 1,
                      'muscles': 1,
                      'shipwrecked': 1,
                      'overdeveloped': 1,
                      'darling': 4,
                      'building': 8,
                      'Adams': 1,
                      'waters': 1,
                      'dig': 1,
                      'threshold': 1,
                      'use': 14,
                      'than': 65,
                      'killed': 4,
                      'experience': 4,
                      'served': 1,
                      'jubilantly': 1,
                      'sports': 3,
                      'tempted': 2,
                      'thoroughly': 2,
                      'February': 1,
                      'Spike-haired': 1,
                      'pleading': 1,
                      'forum': 2,
                      'coursing': 1,
                      'certain': 13,
                      'season': 1,
                      'Jenny': 7,
                      'inward': 2,
                      'shade': 2,
                      'others': 9,
                      'sunshine': 2,
                      'Wait': 2,
                      'mercy': 1,
                      'bucking': 1,
                      'taut': 3,
                      'catch': 3,
                      'beautiful': 25,
                      'shorter': 1,
                      'marrying': 2,
                      'relay': 1,
                      'Karamazov': 1,
                      'Jeroboam': 1,
                      'denying': 1,
                      "John'll": 1,
                      'brushing': 2,
                      'sorted': 1,
                      'imposition': 1,
                      'nuzzled': 1,
                      'hardships': 1,
                      'inflection': 1,
                      'chic': 2,
                      'greatness': 2,
                      'drab': 1,
                      'necktie': 1,
                      "cousins'": 1,
                      'less-dramatic': 1,
                      'correct': 2,
                      'corporation': 2,
                      'cabins': 1,
                      'glance': 6,
                      'sage': 1,
                      'sommelier': 1,
                      'lumbering': 1,
                      'kitchen': 12,
                      'amount': 1,
                      "She'd": 3,
                      'rabbits': 1,
                      'kibbutzim': 1,
                      'fire-colored': 1,
                      'crack': 2,
                      'omitted': 1,
                      'glances': 2,
                      'Seeing': 2,
                      'way': 83,
                      'intriguingly': 1,
                      'unbelievable': 2,
                      "wouldn't": 33,
                      'clucked': 1,
                      'hurry': 3,
                      'interdependent': 1,
                      'through': 56,
                      'Just': 28,
                      "we'll": 4,
                      'each': 24,
                      'bucking-up': 1,
                      'Ah': 5,
                      'Rossi': 1,
                      'expanse': 1,
                      'tons': 1,
                      'timid': 2,
                      'vegetable': 1,
                      'forget': 5,
                      "Maggie's": 3,
                      'behave': 1,
                      'postscript': 1,
                      'log-house': 1,
                      'cottage': 6,
                      'shy': 1,
                      'raised': 6,
                      'magnificence': 1,
                      'stray': 1,
                      'catchee': 1,
                      'stream': 3,
                      'ate': 4,
                      'grave': 3,
                      'distaste': 2,
                      'charlotte': 1,
                      'sprung': 1,
                      'buggy': 1,
                      'eternal': 3,
                      'blackout': 1,
                      'descend': 1,
                      'fences': 1,
                      'At': 29,
                      'nervously': 3,
                      "I'm": 77,
                      'focused': 1,
                      'vessels': 1,
                      'les': 1,
                      'tipsy': 1,
                      'Those': 5,
                      'surprisingly': 2,
                      'bordering': 1,
                      'drop': 5,
                      'prophets': 1,
                      'career': 2,
                      'idly': 1,
                      'husband': 10,
                      'friends': 13,
                      'Toonker': 1,
                      'bet': 3,
                      'dully': 1,
                      'Deegan': 22,
                      'shouting': 5,
                      'wiped': 1,
                      'sands': 1,
                      'slip': 1,
                      'negative': 1,
                      "patient's": 1,
                      'Afterwards': 1,
                      'small': 31,
                      'Myra': 24,
                      'competition': 1,
                      'Heads': 1,
                      'intrepid': 1,
                      'doubtfully': 1,
                      'snag': 1,
                      'knife': 9,
                      'hostess': 1,
                      'missiles': 1,
                      'recently': 2,
                      'earthy': 1,
                      'vanity': 1,
                      'babes': 1,
                      'snapped': 5,
                      'next': 43,
                      'sly': 1,
                      'strike': 1,
                      'shall': 3,
                      'lapping': 2,
                      'searching': 3,
                      'really': 38,
                      'broad': 5,
                      'tireless': 2,
                      'disdain': 1,
                      'Please': 5,
                      'comforting': 1,
                      'rushed': 5,
                      'lath': 2,
                      'signals': 1,
                      'effect': 4,
                      'Pete': 11,
                      'Anything': 2,
                      'giggled': 1,
                      'King': 2,
                      'artists': 2,
                      'grasses': 1,
                      'masculine': 1,
                      'Remembering': 1,
                      'evening': 14,
                      'prisoners': 5,
                      'dripped': 1,
                      'twelve': 5,
                      'telegraph': 1,
                      'keen': 1,
                      'relax': 1,
                      'Lovejoy': 3,
                      'terribly': 4,
                      'Tolley': 6,
                      'regretted': 2,
                      'defiance': 1,
                      'assets': 2,
                      'shooing': 1,
                      'ask': 10,
                      'invested': 1,
                      'found': 29,
                      'floorshow': 1,
                      "one's": 4,
                      'pretty': 19,
                      'Springfield': 1,
                      'scholarships': 1,
                      'mother': 44,
                      'Galt': 1,
                      'things': 30,
                      'Now': 29,
                      'job': 27,
                      'thank': 3,
                      'parkish': 1,
                      'department': 1,
                      'independent': 1,
                      'lewd': 1,
                      'permission': 2,
                      'divided': 3,
                      'suppose': 7,
                      'Hall': 2,
                      'Patrol': 2,
                      'sir': 6,
                      'bat': 4,
                      'harshened': 1,
                      'hatred': 4,
                      'German': 1,
                      'neighbor': 1,
                      'scream': 2,
                      'signal': 4,
                      'vehicles': 1,
                      'exquisitely': 1,
                      'grabbed': 1,
                      'medium-sized': 1,
                      'gushed': 1,
                      'yielded': 1,
                      'commencement': 1,
                      'shadow': 3,
                      'teats': 2,
                      'fabric': 1,
                      'chuck-a-luck': 1,
                      "wasn't": 50,
                      'ordinarily': 1,
                      'bore': 3,
                      'dying': 5,
                      'Lord': 6,
                      'strip': 2,
                      'curiously': 2,
                      'hunting': 3,
                      'Coopers': 1,
                      'run': 16,
                      'discussed': 2,
                      'hundreds': 1,
                      'unconcerned': 1,
                      'grandchildren': 2,
                      'uncomfortable': 1,
                      'cacophony': 1,
                      'themselves': 10,
                      'guest': 6,
                      'Collector': 1,
                      'licking': 1,
                      'forepaws': 1,
                      'wicker': 1,
                      'positive': 4,
                      'Santa': 1,
                      "Gladdy's": 2,
                      'Stern-faced': 1,
                      'elegant': 2,
                      'defeatism': 1,
                      'aviator': 1,
                      'pulled': 8,
                      'sprinkling': 2,
                      'framed': 1,
                      'novelty': 1,
                      'visited': 3,
                      'loaded': 1,
                      'Also': 1,
                      'radio': 1,
                      'coat': 6,
                      'delivery': 1,
                      'trees': 7,
                      "mother's": 4,
                      'Oscar': 1,
                      'Japanese': 14,
                      'flatter': 1,
                      'due': 4,
                      'seeming': 1,
                      'plea': 1,
                      'warmed': 2,
                      'furniture': 2,
                      'sixty': 2,
                      'marry': 7,
                      'problem': 2,
                      'pulse': 2,
                      'winter': 5,
                      'distance': 7,
                      'mutineer': 1,
                      'walking': 12,
                      'percentage': 1,
                      'receive': 1,
                      'quok': 1,
                      'insurmountable': 1,
                      'midweek': 1,
                      'reflect': 1,
                      'looking': 36,
                      'azalea': 2,
                      'crook': 1,
                      'Watch': 1,
                      'Time': 1,
                      'humor': 5,
                      'periods': 1,
                      'baggy': 2,
                      'Pendleton': 3,
                      "We're": 4,
                      'sentimental': 2,
                      'Big': 3,
                      'bottle': 8,
                      'quench': 1,
                      'pointed': 7,
                      'whiskered': 1,
                      'determination': 1,
                      'slowly': 22,
                      'unlocked': 2,
                      'place': 45,
                      'piece': 2,
                      'rode': 4,
                      'laments': 2,
                      'amusement': 1,
                      'tasted': 1,
                      'underestimate': 1,
                      'Straightened': 1,
                      'undying': 1,
                      'atmosphere': 2,
                      'blessed': 1,
                      'melting': 1,
                      'urgent': 1,
                      'Ugh': 1,
                      'Eh': 2,
                      'instrument': 1,
                      'could': 193,
                      'Especially': 1,
                      'enjoying': 2,
                      'winking': 2,
                      'conform': 1,
                      'audience': 1,
                      'rich': 6,
                      'stopped': 25,
                      'Wilson': 3,
                      'dilapidated': 1,
                      'elaborate': 2,
                      'creating': 1,
                      'pride': 2,
                      'sequence': 1,
                      'okay': 1,
                      'tapping': 1,
                      'darkened': 2,
                      'increasingly': 1,
                      'brute': 1,
                      'snippy': 1,
                      'ran': 13,
                      'wagons': 1,
                      'optimism': 1,
                      'Cars': 1,
                      'meant': 11,
                      'tiny': 12,
                      'picture': 8,
                      'guarding': 1,
                      'A.M.': 1,
                      'sympathetic': 2,
                      'unemployment': 1,
                      'stairway': 1,
                      'buildings': 4,
                      'between': 22,
                      'cost': 3,
                      'glorying': 1,
                      'intimate': 3,
                      'noncommittally': 1,
                      'eighth': 1,
                      'gilded': 1,
                      'emerge': 1,
                      'daydreaming': 1,
                      'higher': 3,
                      'Ready': 1,
                      'Ronald': 3,
                      'forever-Cathy': 1,
                      "Pete's": 1,
                      'sewed': 1,
                      'nod': 3,
                      'firmly': 2,
                      'backed': 1,
                      'fruit': 1,
                      'yanking': 1,
                      'dismissed': 1,
                      'nowhere': 2,
                      'told': 47,
                      'paot': 3,
                      'discovery': 1,
                      'chants': 1,
                      'The': 230,
                      'heat': 4,
                      'downstairs': 4,
                      'dilatation': 1,
                      'messing': 1,
                      'Hodges': 2,
                      'good-living': 1,
                      'Boy': 1,
                      'of': 1186,
                      'bicarbonate': 1,
                      'law': 1,
                      'transcending': 1,
                      'stained': 1,
                      'squirt': 1,
                      'dainty': 1,
                      'virility': 1,
                      'rear': 1,
                      'Homes': 1,
                      'unnnt': 1,
                      'yearned': 1,
                      'corner': 9,
                      'bold': 1,
                      'cups': 2,
                      'industrialist': 1,
                      'lived': 19,
                      'wars': 1,
                      'Larkspur': 2,
                      'spoiled': 2,
                      'mingled': 1,
                      'watching': 20,
                      'maid': 4,
                      'numb': 1,
                      'facts': 2,
                      'Hudson': 1,
                      'unusually': 1,
                      'Rhine-Main': 1,
                      "Riverside's": 1,
                      'assembled': 1,
                      'announcing': 1,
                      'peculiar': 2,
                      'display': 2,
                      'sherbet-colored': 1,
                      'wallpaper': 2,
                      'funny': 7,
                      'factory': 1,
                      'entry': 1,
                      'overhead': 3,
                      'outside': 11,
                      'hung': 3,
                      'arc': 1,
                      'book': 10,
                      'suggested': 6,
                      'begging': 2,
                      'Renaissance': 1,
                      'jacket': 9,
                      'stairways': 1,
                      'Marvelous': 1,
                      'pressure': 2,
                      'bridesmaids': 1,
                      'sullen': 3,
                      'knowingly': 1,
                      'intend': 1,
                      'endlessly': 1,
                      'snuck': 1,
                      'purpose': 1,
                      'give': 28,
                      'loathing': 1,
                      'clamped': 2,
                      'absolute': 1,
                      'misfortune': 1,
                      'dishes': 1,
                      'tape': 2,
                      'beauty': 6,
                      'odor': 1,
                      'flatly': 1,
                      'glove': 3,
                      'tunnel': 1,
                      'blond': 2,
                      'Richard': 17,
                      'waving': 3,
                      'limpid': 1,
                      'sanction': 1,
                      'strenuous': 1,
                      'applying': 1,
                      'allow': 1,
                      'shuttered': 1,
                      'blushed': 1,
                      'library': 3,
                      'cancer': 2,
                      'missing': 6,
                      'actually': 5,
                      'Several': 1,
                      'rocked': 2,
                      'explanation': 1,
                      'replied': 2,
                      'juice': 4,
                      'Malta': 1,
                      'purtiest': 1,
                      'after': 58,
                      'questioning': 2,
                      'ward': 6,
                      'reached': 6,
                      "Brush-off's": 1,
                      'anyway': 10,
                      'response': 2,
                      'bargain': 1,
                      'authority': 1,
                      'Perhaps': 9,
                      'grace': 3,
                      'confused': 2,
                      'cherries': 1,
                      'Shall': 1,
                      'strikingly': 1,
                      'big': 36,
                      'overhand': 1,
                      'quantities': 1,
                      'inspect': 1,
                      'femme': 1,
                      'nine-to-five': 1,
                      'completely': 1,
                      'wooden': 3,
                      'lore': 1,
                      'freak': 2,
                      'opened': 13,
                      'Elliott': 1,
                      'soldierly': 1,
                      'Papanicolaou': 1,
                      'twenty-three': 1,
                      'Holy': 2,
                      'Heavenly': 1,
                      'satin': 1,
                      'stood': 37,
                      'worst': 3,
                      'Dolan': 2,
                      'deck': 4,
                      'lipstick': 2,
                      'drinking': 4,
                      'rages': 1,
                      'street': 14,
                      'glass': 7,
                      'bow': 1,
                      'cops': 2,
                      'priest': 1,
                      'Longue': 3,
                      'Leaning': 1,
                      'wears': 1,
                      'mist-like': 1,
                      'accuse': 1,
                      'town': 15,
                      'now': 83,
                      'solos': 1,
                      'azaleas': 2,
                      'radiant': 2,
                      'upsets': 1,
                      'suddenly': 22,
                      'colder': 1,
                      'gets': 4,
                      'Quint': 11,
                      'weatherproof': 1,
                      'lay': 11,
                      'loss': 3,
                      "Deegan's": 2,
                      'Jobs': 1,
                      'three': 17,
                      'shut': 9,
                      'blazing': 1,
                      'Because': 3,
                      'stock': 7,
                      'Spring': 4,
                      'downed': 1,
                      'San': 5,
                      'saddles': 1,
                      'virtue': 1,
                      'pat': 1,
                      'pure': 4,
                      'organize': 1,
                      'shed': 1,
                      'patiently': 2,
                      'lips': 11,
                      'blossom': 1,
                      'mama': 1,
                      'joy': 4,
                      'beast': 1,
                      "Wouldn't": 1,
                      'dark-gray': 1,
                      'car': 20,
                      'godless': 1,
                      'shine': 1,
                      'discovered': 5,
                      'crickets': 1,
                      'salad': 1,
                      'present': 9,
                      'sonofabitch': 2,
                      'science': 6,
                      'unnaturally': 1,
                      'be': 289,
                      'alternative': 1,
                      'racing': 3,
                      'names': 1,
                      'ready': 11,
                      'Ten': 1,
                      'witches': 1,
                      'abundance': 1,
                      'France': 1,
                      'engaged': 1,
                      'dark-blue': 1,
                      'helluva': 1,
                      'conclusion': 1,
                      'ceased': 1,
                      'janitor': 1,
                      'minutes': 9,
                      'unbent': 1,
                      'remote': 3,
                      'touching': 5,
                      'joined': 4,
                      "He's": 14,
                      'nose': 7,
                      'sharpened': 1,
                      'covering': 1,
                      'afternoon': 12,
                      'prisoner': 2,
                      'teasing': 1,
                      'drained': 2,
                      'thousand-legged': 1,
                      'frivolous': 2,
                      'commercial': 1,
                      'sun-warmed': 1,
                      'harbor': 2,
                      'shivered': 2,
                      'satin-covered': 1,
                      'brandishing': 2,
                      'fail': 2,
                      'boarding': 1,
                      'burial': 2,
                      'mergers': 1,
                      "lovers'": 1,
                      'dignity': 4,
                      'force': 4,
                      'replaced': 2,
                      'Puny': 1,
                      'sneaky': 1,
                      'dawns': 1,
                      'prevented': 1,
                      'popularly': 1,
                      'rackety': 1,
                      'eastern': 1,
                      'dinners': 2,
                      'wildly': 5,
                      'Eating': 1,
                      'Donald': 7,
                      "child's": 3,
                      'revealed': 2,
                      'alone': 24,
                      'pardon': 1,
                      'fetes': 1,
                      'cent': 3,
                      'Half': 2,
                      'beneath': 6,
                      'bullet': 1,
                      'cramp': 1,
                      "You'd": 2,
                      'into': 136,
                      'encouragingly': 1,
                      'snoring': 1,
                      'address': 3,
                      'else': 29,
                      'skipped': 3,
                      'justify': 1,
                      'let': 39,
                      'fonder': 1,
                      'becoming': 3,
                      'promising': 2,
                      'outraged': 1,
                      'artfully': 1,
                      'necessity': 2,
                      'brother': 5,
                      'Somebody': 4,
                      'managed': 4,
                      'gleaming': 1,
                      'Dillinger': 1,
                      'fifty-fifty': 1,
                      'method': 1,
                      'murder': 4,
                      'Swan': 1,
                      'scarf': 2,
                      'grown-up': 1,
                      'Kare': 1,
                      'sweeping': 1,
                      'forgiven': 1,
                      'conceptions': 1,
                      "Alexander's": 2,
                      'gumming': 1,
                      'ironed': 1,
                      'hunger': 2,
                      'husband-stealer': 1,
                      'insiders': 1,
                      "Partlow's": 1,
                      'directors': 1,
                      'instigating': 1,
                      'pupils': 3,
                      'straining': 2,
                      'switch': 1,
                      'desiring': 1,
                      'Subdued': 1,
                      'Gordon': 2,
                      'singsonged': 1,
                      'electric': 1,
                      'jammed': 2,
                      'muttered': 3,
                      'secretaries': 1,
                      'absurd': 3,
                      ')': 21,
                      'forgive': 1,
                      'corners': 1,
                      'around': 68,
                      'understands': 1,
                      'edges': 2,
                      "I'll": 53,
                      'mosaic': 1,
                      'parades': 1,
                      'maw': 1,
                      'monotonous': 1,
                      'overly': 1,
                      'tawny': 1,
                      'naked': 7,
                      'clothes': 26,
                      'merest': 1,
                      'lukewarm': 1,
                      'advancement': 1,
                      'dignified': 1,
                      'thankful': 3,
                      'released': 1,
                      'neatness': 1,
                      'rapping': 2,
                      'attracted': 1,
                      'letter': 19,
                      'hide': 1,
                      "Horne's": 1,
                      'frenziedly': 1,
                      'infected': 1,
                      'parked': 1,
                      'biggest': 1,
                      'from': 202,
                      'honestly': 2,
                      "four-o'clock": 1,
                      'Swallow': 1,
                      'ages': 2,
                      'warning': 2,
                      'appearance': 4,
                      'atheists': 1,
                      'Pollock': 1,
                      'ball': 18,
                      'paunch': 1,
                      'Mother': 5,
                      'half-murmured': 1,
                      'Fudo': 7,
                      'reality': 1,
                      "Concetta's": 2,
                      'stretch': 1,
                      'gambit': 1,
                      'highball': 2,
                      'studied': 7,
                      'twenty-five': 1,
                      'appreciate': 1,
                      'years': 34,
                      'sticks': 3,
                      'East': 4,
                      'probably': 13,
                      'brunettes': 1,
                      'frequently': 1,
                      'During': 2,
                      'Cold': 1,
                      'pilgrim': 1,
                      'tore': 3,
                      'Obviously': 1,
                      'cashmere': 1,
                      'what': 121,
                      'or': 150,
                      'alloy': 1,
                      "Don't": 24,
                      'states': 2,
                      "Jim's": 3,
                      'shaky': 2,
                      'put-upon': 1,
                      'city': 10,
                      'sex': 4,
                      'ginkgo': 1,
                      'stomach': 8,
                      'sensing': 1,
                      'skins': 1,
                      'backing': 1,
                      'easier': 3,
                      'consciousness': 1,
                      'thoughtfully': 1,
                      'accordance': 1,
                      'Cursed': 1,
                      'stickman': 1,
                      'general': 3,
                      'sprawled': 2,
                      'Sam': 18,
                      'invite': 1,
                      'heart': 17,
                      'bleak': 1,
                      'national': 5,
                      'tomorrow': 10,
                      'umbrella': 2,
                      'expanding': 2,
                      'grasp': 1,
                      'contracted': 1,
                      "lady's": 2,
                      'called': 31,
                      'namesake': 1,
                      'year': 14,
                      'victory': 2,
                      'Dunne': 4,
                      'Clouds': 4,
                      'Too': 2,
                      'courage': 4,
                      'subdued': 2,
                      'painted': 4,
                      'agricultural': 1,
                      'susceptible': 1,
                      'bring': 9,
                      'shopping': 1,
                      'uncomfortably': 2,
                      'showering': 1,
                      'hurled': 1,
                      'ripening': 1,
                      'wardroom': 1,
                      'protected': 2,
                      'wonder': 10,
                      'solution': 3,
                      'born': 1,
                      'sharply': 3,
                      'makeshift': 1,
                      'chance': 16,
                      'tenant': 1,
                      'spirits': 8,
                      'Methodist': 1,
                      'blackjack': 2,
                      'ailment': 1,
                      'fellow': 5,
                      'halfway': 1,
                      'swears': 1,
                      'graham': 2,
                      'Shades': 2,
                      'not': 250,
                      'Adrien': 2,
                      'sons': 2,
                      'taught': 2,
                      'fuss': 1,
                      'basket': 2,
                      'isolating': 1,
                      'suffer': 1,
                      'complained': 1,
                      'Cross': 1,
                      'varicolored': 1,
                      'college': 12,
                      'envisioned': 1,
                      'placed': 4,
                      'Handsomest': 1,
                      'boat': 2,
                      'crumbling': 1,
                      'noble': 1,
                      'illustrated': 1,
                      'Nor': 3,
                      'Unlike': 1,
                      'sliding': 1,
                      'glanced': 3,
                      'worried': 7,
                      'hated': 6,
                      'soda': 1,
                      'arrangement': 1,
                      'limping': 1,
                      'laced': 1,
                      'Anniston': 15,
                      'five': 6,
                      'Rosa': 4,
                      'bankruptcy': 1,
                      'common': 3,
                      'Arcilla': 1,
                      'mash': 1,
                      'less': 12,
                      'movement': 1,
                      'gleeful': 1,
                      'scraping': 1,
                      'thrusting': 2,
                      "mustn't": 2,
                      'participate': 1,
                      'jocular': 1,
                      'capital': 1,
                      'cords': 1,
                      'application': 2,
                      'sulked': 1,
                      'temperament': 1,
                      'infection': 1,
                      'smaller': 2,
                      'cheerful': 3,
                      'AA': 1,
                      'dragooned': 1,
                      'sting': 2,
                      'word': 17,
                      'jobless': 1,
                      'magnificent': 2,
                      'Mattie': 1,
                      'restriction': 1,
                      'thinks': 4,
                      'ground': 8,
                      "We'd": 3,
                      'primitive': 1,
                      'bouts': 1,
                      'shrubs': 1,
                      'sloppy': 1,
                      'Walter': 2,
                      'stir': 1,
                      'triangular': 1,
                      'plodding': 2,
                      'blood-soaked': 1,
                      'disquiet': 1,
                      'business': 21,
                      'strength': 8,
                      'leads': 1,
                      'wreck': 2,
                      'adults': 1,
                      'Cooper': 8,
                      'Freed': 1,
                      'conclusions': 1,
                      'swim': 1,
                      'unusual': 5,
                      'Young': 2,
                      'clasped': 2,
                      'talons': 1,
                      'white': 30,
                      'dreamed': 4,
                      'Named': 1,
                      'Road': 1,
                      'Pink': 1,
                      'familial': 1,
                      'hopped': 1,
                      'stumbled': 5,
                      'spit': 1,
                      'carved': 1,
                      'stones': 2,
                      'sort': 10,
                      'bookish': 1,
                      'temptation': 2,
                      'cats': 2,
                      'mastiff': 1,
                      'stoop': 1,
                      'partially': 1,
                      'Petty': 1,
                      'trudged': 1,
                      'caves': 1,
                      'artificial': 1,
                      'safely': 2,
                      'depending': 1,
                      'depression': 3,
                      'assume': 1,
                      'Eugenia': 15,
                      'bear': 4,
                      'wicked': 2,
                      'chapel-like': 1,
                      'co-operate': 1,
                      'potential': 2,
                      'schools': 2,
                      'darted': 1,
                      'recalled': 2,
                      'Byron': 1,
                      'casually': 1,
                      'pleasantly': 1,
                      'handspikes': 3,
                      'russe': 1,
                      'visitors': 1,
                      'Hotel': 1,
                      'risky': 1,
                      'bless': 3,
                      'swift': 1,
                      'Chinaman': 1,
                      'remoteness': 1,
                      'hangers': 1,
                      'cremate': 1,
                      'censure': 1,
                      'crates': 1,
                      'obligated': 1,
                      'traffic': 3,
                      'ear': 3,
                      'tea': 3,
                      'wire-haired': 1,
                      'shackles': 2,
                      'best': 12,
                      'sweetly': 2,
                      'loyal': 1,
                      'trestles': 1,
                      'unquenched': 1,
                      'lingerie': 1,
                      'white-columned': 1,
                      ...}),
             'science_fiction': Counter({'premise': 1,
                      'Without': 1,
                      'vacuum': 1,
                      'critically': 1,
                      'W.': 1,
                      'read': 6,
                      'limited': 2,
                      'logical': 2,
                      'triumphantly': 1,
                      'dig': 1,
                      "They'll": 1,
                      'beyond': 1,
                      'aboard': 2,
                      'anyhow': 1,
                      'speak': 4,
                      'unaware': 1,
                      'encouraged': 1,
                      'Many': 1,
                      'grow': 5,
                      'hegemony': 1,
                      'company': 3,
                      'feel': 4,
                      'muscles': 1,
                      'fast': 4,
                      'long-distance': 1,
                      'darling': 1,
                      'building': 1,
                      'personnel': 6,
                      'Between': 1,
                      'smell': 1,
                      'eaten': 1,
                      'walked': 3,
                      'use': 4,
                      'considered': 1,
                      'worse': 1,
                      'reconstruct': 1,
                      'grok': 5,
                      'wept': 2,
                      'experience': 4,
                      'whichever': 1,
                      'rest': 2,
                      'endure': 1,
                      'whisper': 1,
                      'served': 1,
                      'committee': 2,
                      'dead': 7,
                      'tear': 1,
                      'sheered': 1,
                      'Ozagenians': 1,
                      'Australia': 2,
                      'trip': 3,
                      'hygiene': 1,
                      'survived': 2,
                      'has': 9,
                      'controlled': 1,
                      "That'll": 1,
                      'knot': 1,
                      'are': 23,
                      'maintenance': 1,
                      "instant's": 1,
                      'our': 6,
                      'along': 2,
                      'promoted': 2,
                      'side-effects': 1,
                      'certain': 2,
                      'season': 1,
                      'heaved': 1,
                      'God': 5,
                      'demanded': 2,
                      'My': 2,
                      'others': 4,
                      'philology': 1,
                      'covered': 2,
                      'happiness': 5,
                      'Muslim': 1,
                      'indicating': 1,
                      "skiff's": 1,
                      'anyone': 2,
                      'sure': 5,
                      'entertain': 1,
                      'colossal': 1,
                      'transition': 1,
                      'accident': 3,
                      'calendars': 1,
                      'contrite': 1,
                      'catch': 2,
                      'equally': 1,
                      'contributes': 1,
                      'beautiful': 6,
                      'elaborately': 1,
                      'Ringing': 1,
                      'nymphs': 1,
                      'retorted': 1,
                      'articulate': 1,
                      'first': 15,
                      'is': 47,
                      'pity': 1,
                      'implant': 1,
                      'extensions': 1,
                      'instruments': 1,
                      'alive': 4,
                      'threes-fulfilled': 1,
                      'cradled': 1,
                      'approximations': 1,
                      'shelled': 1,
                      'Malay': 1,
                      'spaces': 1,
                      'Radiation': 1,
                      'page': 1,
                      'discovery': 1,
                      'determine': 1,
                      'under': 5,
                      'infinitive': 1,
                      'correct': 2,
                      'charming': 1,
                      'vibrancy': 1,
                      'none': 4,
                      'glance': 2,
                      'plan': 2,
                      'streaks': 1,
                      'amount': 2,
                      'sonic': 1,
                      'orders': 1,
                      'died': 3,
                      'ship': 20,
                      'ripples': 1,
                      'mineral': 1,
                      'pressure': 3,
                      'unlikely': 1,
                      'Short': 3,
                      'dominant': 1,
                      'ABC': 1,
                      'scientists': 1,
                      "you'll": 1,
                      'directed': 1,
                      'normal': 5,
                      'Jack': 18,
                      'hear': 2,
                      'amended': 1,
                      'start': 3,
                      'man': 17,
                      'parlor': 1,
                      'interrupted': 1,
                      'been': 40,
                      'augmented': 1,
                      'organized': 1,
                      'through': 15,
                      'proper': 4,
                      'shockwave': 1,
                      'corporate': 1,
                      'Neither': 1,
                      'ring': 1,
                      "we'll": 1,
                      'each': 12,
                      'nondefeatist': 1,
                      'Plus': 1,
                      'slid': 1,
                      'free': 3,
                      'dwindle': 1,
                      'hug': 1,
                      'shape': 1,
                      'terrestrial': 1,
                      'grabbing': 1,
                      'decaying': 1,
                      'plain': 3,
                      'changes': 1,
                      'forget': 2,
                      'apparently': 2,
                      'Icelandic': 1,
                      'interstellar': 1,
                      'raised': 1,
                      'love': 3,
                      'Todd': 1,
                      'rise': 1,
                      'clubbed': 1,
                      'repeated': 2,
                      'eventually': 4,
                      'earlier': 1,
                      "'": 11,
                      'unfathomable': 1,
                      'swath': 1,
                      'comes': 1,
                      'service': 1,
                      'day': 6,
                      'hard': 1,
                      'At': 4,
                      'half-transparent': 1,
                      "I'm": 12,
                      'hand-covered': 1,
                      'mastery': 1,
                      'die': 6,
                      'BCD': 1,
                      'Daily': 1,
                      'commingled': 1,
                      'rebuild': 1,
                      'Icelandic-speaking': 1,
                      'green': 1,
                      'precedent': 1,
                      'Romano': 1,
                      'ago': 6,
                      'ladies': 2,
                      'Think': 1,
                      'cartilage': 1,
                      'husband': 2,
                      'friends': 1,
                      'gritty-eyed': 1,
                      'bet': 1,
                      'home': 3,
                      'sands': 1,
                      'insisted': 1,
                      'want': 6,
                      'manage': 1,
                      'negative': 2,
                      'Afterwards': 1,
                      'killed': 4,
                      'small': 6,
                      'Miss': 1,
                      'Times': 1,
                      'solid': 2,
                      'Fuming': 1,
                      'red': 1,
                      'did': 35,
                      'grokked': 4,
                      'copied': 1,
                      'pool': 1,
                      'appearance': 1,
                      'forward': 1,
                      'barriers': 1,
                      'disarmament': 1,
                      'Hills': 1,
                      'lost': 8,
                      'enforced': 1,
                      'babes': 1,
                      'grounds': 1,
                      'Sicily': 1,
                      'content': 2,
                      'message': 1,
                      'next': 3,
                      'Nations': 1,
                      'Fosterite': 1,
                      'infection': 1,
                      'screens': 1,
                      'shall': 3,
                      'waist': 1,
                      'examined': 1,
                      'really': 2,
                      'peoples': 2,
                      'ball': 1,
                      'might': 12,
                      'Mercer': 26,
                      'Uh-huh': 1,
                      'please': 1,
                      'Please': 1,
                      'comforting': 1,
                      'mates': 1,
                      'barefoot': 1,
                      'female': 3,
                      'ritual': 1,
                      'effect': 1,
                      'selected': 2,
                      "don't": 11,
                      'comment': 1,
                      'thin': 3,
                      'attractive': 1,
                      'thrashed': 1,
                      'truth': 2,
                      'courses': 1,
                      'Out': 1,
                      'party': 4,
                      'card': 1,
                      'physiological': 1,
                      'masculine': 2,
                      'weakness': 1,
                      'Hawaiian-Americans': 1,
                      'together': 1,
                      'bunk': 6,
                      'me': 20,
                      'dwarfs': 1,
                      'fear': 1,
                      'keen': 1,
                      'average': 1,
                      'South': 2,
                      'races': 3,
                      'remained': 2,
                      'needed': 5,
                      'People': 3,
                      'late': 1,
                      'Others': 1,
                      'dance': 1,
                      'choking': 1,
                      'Maybe': 4,
                      'ask': 6,
                      'Greenland': 1,
                      'veldt': 1,
                      'found': 9,
                      'sad': 3,
                      'touch': 3,
                      'war': 2,
                      'included': 1,
                      'doubted': 1,
                      'nevertheless': 1,
                      'Nevertheless': 1,
                      'exploring': 1,
                      ':': 10,
                      'titanium': 1,
                      'domes': 1,
                      'What': 14,
                      'pretty': 2,
                      'sense': 5,
                      'supported': 1,
                      'Caribbean': 1,
                      'things': 2,
                      'Now': 8,
                      'Earthmen': 5,
                      'shipboard': 1,
                      'its': 13,
                      'can': 16,
                      'gentleman': 1,
                      'master': 1,
                      'instructing': 1,
                      'independent': 1,
                      'phrase': 2,
                      'dismal': 1,
                      ';': 88,
                      'poured': 1,
                      'rabbit': 1,
                      'forced': 5,
                      'neuter': 1,
                      'Mary': 5,
                      'Swahili': 1,
                      'Suppose': 3,
                      'lights': 1,
                      'expression': 1,
                      'mind': 15,
                      'sir': 2,
                      'field': 5,
                      'warts': 1,
                      'Breath': 1,
                      'snack': 1,
                      "What's": 3,
                      '``': 235,
                      'Urielites': 1,
                      'leave': 2,
                      'Hebrew': 1,
                      'evidence': 1,
                      'importance': 1,
                      'power': 7,
                      'tears': 2,
                      'eliminating': 1,
                      'entire': 1,
                      'permanent': 1,
                      'shadow': 1,
                      'any': 17,
                      'alto': 1,
                      'voyage': 1,
                      'In': 12,
                      'belt': 2,
                      'observe': 1,
                      'cycles': 1,
                      'automatically': 2,
                      'required': 3,
                      'associations': 1,
                      'absolutes': 1,
                      'concealed': 1,
                      'riot': 1,
                      'Lord': 1,
                      'I.Q.': 1,
                      'unique': 2,
                      'beasts': 1,
                      'burning': 1,
                      'practicality': 1,
                      'run': 4,
                      'poisoned': 1,
                      'experimental': 1,
                      'victims': 1,
                      'connections': 1,
                      'exactly': 1,
                      'sparse': 1,
                      'done': 3,
                      'web': 3,
                      'tone': 2,
                      'week': 1,
                      'uniform': 1,
                      'uncomfortable': 2,
                      'thousand': 3,
                      'govern': 1,
                      'verb': 1,
                      'themselves': 3,
                      'Ariadne': 2,
                      'send': 2,
                      'occur': 1,
                      'dome': 2,
                      'music': 2,
                      'visit': 1,
                      'stepping': 1,
                      'Positive': 1,
                      'moral': 1,
                      'simultaneously': 2,
                      "world's": 1,
                      'fine': 2,
                      'himself': 17,
                      'agitation': 1,
                      'dropped': 1,
                      're-examine': 1,
                      'mental': 2,
                      'atomic': 2,
                      'Angels': 8,
                      'my': 30,
                      'agreed': 5,
                      'unnecessary': 1,
                      'Besides': 2,
                      'species': 1,
                      'invade': 1,
                      'considerably': 1,
                      'citizens': 2,
                      'activating': 1,
                      'spring': 1,
                      'centuries': 1,
                      'longer': 8,
                      'sing': 4,
                      'strange': 4,
                      'Though': 3,
                      'Manchester': 1,
                      'methods': 1,
                      'scowled': 1,
                      'loaded': 1,
                      'grokking': 4,
                      'seismological': 1,
                      'wanting': 1,
                      'doorway': 1,
                      'growing': 6,
                      'matches': 1,
                      'water': 7,
                      'Some': 4,
                      'tiptoeing': 1,
                      'miracle': 1,
                      'agreements': 2,
                      'Antares': 1,
                      'unfurled': 1,
                      'considering': 1,
                      'guys': 2,
                      'everybody': 2,
                      'cut': 3,
                      'men': 5,
                      'match': 2,
                      'lotus': 1,
                      'photographs': 1,
                      'marry': 1,
                      'serve': 2,
                      'while': 8,
                      'blaze': 1,
                      'cook': 1,
                      'distance': 2,
                      'feeling': 2,
                      'walking': 1,
                      'sit': 2,
                      'completed': 1,
                      'approach': 1,
                      'final': 3,
                      'face': 8,
                      'That': 5,
                      'looking': 1,
                      "Digby's": 1,
                      'Shadow': 1,
                      'hostile': 1,
                      'CB': 1,
                      'Time': 4,
                      'humor': 2,
                      'periods': 1,
                      'ponderous': 1,
                      'hindered': 1,
                      'Bazaar': 1,
                      'March': 1,
                      'resembled': 1,
                      'lie': 2,
                      'attract': 1,
                      'Anyhow': 1,
                      'reaction': 1,
                      'towards': 4,
                      'Suspicion': 1,
                      'argued': 1,
                      'predict': 1,
                      'Science': 1,
                      'pointed': 2,
                      'Lady': 7,
                      'realized': 1,
                      'If': 13,
                      'same': 6,
                      'dragged': 1,
                      'gay': 1,
                      'midnight': 3,
                      'states': 2,
                      'illness': 1,
                      'place': 8,
                      'swiftly': 1,
                      'elders': 3,
                      'patsy': 1,
                      'committees': 1,
                      'sigh': 2,
                      'amusement': 1,
                      'tasted': 1,
                      'Mike': 20,
                      'Even': 2,
                      'preferences': 2,
                      "B'dikkat's": 1,
                      'jump': 4,
                      'atmosphere': 4,
                      'the': 652,
                      'Thank': 1,
                      'photon-counting': 1,
                      'sometimes': 1,
                      'greater': 1,
                      'therefore': 2,
                      'remain': 1,
                      'Over': 1,
                      'taps': 1,
                      'help': 8,
                      'Philippine': 1,
                      'could': 49,
                      'chipper': 1,
                      'dissimilar': 1,
                      'failed': 1,
                      'sixteenth': 1,
                      'audience': 1,
                      'trusting': 1,
                      'triggered': 1,
                      'opening': 1,
                      'stopped': 3,
                      'decorticated': 1,
                      'aloud': 1,
                      'elaborate': 1,
                      'pride': 1,
                      'planetoids': 1,
                      'rich': 1,
                      'if': 16,
                      'sorry': 1,
                      'darkened': 1,
                      'sealed': 1,
                      'spoke': 9,
                      'Soviet': 1,
                      'something': 8,
                      'astronomical': 1,
                      'here': 8,
                      'ran': 2,
                      'matsyendra': 1,
                      'optimism': 1,
                      'defend': 1,
                      'meant': 6,
                      'actions': 1,
                      'shield': 1,
                      'old': 3,
                      'screams': 1,
                      'tiny': 3,
                      'paced': 1,
                      'knotty': 1,
                      'guru': 1,
                      'regret': 1,
                      'hospital': 1,
                      'politely': 2,
                      'picture': 2,
                      'mature': 1,
                      'morrow': 1,
                      'making': 3,
                      'full': 2,
                      'performance': 2,
                      'count': 2,
                      'Stinky': 1,
                      'correlating': 1,
                      'glittering': 1,
                      'between': 10,
                      'base': 1,
                      'commissioned': 1,
                      'corrosive': 1,
                      'recognized': 2,
                      'reactors': 1,
                      'air': 1,
                      'report': 1,
                      'solar': 1,
                      'scraped': 1,
                      'answers': 1,
                      'metal': 2,
                      'sound': 5,
                      'concerti': 1,
                      'film': 1,
                      'lovelies': 1,
                      'World': 1,
                      'sleepily': 2,
                      'assigned': 1,
                      'insoluble': 1,
                      'diplomatic': 2,
                      'eyes': 9,
                      'lightyears': 1,
                      'Holies': 1,
                      'hearth': 1,
                      'deformities': 1,
                      'Ones': 2,
                      'deposits': 1,
                      'brilliantly': 1,
                      'Coal': 2,
                      'Tabernacle': 1,
                      'felt': 13,
                      'discorporate': 2,
                      'Not': 1,
                      'modern': 1,
                      "Hal's": 1,
                      'spat': 1,
                      "Angel's": 2,
                      'chills': 1,
                      'baby': 1,
                      'hideous': 1,
                      'dismissed': 1,
                      'girls': 2,
                      'told': 3,
                      'they': 53,
                      'deceleration': 2,
                      'chicken': 1,
                      'find': 4,
                      'center': 5,
                      'The': 71,
                      'casual': 1,
                      'heat': 1,
                      'downstairs': 1,
                      'gases': 2,
                      'boot': 1,
                      'of': 321,
                      'parts': 2,
                      'crooned': 2,
                      'celebrated': 1,
                      'gripes': 1,
                      'published': 2,
                      'law': 2,
                      'spiritual': 1,
                      'Where': 5,
                      'scouts': 1,
                      'blastdown': 1,
                      'fact': 5,
                      'where': 10,
                      'Larry': 1,
                      'advanced': 1,
                      'whole': 1,
                      'fleeting': 1,
                      'lavishly': 1,
                      'limits': 1,
                      'private': 1,
                      'memorize': 1,
                      'presumed': 1,
                      'He': 52,
                      'lived': 3,
                      'scanners': 1,
                      'threshed': 1,
                      'protest': 1,
                      'Candide': 1,
                      'rewarding': 1,
                      'annals': 1,
                      'trouble': 3,
                      'fell': 2,
                      'totality': 1,
                      'reference': 1,
                      'compute': 1,
                      'pains': 2,
                      'low': 4,
                      'facts': 2,
                      'cremated': 1,
                      'Hudson': 1,
                      'moss': 1,
                      'unusually': 1,
                      'enjoyed': 3,
                      'virtually': 1,
                      'disastrous': 1,
                      'weep': 1,
                      'peculiar': 1,
                      'faster': 2,
                      'too': 14,
                      'Siberia': 2,
                      'linguists': 1,
                      'bulwark': 1,
                      'peculiarities': 1,
                      'aspect': 1,
                      'confirmed': 1,
                      '25': 1,
                      'outside': 8,
                      'later': 6,
                      'partnered': 1,
                      'resumed': 1,
                      'section': 1,
                      'language': 3,
                      'practicing': 1,
                      'book': 1,
                      'sprawled': 1,
                      'every': 4,
                      'Shayol': 4,
                      'following': 2,
                      'eggs': 1,
                      'deliberately': 1,
                      'suggested': 3,
                      'Naturally': 2,
                      'nature': 2,
                      'event': 1,
                      'married': 3,
                      'codified': 1,
                      'instantly': 1,
                      'do-good': 1,
                      'shrugged': 1,
                      'doing': 4,
                      '5': 1,
                      'ticket': 1,
                      '138': 1,
                      'purpose': 1,
                      'star': 4,
                      'whereby': 1,
                      'smiling': 1,
                      'Thus': 2,
                      'visual': 2,
                      'number': 3,
                      'enormous': 2,
                      'eat': 3,
                      'practice': 1,
                      'tape': 1,
                      'beauty': 1,
                      "it'll": 1,
                      'minute': 1,
                      'harvest': 1,
                      "he'll": 1,
                      'floor': 1,
                      'properly': 1,
                      'slave': 1,
                      'Tibetan': 1,
                      'angelic': 2,
                      'far': 2,
                      'mapping': 1,
                      'mutual': 1,
                      'easily': 1,
                      'planet': 10,
                      'flash': 1,
                      'shortly': 1,
                      'blushed': 1,
                      'library': 2,
                      'blanket': 1,
                      'Dawn': 1,
                      'walk': 1,
                      'disgrace': 1,
                      'missing': 1,
                      'actually': 1,
                      'shouldered': 1,
                      'dressed': 1,
                      'limitations': 1,
                      'nor': 1,
                      'after': 11,
                      'rock-steady': 1,
                      'thought': 11,
                      'ward': 2,
                      'lungs': 1,
                      'anyway': 1,
                      'programed': 1,
                      'on': 85,
                      'conviction': 1,
                      'bargain': 1,
                      'directions': 1,
                      'Perhaps': 3,
                      'Hell': 1,
                      'showing': 1,
                      'big': 6,
                      'ambulatory': 1,
                      'surviving': 1,
                      'recommend': 1,
                      'put': 2,
                      'bodies': 10,
                      'Sandalphon': 3,
                      'Mars': 9,
                      'telescope': 1,
                      'Arizona': 1,
                      'pleasure': 4,
                      'civilization': 1,
                      'cold': 2,
                      'punk': 1,
                      'northern': 3,
                      'opened': 2,
                      'fun': 1,
                      'readings': 1,
                      'period': 4,
                      'excited': 1,
                      'radioactive': 1,
                      'misuse': 1,
                      'lamechian': 1,
                      "R.'s": 1,
                      'powers': 2,
                      'seemed': 4,
                      'decided': 2,
                      'fate': 2,
                      'Holy': 1,
                      'Heavenly': 1,
                      'stood': 5,
                      'Exploration': 1,
                      'electronic': 1,
                      'everlasting': 1,
                      'pronounced': 1,
                      'stolen': 1,
                      'these': 12,
                      'pick': 1,
                      'clear': 3,
                      'patterns': 2,
                      'girl': 2,
                      'pranha': 1,
                      'pidgin': 1,
                      'happened': 4,
                      'insomnia': 2,
                      'worth': 3,
                      'destination': 2,
                      'combine': 1,
                      'adjectives': 1,
                      'expectations': 1,
                      'squat': 1,
                      'Service': 2,
                      'cares': 1,
                      'accuse': 1,
                      'Remember': 2,
                      'hands': 7,
                      'laxness': 1,
                      'prognosis': 1,
                      'manipulation': 1,
                      'now': 19,
                      'suspensor': 2,
                      'curve': 1,
                      'A': 14,
                      'funeral': 1,
                      'planning': 1,
                      'massive': 1,
                      'radiated': 1,
                      'nicest': 1,
                      'lady': 4,
                      'chase': 1,
                      'already': 6,
                      'surrounding': 1,
                      'colder': 1,
                      'last': 9,
                      'turtle': 1,
                      'Another': 2,
                      'asteroid': 1,
                      'exercise': 1,
                      'Will': 1,
                      'three': 6,
                      'Second': 1,
                      'automatic': 2,
                      'purposes': 2,
                      'itself': 4,
                      'nations': 1,
                      'Monitor': 1,
                      'thicken': 1,
                      'Fine': 2,
                      'Schools': 1,
                      'play': 3,
                      'beginnings': 1,
                      'grinned': 1,
                      'clearer': 1,
                      'industrial': 1,
                      'grammatical': 1,
                      'take': 11,
                      'Tristan': 1,
                      'goodness': 3,
                      'notion': 1,
                      'stretched': 1,
                      'beast': 1,
                      'horrified': 2,
                      'carry': 3,
                      'apartment': 1,
                      'cloud': 1,
                      "dabhumaksanigalu'ahai": 1,
                      'personal': 1,
                      'difficulty': 1,
                      'manipulated': 1,
                      'bomb': 1,
                      'people': 20,
                      'may': 4,
                      'resources': 1,
                      'keening': 1,
                      "Let's": 1,
                      'present': 2,
                      'radar': 2,
                      'revive': 1,
                      'necessary': 1,
                      'Dark': 1,
                      'as': 64,
                      'hit': 2,
                      'helium': 1,
                      'be': 80,
                      'ecological': 1,
                      'must': 8,
                      'names': 1,
                      'helpful': 1,
                      'ready': 4,
                      'faintest': 1,
                      'corpses': 1,
                      'metallic': 1,
                      'vegetable': 1,
                      'an': 33,
                      'name': 2,
                      'winds': 2,
                      'marriage': 2,
                      'Farrell': 1,
                      'Well': 4,
                      'advantage': 1,
                      "isn't": 6,
                      'salted': 1,
                      'trying': 2,
                      'being': 11,
                      'Same': 1,
                      'miss': 1,
                      'Listen': 2,
                      'minutes': 3,
                      'mate': 1,
                      'marks': 1,
                      'oxygen': 3,
                      'nose': 1,
                      'we': 30,
                      'friendly': 1,
                      'load': 1,
                      'wrangled': 1,
                      'Venusians': 1,
                      "Earthmen's": 1,
                      'necklaces': 1,
                      'story': 1,
                      'hospitable': 1,
                      'creepy': 1,
                      'letting': 1,
                      'proficiency': 1,
                      'central': 1,
                      "Jubal's": 3,
                      'Part': 1,
                      'pieces': 1,
                      'Tibet': 1,
                      'perceive': 1,
                      'supposed': 1,
                      'Visiting': 1,
                      'force': 1,
                      'grass': 2,
                      'prevented': 1,
                      'understanding': 1,
                      'Bay': 1,
                      'bother': 2,
                      'nudged': 2,
                      'nirvana': 1,
                      'violations': 1,
                      'how': 12,
                      'possibility': 2,
                      'eastern': 1,
                      'should': 3,
                      'ten': 1,
                      'Had': 2,
                      'wildly': 1,
                      'some': 16,
                      "child's": 1,
                      'alone': 3,
                      'dusty': 2,
                      'pardon': 1,
                      'half': 10,
                      'Above': 1,
                      'food': 1,
                      'Dissect': 1,
                      'whether': 3,
                      'serviettes': 1,
                      'recordings': 2,
                      'increased': 1,
                      'living': 3,
                      'breathe': 2,
                      'craters': 1,
                      'declaration': 1,
                      'into': 22,
                      'precepts': 1,
                      'exceedingly': 1,
                      'Get': 1,
                      'question': 4,
                      'Insects': 1,
                      'address': 1,
                      'else': 2,
                      'random': 1,
                      'local': 1,
                      'classical': 1,
                      'let': 4,
                      'above': 4,
                      'qualified': 1,
                      'gave': 3,
                      'maintained': 1,
                      'We': 15,
                      'necessity': 2,
                      'grown': 3,
                      'millennia': 1,
                      'follow': 2,
                      'smelling': 1,
                      'National': 1,
                      "Gabriel's": 1,
                      'method': 1,
                      'looks': 1,
                      'shell-psychology': 1,
                      'existed': 1,
                      'trance': 2,
                      'because': 10,
                      'given': 5,
                      'give': 7,
                      "Grandmothers'": 1,
                      'situations': 1,
                      ...})})

In [19]:
genres = ['news', 'religion', 'hobbies', 'science_fiction', 'romance', 'humor']
modals = ['can', 'could', 'may', 'might', 'must', 'will']
cfd.tabulate(conditions=genres, samples=modals)


                 can could  may might must will 
           news   93   86   66   38   50  389 
       religion   82   59   78   12   54   71 
        hobbies  268   58  131   22   83  264 
science_fiction   16   49    4   12    8   16 
        romance   74  193   11   51   45   43 
          humor   16   30    8    8    9   13 

most frequent modal in 'news' is will, most frequent modal in 'romance' is could


Reuters Corpus

documents are grouped into training and test sets

something wrong with this corpus + Python 3.5.1


In [20]:
from nltk.corpus import reuters
reuters.fileids()


Out[20]:
['test/14826',
 'test/14828',
 'test/14829',
 'test/14832',
 'test/14833',
 'test/14839',
 'test/14840',
 'test/14841',
 'test/14842',
 'test/14843',
 'test/14844',
 'test/14849',
 'test/14852',
 'test/14854',
 'test/14858',
 'test/14859',
 'test/14860',
 'test/14861',
 'test/14862',
 'test/14863',
 'test/14865',
 'test/14867',
 'test/14872',
 'test/14873',
 'test/14875',
 'test/14876',
 'test/14877',
 'test/14881',
 'test/14882',
 'test/14885',
 'test/14886',
 'test/14888',
 'test/14890',
 'test/14891',
 'test/14892',
 'test/14899',
 'test/14900',
 'test/14903',
 'test/14904',
 'test/14907',
 'test/14909',
 'test/14911',
 'test/14912',
 'test/14913',
 'test/14918',
 'test/14919',
 'test/14921',
 'test/14922',
 'test/14923',
 'test/14926',
 'test/14928',
 'test/14930',
 'test/14931',
 'test/14932',
 'test/14933',
 'test/14934',
 'test/14941',
 'test/14943',
 'test/14949',
 'test/14951',
 'test/14954',
 'test/14957',
 'test/14958',
 'test/14959',
 'test/14960',
 'test/14962',
 'test/14963',
 'test/14964',
 'test/14965',
 'test/14967',
 'test/14968',
 'test/14969',
 'test/14970',
 'test/14971',
 'test/14974',
 'test/14975',
 'test/14978',
 'test/14981',
 'test/14982',
 'test/14983',
 'test/14984',
 'test/14985',
 'test/14986',
 'test/14987',
 'test/14988',
 'test/14993',
 'test/14995',
 'test/14998',
 'test/15000',
 'test/15001',
 'test/15002',
 'test/15004',
 'test/15005',
 'test/15006',
 'test/15011',
 'test/15012',
 'test/15013',
 'test/15016',
 'test/15017',
 'test/15020',
 'test/15023',
 'test/15024',
 'test/15026',
 'test/15027',
 'test/15028',
 'test/15029',
 'test/15031',
 'test/15032',
 'test/15033',
 'test/15037',
 'test/15038',
 'test/15043',
 'test/15045',
 'test/15046',
 'test/15048',
 'test/15049',
 'test/15052',
 'test/15053',
 'test/15055',
 'test/15056',
 'test/15060',
 'test/15061',
 'test/15062',
 'test/15063',
 'test/15065',
 'test/15067',
 'test/15069',
 'test/15070',
 'test/15074',
 'test/15077',
 'test/15078',
 'test/15079',
 'test/15082',
 'test/15090',
 'test/15091',
 'test/15092',
 'test/15093',
 'test/15094',
 'test/15095',
 'test/15096',
 'test/15097',
 'test/15103',
 'test/15104',
 'test/15106',
 'test/15107',
 'test/15109',
 'test/15110',
 'test/15111',
 'test/15112',
 'test/15118',
 'test/15119',
 'test/15120',
 'test/15121',
 'test/15122',
 'test/15124',
 'test/15126',
 'test/15128',
 'test/15129',
 'test/15130',
 'test/15132',
 'test/15136',
 'test/15138',
 'test/15141',
 'test/15144',
 'test/15145',
 'test/15146',
 'test/15149',
 'test/15152',
 'test/15153',
 'test/15154',
 'test/15156',
 'test/15157',
 'test/15161',
 'test/15162',
 'test/15171',
 'test/15172',
 'test/15175',
 'test/15179',
 'test/15180',
 'test/15185',
 'test/15188',
 'test/15189',
 'test/15190',
 'test/15193',
 'test/15194',
 'test/15197',
 'test/15198',
 'test/15200',
 'test/15204',
 'test/15205',
 'test/15206',
 'test/15207',
 'test/15208',
 'test/15210',
 'test/15211',
 'test/15212',
 'test/15213',
 'test/15217',
 'test/15219',
 'test/15220',
 'test/15221',
 'test/15222',
 'test/15223',
 'test/15226',
 'test/15227',
 'test/15230',
 'test/15233',
 'test/15234',
 'test/15237',
 'test/15238',
 'test/15239',
 'test/15240',
 'test/15242',
 'test/15243',
 'test/15244',
 'test/15246',
 'test/15247',
 'test/15250',
 'test/15253',
 'test/15254',
 'test/15255',
 'test/15258',
 'test/15259',
 'test/15262',
 'test/15263',
 'test/15264',
 'test/15265',
 'test/15270',
 'test/15271',
 'test/15273',
 'test/15274',
 'test/15276',
 'test/15278',
 'test/15280',
 'test/15281',
 'test/15283',
 'test/15287',
 'test/15290',
 'test/15292',
 'test/15294',
 'test/15295',
 'test/15296',
 'test/15299',
 'test/15300',
 'test/15302',
 'test/15303',
 'test/15306',
 'test/15307',
 'test/15308',
 'test/15309',
 'test/15310',
 'test/15311',
 'test/15312',
 'test/15313',
 'test/15314',
 'test/15315',
 'test/15321',
 'test/15322',
 'test/15324',
 'test/15325',
 'test/15326',
 'test/15327',
 'test/15329',
 'test/15335',
 'test/15336',
 'test/15337',
 'test/15339',
 'test/15341',
 'test/15344',
 'test/15345',
 'test/15348',
 'test/15349',
 'test/15351',
 'test/15352',
 'test/15354',
 'test/15356',
 'test/15357',
 'test/15359',
 'test/15363',
 'test/15364',
 'test/15365',
 'test/15366',
 'test/15367',
 'test/15368',
 'test/15372',
 'test/15375',
 'test/15378',
 'test/15379',
 'test/15380',
 'test/15383',
 'test/15384',
 'test/15386',
 'test/15387',
 'test/15388',
 'test/15389',
 'test/15391',
 'test/15394',
 'test/15396',
 'test/15397',
 'test/15400',
 'test/15404',
 'test/15406',
 'test/15409',
 'test/15410',
 'test/15411',
 'test/15413',
 'test/15415',
 'test/15416',
 'test/15417',
 'test/15420',
 'test/15421',
 'test/15424',
 'test/15425',
 'test/15427',
 'test/15428',
 'test/15429',
 'test/15430',
 'test/15431',
 'test/15432',
 'test/15436',
 'test/15438',
 'test/15441',
 'test/15442',
 'test/15444',
 'test/15446',
 'test/15447',
 'test/15448',
 'test/15449',
 'test/15450',
 'test/15451',
 'test/15452',
 'test/15453',
 'test/15454',
 'test/15455',
 'test/15457',
 'test/15459',
 'test/15460',
 'test/15462',
 'test/15464',
 'test/15467',
 'test/15468',
 'test/15471',
 'test/15472',
 'test/15476',
 'test/15477',
 'test/15478',
 'test/15479',
 'test/15481',
 'test/15482',
 'test/15483',
 'test/15484',
 'test/15485',
 'test/15487',
 'test/15489',
 'test/15494',
 'test/15495',
 'test/15496',
 'test/15500',
 'test/15501',
 'test/15503',
 'test/15504',
 'test/15510',
 'test/15511',
 'test/15515',
 'test/15520',
 'test/15521',
 'test/15522',
 'test/15523',
 'test/15527',
 'test/15528',
 'test/15531',
 'test/15532',
 'test/15535',
 'test/15536',
 'test/15539',
 'test/15540',
 'test/15542',
 'test/15543',
 'test/15544',
 'test/15545',
 'test/15547',
 'test/15548',
 'test/15549',
 'test/15550',
 'test/15551',
 'test/15552',
 'test/15553',
 'test/15556',
 'test/15558',
 'test/15559',
 'test/15560',
 'test/15561',
 'test/15562',
 'test/15563',
 'test/15565',
 'test/15566',
 'test/15567',
 'test/15568',
 'test/15569',
 'test/15570',
 'test/15571',
 'test/15572',
 'test/15573',
 'test/15574',
 'test/15575',
 'test/15578',
 'test/15579',
 'test/15580',
 'test/15581',
 'test/15582',
 'test/15583',
 'test/15584',
 'test/15585',
 'test/15590',
 'test/15591',
 'test/15593',
 'test/15594',
 'test/15595',
 'test/15596',
 'test/15597',
 'test/15598',
 'test/15600',
 'test/15601',
 'test/15602',
 'test/15603',
 'test/15605',
 'test/15607',
 'test/15610',
 'test/15613',
 'test/15615',
 'test/15616',
 'test/15617',
 'test/15618',
 'test/15620',
 'test/15621',
 'test/15623',
 'test/15624',
 'test/15625',
 'test/15626',
 'test/15629',
 'test/15632',
 'test/15634',
 'test/15636',
 'test/15637',
 'test/15639',
 'test/15640',
 'test/15641',
 'test/15642',
 'test/15643',
 'test/15646',
 'test/15648',
 'test/15649',
 'test/15651',
 'test/15653',
 'test/15655',
 'test/15656',
 'test/15664',
 'test/15666',
 'test/15667',
 'test/15668',
 'test/15669',
 'test/15672',
 'test/15674',
 'test/15675',
 'test/15676',
 'test/15677',
 'test/15679',
 'test/15680',
 'test/15682',
 'test/15686',
 'test/15688',
 'test/15689',
 'test/15691',
 'test/15692',
 'test/15694',
 'test/15695',
 'test/15696',
 'test/15698',
 'test/15702',
 'test/15703',
 'test/15704',
 'test/15707',
 'test/15708',
 'test/15709',
 'test/15710',
 'test/15713',
 'test/15715',
 'test/15717',
 'test/15719',
 'test/15720',
 'test/15721',
 'test/15723',
 'test/15725',
 'test/15726',
 'test/15727',
 'test/15728',
 'test/15729',
 'test/15732',
 'test/15733',
 'test/15736',
 'test/15737',
 'test/15739',
 'test/15742',
 'test/15749',
 'test/15751',
 'test/15753',
 'test/15757',
 'test/15759',
 'test/15762',
 'test/15767',
 'test/15768',
 'test/15769',
 'test/15772',
 'test/15777',
 'test/15778',
 'test/15780',
 'test/15782',
 'test/15785',
 'test/15790',
 'test/15793',
 'test/15797',
 'test/15798',
 'test/15800',
 'test/15801',
 'test/15803',
 'test/15804',
 'test/15805',
 'test/15807',
 'test/15808',
 'test/15810',
 'test/15811',
 'test/15816',
 'test/15817',
 'test/15819',
 'test/15821',
 'test/15822',
 'test/15823',
 'test/15829',
 'test/15831',
 'test/15832',
 'test/15833',
 'test/15834',
 'test/15836',
 'test/15838',
 'test/15840',
 'test/15841',
 'test/15842',
 'test/15844',
 'test/15845',
 'test/15846',
 'test/15847',
 'test/15851',
 'test/15852',
 'test/15853',
 'test/15854',
 'test/15855',
 'test/15856',
 'test/15858',
 'test/15859',
 'test/15860',
 'test/15861',
 'test/15863',
 'test/15864',
 'test/15865',
 'test/15866',
 'test/15867',
 'test/15868',
 'test/15869',
 'test/15870',
 'test/15871',
 'test/15872',
 'test/15874',
 'test/15875',
 'test/15876',
 'test/15877',
 'test/15878',
 'test/15879',
 'test/15881',
 'test/15885',
 'test/15886',
 'test/15888',
 'test/15889',
 'test/15890',
 'test/15892',
 'test/15893',
 'test/15894',
 'test/15895',
 'test/15896',
 'test/15897',
 'test/15898',
 'test/15899',
 'test/15900',
 'test/15901',
 'test/15902',
 'test/15903',
 'test/15904',
 'test/15906',
 'test/15908',
 'test/15909',
 'test/15910',
 'test/15911',
 'test/15912',
 'test/15913',
 'test/15914',
 'test/15916',
 'test/15917',
 'test/15918',
 'test/15920',
 'test/15921',
 'test/15922',
 'test/15923',
 'test/15924',
 'test/15925',
 'test/15927',
 'test/15928',
 'test/15929',
 'test/15930',
 'test/15932',
 'test/15933',
 'test/15934',
 'test/15937',
 'test/15939',
 'test/15942',
 'test/15944',
 'test/15949',
 'test/15950',
 'test/15951',
 'test/15952',
 'test/15953',
 'test/15956',
 'test/15959',
 'test/15960',
 'test/15961',
 'test/15963',
 'test/15964',
 'test/15967',
 'test/15968',
 'test/15969',
 'test/15970',
 'test/15973',
 'test/15975',
 'test/15976',
 'test/15977',
 'test/15978',
 'test/15979',
 'test/15980',
 'test/15981',
 'test/15984',
 'test/15985',
 'test/15987',
 'test/15988',
 'test/15989',
 'test/15993',
 'test/15995',
 'test/15996',
 'test/15997',
 'test/15999',
 'test/16002',
 'test/16003',
 'test/16004',
 'test/16005',
 'test/16006',
 'test/16007',
 'test/16009',
 'test/16012',
 'test/16013',
 'test/16014',
 'test/16015',
 'test/16016',
 'test/16021',
 'test/16022',
 'test/16023',
 'test/16026',
 'test/16029',
 'test/16030',
 'test/16033',
 'test/16037',
 'test/16040',
 'test/16041',
 'test/16045',
 'test/16052',
 'test/16053',
 'test/16055',
 'test/16063',
 'test/16066',
 'test/16067',
 'test/16068',
 'test/16069',
 'test/16071',
 'test/16072',
 'test/16074',
 'test/16075',
 'test/16076',
 'test/16077',
 'test/16079',
 'test/16080',
 'test/16083',
 'test/16086',
 'test/16088',
 'test/16091',
 'test/16093',
 'test/16094',
 'test/16095',
 'test/16096',
 'test/16097',
 'test/16098',
 'test/16099',
 'test/16100',
 'test/16103',
 'test/16106',
 'test/16107',
 'test/16108',
 'test/16110',
 'test/16111',
 'test/16112',
 'test/16115',
 'test/16117',
 'test/16118',
 'test/16119',
 'test/16120',
 'test/16122',
 'test/16123',
 'test/16125',
 'test/16126',
 'test/16130',
 'test/16133',
 'test/16134',
 'test/16136',
 'test/16139',
 'test/16140',
 'test/16141',
 'test/16142',
 'test/16143',
 'test/16144',
 'test/16145',
 'test/16146',
 'test/16147',
 'test/16148',
 'test/16149',
 'test/16150',
 'test/16152',
 'test/16155',
 'test/16158',
 'test/16159',
 'test/16161',
 'test/16162',
 'test/16163',
 'test/16164',
 'test/16166',
 'test/16170',
 'test/16171',
 'test/16172',
 'test/16173',
 'test/16175',
 'test/16176',
 'test/16177',
 'test/16179',
 'test/16180',
 'test/16185',
 'test/16188',
 'test/16189',
 'test/16190',
 'test/16193',
 'test/16194',
 'test/16195',
 'test/16196',
 'test/16197',
 'test/16200',
 'test/16201',
 'test/16202',
 'test/16203',
 'test/16206',
 'test/16207',
 'test/16210',
 'test/16211',
 'test/16212',
 'test/16213',
 'test/16214',
 'test/16215',
 'test/16216',
 'test/16219',
 'test/16221',
 'test/16223',
 'test/16225',
 'test/16226',
 'test/16228',
 'test/16230',
 'test/16232',
 'test/16233',
 'test/16234',
 'test/16236',
 'test/16238',
 'test/16241',
 'test/16243',
 'test/16244',
 'test/16246',
 'test/16247',
 'test/16248',
 'test/16250',
 'test/16251',
 'test/16252',
 'test/16255',
 'test/16256',
 'test/16257',
 'test/16258',
 'test/16260',
 'test/16262',
 'test/16263',
 'test/16264',
 'test/16265',
 'test/16266',
 'test/16268',
 'test/16269',
 'test/16270',
 'test/16271',
 'test/16274',
 'test/16275',
 'test/16277',
 'test/16278',
 'test/16279',
 'test/16281',
 'test/16282',
 'test/16283',
 'test/16284',
 'test/16285',
 'test/16286',
 'test/16287',
 'test/16288',
 'test/16289',
 'test/16291',
 'test/16294',
 'test/16297',
 'test/16298',
 'test/16299',
 'test/16300',
 'test/16301',
 'test/16302',
 'test/16303',
 'test/16304',
 'test/16307',
 'test/16310',
 'test/16311',
 'test/16312',
 'test/16314',
 'test/16315',
 'test/16316',
 'test/16317',
 'test/16318',
 'test/16319',
 'test/16320',
 'test/16324',
 'test/16327',
 'test/16331',
 'test/16332',
 'test/16336',
 'test/16337',
 'test/16339',
 'test/16342',
 'test/16343',
 'test/16346',
 'test/16347',
 'test/16348',
 'test/16350',
 'test/16354',
 'test/16357',
 'test/16359',
 'test/16360',
 'test/16362',
 'test/16363',
 'test/16365',
 'test/16366',
 'test/16367',
 'test/16369',
 'test/16370',
 'test/16371',
 'test/16372',
 'test/16374',
 'test/16376',
 'test/16377',
 'test/16379',
 'test/16380',
 'test/16383',
 'test/16385',
 'test/16386',
 'test/16388',
 'test/16390',
 'test/16392',
 'test/16393',
 'test/16394',
 'test/16395',
 'test/16396',
 'test/16398',
 'test/16399',
 'test/16400',
 'test/16401',
 'test/16402',
 'test/16403',
 'test/16404',
 'test/16405',
 'test/16406',
 'test/16407',
 'test/16409',
 'test/16410',
 'test/16415',
 'test/16417',
 'test/16418',
 'test/16419',
 'test/16420',
 'test/16421',
 'test/16422',
 'test/16424',
 'test/16426',
 'test/16427',
 'test/16428',
 'test/16429',
 'test/16430',
 'test/16432',
 'test/16433',
 'test/16434',
 'test/16437',
 'test/16438',
 'test/16440',
 'test/16441',
 'test/16442',
 'test/16443',
 'test/16444',
 'test/16448',
 'test/16449',
 'test/16450',
 'test/16454',
 'test/16457',
 'test/16458',
 'test/16459',
 'test/16460',
 'test/16461',
 'test/16463',
 'test/16465',
 'test/16468',
 'test/16469',
 'test/16470',
 'test/16471',
 'test/16472',
 'test/16473',
 'test/16475',
 'test/16476',
 'test/16478',
 'test/16479',
 'test/16480',
 'test/16481',
 'test/16483',
 'test/16486',
 'test/16487',
 'test/16488',
 'test/16490',
 'test/16492',
 'test/16493',
 'test/16495',
 'test/16496',
 'test/16499',
 'test/16502',
 'test/16505',
 'test/16510',
 'test/16512',
 'test/16513',
 'test/16518',
 'test/16519',
 'test/16521',
 'test/16522',
 'test/16523',
 'test/16525',
 'test/16527',
 'test/16530',
 'test/16531',
 'test/16533',
 'test/16538',
 'test/16539',
 'test/16545',
 'test/16546',
 'test/16549',
 'test/16551',
 'test/16554',
 'test/16555',
 'test/16561',
 'test/16563',
 'test/16564',
 'test/16565',
 'test/16568',
 'test/16569',
 'test/16570',
 'test/16574',
 'test/16577',
 'test/16581',
 'test/16583',
 'test/16584',
 'test/16585',
 'test/16587',
 'test/16588',
 'test/16589',
 'test/16590',
 'test/16591',
 ...]

In [21]:
from nltk.corpus import reuters
# reuters.categories() ## assertion error in Python 3.5

In [22]:
# reuters.categories('training/9865') ## assertion error in Python 3.5

In [23]:
# reuters.fileids('barley') ## ValueError: Category barley not found

In [24]:
# reuters.fileids(['barley', 'corn']) 

# AssertionError
# /usr/local/Cellar/python3/3.5.1/Frameworks/Python.framework/Versions/3.5/lib/python3.5/zipfile.py in _fpclose(self, fp)
#    1712 
#    1713     def _fpclose(self, fp):
# -> 1714         assert self._fileRefCnt > 0
#    1715         self._fileRefCnt -= 1
#    1716         if not self._fileRefCnt and not self._filePassed:

Inaugural Address Corpus


In [25]:
from nltk.corpus import inaugural
inaugural.fileids()


Out[25]:
['1789-Washington.txt',
 '1793-Washington.txt',
 '1797-Adams.txt',
 '1801-Jefferson.txt',
 '1805-Jefferson.txt',
 '1809-Madison.txt',
 '1813-Madison.txt',
 '1817-Monroe.txt',
 '1821-Monroe.txt',
 '1825-Adams.txt',
 '1829-Jackson.txt',
 '1833-Jackson.txt',
 '1837-VanBuren.txt',
 '1841-Harrison.txt',
 '1845-Polk.txt',
 '1849-Taylor.txt',
 '1853-Pierce.txt',
 '1857-Buchanan.txt',
 '1861-Lincoln.txt',
 '1865-Lincoln.txt',
 '1869-Grant.txt',
 '1873-Grant.txt',
 '1877-Hayes.txt',
 '1881-Garfield.txt',
 '1885-Cleveland.txt',
 '1889-Harrison.txt',
 '1893-Cleveland.txt',
 '1897-McKinley.txt',
 '1901-McKinley.txt',
 '1905-Roosevelt.txt',
 '1909-Taft.txt',
 '1913-Wilson.txt',
 '1917-Wilson.txt',
 '1921-Harding.txt',
 '1925-Coolidge.txt',
 '1929-Hoover.txt',
 '1933-Roosevelt.txt',
 '1937-Roosevelt.txt',
 '1941-Roosevelt.txt',
 '1945-Roosevelt.txt',
 '1949-Truman.txt',
 '1953-Eisenhower.txt',
 '1957-Eisenhower.txt',
 '1961-Kennedy.txt',
 '1965-Johnson.txt',
 '1969-Nixon.txt',
 '1973-Nixon.txt',
 '1977-Carter.txt',
 '1981-Reagan.txt',
 '1985-Reagan.txt',
 '1989-Bush.txt',
 '1993-Clinton.txt',
 '1997-Clinton.txt',
 '2001-Bush.txt',
 '2005-Bush.txt',
 '2009-Obama.txt']

In [26]:
[fileid[:4] for fileid in inaugural.fileids()]


Out[26]:
['1789',
 '1793',
 '1797',
 '1801',
 '1805',
 '1809',
 '1813',
 '1817',
 '1821',
 '1825',
 '1829',
 '1833',
 '1837',
 '1841',
 '1845',
 '1849',
 '1853',
 '1857',
 '1861',
 '1865',
 '1869',
 '1873',
 '1877',
 '1881',
 '1885',
 '1889',
 '1893',
 '1897',
 '1901',
 '1905',
 '1909',
 '1913',
 '1917',
 '1921',
 '1925',
 '1929',
 '1933',
 '1937',
 '1941',
 '1945',
 '1949',
 '1953',
 '1957',
 '1961',
 '1965',
 '1969',
 '1973',
 '1977',
 '1981',
 '1985',
 '1989',
 '1993',
 '1997',
 '2001',
 '2005',
 '2009']

all words in the Inaugural Address Corpus that begin with america or citizen, counts over time (counts are not normalized for document length)


In [27]:
cfd = nltk.ConditionalFreqDist((target, fileid[:4])
                               for fileid in inaugural.fileids()
                               for w in inaugural.words(fileid)
                               for target in ['america', 'citizen']
                               if w.lower().startswith(target))
plot = plt.figure(figsize=(18,10))
cfd.plot()


Annotated Text Corpora

need to download other corpora through nltk.download()

Corpora in Other Languages

need to download other corpora through nltk.download()

Loading your own Corpus

Use PlaintextCorpusReader

commented out -- no corpus


In [28]:
# from nltk.corpus import PlaintextCorpusReader
# corpus_root = '/usr/share/dict'

# # the second parameter to PlaintextCorpusReader can be:
# # - a list of fileids, like ['a.txt', 'test/b.txt']
# # - a pattern that matches all fileids, like '[abc]/.*\.txt'
# wordlists = PlaintextCorpusReader(corpus_root, '.*')
# wordlists.fileids()

In [29]:
# wordlists.words('connectives')

Conditional Frequency Distributions

Remember that we can compute occurrences of items (for example, words) in a list using FreqDist(mylist)

"A conditional frequency distribution is a collection of frequency distributions, each one for a different condition"

Use ConditionalFreqDist

Conditions and events

Conditional frequency distribution needs to pair observable events with conditions, so the input is a sequence of pairs (not just a list of words).

text = ['The', 'Fulton', 'County', 'Grand', 'Jury', 'said', ...]
pairs = [('news', 'The'), ('news', 'Fulton'), ('news', 'County'), ...]

Pairs have the form (condition, event)

Counting Words by Genre


In [30]:
from nltk.corpus import brown
cfd = nltk.ConditionalFreqDist((genre, word) 
                               for genre in brown.categories()
                               for word in brown.words(categories=genre))
cfd


Out[30]:
defaultdict(nltk.probability.FreqDist,
            {'adventure': Counter({'Without': 1,
                      'lighted': 5,
                      'laden': 1,
                      'shimmering': 1,
                      'More': 1,
                      'assent': 1,
                      'festival': 1,
                      'speak': 6,
                      'located': 2,
                      'pandanus': 1,
                      'education': 4,
                      'house': 60,
                      'muscles': 5,
                      "She'd": 1,
                      'tube': 1,
                      'travelers': 1,
                      'Adams': 5,
                      'waters': 6,
                      'forearm': 1,
                      'dig': 2,
                      'hurdled': 1,
                      'suds': 3,
                      'use': 15,
                      'calloused': 1,
                      'killed': 15,
                      'experience': 1,
                      'sunbaked': 1,
                      'served': 3,
                      'octagonal': 1,
                      'bars': 5,
                      'many-times': 1,
                      'thoroughly': 1,
                      "Ayres'": 1,
                      'unaware': 1,
                      'singed': 1,
                      'abrupt': 1,
                      'certain': 7,
                      'heaved': 1,
                      'Tomorrow': 1,
                      'Nazi': 1,
                      'demanded': 4,
                      'shade': 3,
                      'others': 19,
                      'sunshine': 2,
                      'vicinity': 1,
                      'Wait': 4,
                      'Unconcerned': 1,
                      'strains': 1,
                      'mercy': 1,
                      'bucking': 1,
                      'taut': 1,
                      'catch': 10,
                      'ngandlu': 1,
                      'beautiful': 5,
                      'warrior': 3,
                      "stealin'": 1,
                      'supported': 2,
                      'brushing': 2,
                      'grotesquely': 1,
                      'McLish': 3,
                      'gesticulating': 1,
                      'Stevens': 8,
                      'Kneeling': 1,
                      'lapsing': 1,
                      'paradoxically': 1,
                      'chicken': 4,
                      'determine': 2,
                      'goaded': 1,
                      'correct': 2,
                      'cabins': 1,
                      'glance': 6,
                      'cups': 1,
                      'kitchen': 7,
                      'amount': 2,
                      'rabbits': 1,
                      'crack': 5,
                      'glances': 1,
                      'pressure': 6,
                      'overrun': 1,
                      'thet': 2,
                      'cheek': 11,
                      'perpetrator': 1,
                      'buddies': 2,
                      'dangle': 1,
                      'way': 65,
                      'gauche': 1,
                      'Ballet': 1,
                      'bawled': 1,
                      "wouldn't": 25,
                      'gulps': 1,
                      'cheated': 1,
                      'through': 78,
                      'rimmed': 1,
                      'Just': 22,
                      "we'll": 6,
                      'each': 26,
                      'Because': 3,
                      'Plus': 1,
                      'Ah': 8,
                      'interminable': 1,
                      'mustering': 1,
                      'pavement': 6,
                      'tons': 3,
                      'surreptitiously': 1,
                      'timid': 1,
                      'Seton': 1,
                      'decaying': 1,
                      'Walters': 1,
                      'forget': 3,
                      'apparently': 2,
                      'unselfishly': 1,
                      "yesterday's": 1,
                      'ye': 1,
                      'tossed': 5,
                      'shy': 2,
                      'raised': 7,
                      'fertility': 1,
                      'stray': 1,
                      'Easterners': 1,
                      'stream': 2,
                      'revealed': 2,
                      'sprung': 1,
                      'eternal': 1,
                      'fences': 1,
                      'At': 25,
                      ')': 22,
                      'wallowed': 1,
                      'focused': 1,
                      'patrolman': 3,
                      'clog': 1,
                      'tipsy': 1,
                      'Those': 4,
                      'surprisingly': 1,
                      'half-a-dozen': 1,
                      'sizzled': 1,
                      'lucky': 5,
                      'career': 1,
                      'idly': 2,
                      'meld': 1,
                      'husband': 13,
                      'friends': 6,
                      'bet': 3,
                      'Prime': 2,
                      'shouting': 7,
                      'worse': 2,
                      'Shillong': 1,
                      'slip': 1,
                      'unenunciated': 1,
                      'small': 35,
                      'bull': 1,
                      'doorways': 1,
                      'enemy': 15,
                      'wept': 2,
                      'neckline': 1,
                      'also': 19,
                      'knife': 11,
                      'recently': 2,
                      'reined': 2,
                      'distinct': 2,
                      'snapped': 7,
                      'next': 22,
                      'cipher': 1,
                      'propeller': 1,
                      'sly': 1,
                      'sweltering': 1,
                      'strike': 3,
                      'four': 16,
                      'lame': 1,
                      'coiling': 1,
                      'really': 18,
                      'broad': 4,
                      'Please': 3,
                      'comforting': 1,
                      'Pistol-whipping': 1,
                      'rushed': 3,
                      'effect': 3,
                      'planted': 1,
                      'exact': 1,
                      'clothed': 1,
                      'Anything': 3,
                      'cell': 3,
                      'King': 1,
                      'steeper': 2,
                      "bo'sun's": 1,
                      "Pettigrew's": 1,
                      'Remembering': 1,
                      'evening': 9,
                      'dripped': 1,
                      'pungent': 1,
                      'Houses': 1,
                      'Whaddya': 1,
                      'telegraph': 1,
                      'Joseph': 3,
                      'coward': 3,
                      'daylight': 5,
                      'These': 6,
                      'retrieved': 1,
                      'towering': 3,
                      'runaway': 1,
                      "Todman's": 1,
                      'ask': 8,
                      'found': 36,
                      'steadiness': 1,
                      "one's": 2,
                      'Laurel': 1,
                      'badge-toter': 1,
                      'pretty': 6,
                      'mother': 14,
                      "t'lah": 1,
                      'things': 18,
                      'Now': 37,
                      'thank': 4,
                      'balled': 1,
                      'negligence': 1,
                      "He'll": 2,
                      'Running': 1,
                      'permission': 2,
                      'suppose': 4,
                      'vertical': 1,
                      'bits': 2,
                      'goodbye': 1,
                      'onct': 2,
                      'cocked': 2,
                      'midst': 1,
                      'drawn': 6,
                      'sir': 2,
                      'bat': 2,
                      'cellophane': 1,
                      'gallop': 2,
                      'marrowbones': 1,
                      'Bear': 1,
                      'brethren': 1,
                      'hatred': 3,
                      'snack': 1,
                      'German': 11,
                      'neighbor': 1,
                      'scream': 4,
                      'signal': 5,
                      'grabbed': 10,
                      'musicians': 1,
                      'gushed': 2,
                      'shoot': 7,
                      'pierced': 2,
                      'shadow': 5,
                      'pinpoints': 1,
                      'stiffened': 1,
                      'radii': 1,
                      "wasn't": 17,
                      "Delphine's": 2,
                      'narrowing': 1,
                      'flights': 5,
                      'dying': 5,
                      'required': 1,
                      'remained': 11,
                      'Griggs': 3,
                      'Lord': 23,
                      'throng': 1,
                      'beasts': 1,
                      'strip': 6,
                      'hunting': 3,
                      'resource': 1,
                      'run': 11,
                      'discussed': 1,
                      'speedy': 1,
                      'stores': 1,
                      'veering': 1,
                      'cobra': 2,
                      'quavered': 1,
                      'web': 1,
                      'lilting': 1,
                      'nerves': 3,
                      'shark-infested': 1,
                      '30': 1,
                      'motioning': 2,
                      'themselves': 6,
                      'developing': 1,
                      'prettiness': 1,
                      'storefront': 1,
                      "wieners'": 1,
                      'Deputy': 2,
                      'experiences': 1,
                      'wicker': 1,
                      'However': 1,
                      'drahve': 1,
                      'stumps': 1,
                      'positive': 2,
                      'Dammit': 2,
                      'Cheyennes': 2,
                      'execution': 1,
                      'Howard': 1,
                      'speaks': 1,
                      'thorny': 1,
                      'clean': 6,
                      "mulatto's": 1,
                      'pulled': 19,
                      'doggone': 1,
                      'framed': 2,
                      'Dawn': 1,
                      'methods': 3,
                      'visited': 3,
                      'loaded': 1,
                      'Also': 3,
                      'radio': 8,
                      'coat': 11,
                      'Hustle': 1,
                      'trees': 21,
                      'anonymous': 5,
                      'Japanese': 4,
                      'Begging': 1,
                      'action-packed': 1,
                      'seeming': 2,
                      'devastating': 1,
                      'sixty': 2,
                      'broad-brimmed': 1,
                      'problem': 4,
                      'winter': 1,
                      'distance': 12,
                      'divining': 1,
                      'walking': 3,
                      'sleeve': 6,
                      'floorboards': 1,
                      'proffered': 2,
                      'enlightened': 1,
                      "money's": 1,
                      'looking': 19,
                      'goad': 1,
                      'Journal': 1,
                      'chamois': 1,
                      'crook': 1,
                      'splinters': 1,
                      'suggest': 1,
                      'British': 3,
                      'sheeted': 1,
                      'trading': 2,
                      'Time': 2,
                      'humor': 1,
                      'baggy': 1,
                      'fibers': 1,
                      'Smiling': 1,
                      "We're": 8,
                      'sentimental': 1,
                      'Big': 5,
                      'descending': 1,
                      'thrills': 1,
                      'garbed': 1,
                      'corkers': 1,
                      'inductees': 1,
                      'pointed': 7,
                      'wounded': 6,
                      'thumbing': 1,
                      'determination': 1,
                      'hey': 1,
                      'slowly': 14,
                      'place': 39,
                      'piece': 8,
                      'rode': 12,
                      'amusement': 1,
                      'bombers': 1,
                      'Piwen': 1,
                      'tasted': 1,
                      'atmosphere': 2,
                      'teetering': 1,
                      'Montero': 10,
                      'urgent': 1,
                      'land': 16,
                      'gravely': 2,
                      'blocking': 1,
                      'wires': 2,
                      'think': 44,
                      'could': 151,
                      'enjoying': 1,
                      'rusty': 4,
                      'stopped': 17,
                      'Wilson': 17,
                      'Dangerous': 2,
                      'marvelled': 1,
                      'related': 1,
                      'elaborate': 2,
                      'Along': 2,
                      'pride': 4,
                      'compulsion': 1,
                      'twisting': 5,
                      'darkened': 2,
                      'rattle': 1,
                      'reeking': 1,
                      'rites': 1,
                      'brute': 1,
                      'Juan': 2,
                      'Corporal': 2,
                      'ran': 33,
                      'wagons': 7,
                      'meant': 14,
                      'Oooo': 1,
                      'shield': 1,
                      'intent': 1,
                      'tiny': 4,
                      'seepage': 1,
                      'picture': 3,
                      'guarding': 2,
                      'A.M.': 2,
                      'sympathetic': 1,
                      'Chicago': 1,
                      'stairway': 1,
                      'colossal': 1,
                      'buildings': 10,
                      'between': 19,
                      "weeks'": 1,
                      'cost': 2,
                      'collapsing': 1,
                      'host': 1,
                      'courteously': 2,
                      'intimate': 2,
                      'lifeless': 1,
                      'braces': 1,
                      'eighth': 1,
                      'tombstone': 1,
                      'Senora': 1,
                      'recognize': 1,
                      'emerge': 1,
                      'Ramey': 28,
                      'wagon': 27,
                      'companions': 1,
                      'higher': 5,
                      'Dutch': 4,
                      'willow': 1,
                      'foliage': 1,
                      'turtlebacks': 1,
                      'nod': 3,
                      'decomposing': 1,
                      'firmly': 5,
                      'deplorable': 1,
                      'Soak': 1,
                      'backed': 6,
                      'gunner': 1,
                      'placid': 2,
                      'Emigrant': 2,
                      'dismissed': 2,
                      'nowhere': 6,
                      'told': 46,
                      'Alice': 1,
                      "Munich's": 1,
                      'The': 410,
                      'Philly': 1,
                      'heat': 7,
                      'someplace': 2,
                      'Opening': 1,
                      'mouldering': 1,
                      'riders': 1,
                      'Resisting': 1,
                      'of': 1322,
                      'feminist': 1,
                      "Morgan's": 1,
                      'withdrawn': 1,
                      'law': 6,
                      'spiritual': 1,
                      'Ponkob': 1,
                      'scouts': 1,
                      'Perdido': 3,
                      'nearing': 1,
                      'stained': 3,
                      'manservant': 2,
                      'police': 5,
                      'guess': 11,
                      'fates': 1,
                      'First': 4,
                      'rear': 19,
                      'crossbars': 1,
                      'Hallelujah': 1,
                      'doubloon': 1,
                      'metal': 5,
                      'corner': 19,
                      'sworn': 2,
                      'ducked': 2,
                      'bold': 5,
                      'lived': 10,
                      'protest': 1,
                      'Sierra': 1,
                      'self-confidence': 1,
                      'mingled': 1,
                      'dispersed': 1,
                      'maid': 1,
                      'Grace': 3,
                      'facts': 1,
                      "two-bits'": 1,
                      'Figger': 1,
                      'unusually': 4,
                      'cursed': 3,
                      'idiotically': 1,
                      'craved': 1,
                      'peculiar': 1,
                      'display': 2,
                      'Siberia': 1,
                      'frothing': 1,
                      'funny': 2,
                      'entry': 1,
                      'overhead': 2,
                      'outside': 10,
                      'hung': 14,
                      'book': 4,
                      'Methodists': 1,
                      'suggested': 1,
                      "coroner's": 4,
                      'harshly': 3,
                      'comport': 1,
                      'jacket': 6,
                      'Seeing': 2,
                      'bound': 4,
                      'intend': 1,
                      'precaution': 2,
                      'Okay': 1,
                      'poems': 4,
                      'endlessly': 1,
                      'purpose': 5,
                      'blood-chilling': 1,
                      'give': 27,
                      'engraving': 1,
                      'Bullets': 1,
                      'John-Henry': 1,
                      'wisps': 1,
                      'dishes': 5,
                      'tape': 1,
                      'unnamed': 1,
                      'odor': 2,
                      'stomping': 1,
                      'Warmly': 1,
                      'Stetson': 1,
                      'ammunition': 5,
                      'flatly': 1,
                      'tunnel': 1,
                      'blond': 5,
                      'waving': 1,
                      'crest': 2,
                      'applying': 3,
                      'allow': 4,
                      'dangerous': 2,
                      'Aj': 1,
                      'Embarcadero': 1,
                      'daylights': 1,
                      'missing': 1,
                      'actually': 6,
                      'Several': 1,
                      'connected': 1,
                      'softened': 2,
                      'Billy': 17,
                      'after': 53,
                      'questioning': 4,
                      'young': 24,
                      'anyway': 7,
                      'authority': 3,
                      'nester': 2,
                      'Perhaps': 6,
                      'grace': 1,
                      'confused': 2,
                      'Shall': 4,
                      'big': 50,
                      'ill-conceived': 1,
                      'fawn': 1,
                      'quantities': 1,
                      'arresting': 1,
                      'grenades': 1,
                      'riddled': 1,
                      'fills': 1,
                      'attempted': 4,
                      'completely': 7,
                      'pleasure': 5,
                      "'tain't": 1,
                      'unwaveringly': 1,
                      'opened': 19,
                      'left': 47,
                      'stain': 1,
                      'fortified': 1,
                      'Holy': 1,
                      'clip': 1,
                      'blast': 4,
                      'stood': 36,
                      'worst': 3,
                      'Coble': 1,
                      'deck': 7,
                      'yuh': 1,
                      'drinking': 6,
                      'toasted': 1,
                      'street': 23,
                      'liveried': 1,
                      'bow': 2,
                      'lawmen': 2,
                      'current': 3,
                      'streamed': 2,
                      'fill': 1,
                      'ludicrous': 1,
                      'easiest': 1,
                      'priest': 1,
                      'Leaning': 2,
                      "t'hi-im": 1,
                      'town': 28,
                      'now': 90,
                      'OK.': 1,
                      'Drink': 2,
                      'levi-clad': 1,
                      'Melanesian': 2,
                      'buxom': 1,
                      'Favor': 1,
                      'cab': 2,
                      'suddenly': 16,
                      'turtle': 2,
                      'lay': 15,
                      'cohnfidunt': 1,
                      'loss': 1,
                      'three': 35,
                      'shut': 7,
                      'bobbing': 1,
                      'demand': 2,
                      'batch': 2,
                      'stock': 5,
                      'Spring': 1,
                      'throughout': 1,
                      'planks': 3,
                      'San': 3,
                      'saddles': 1,
                      'wage': 1,
                      'industrial': 2,
                      'dozed': 1,
                      'pat': 1,
                      'pure': 3,
                      'extended': 6,
                      'sluices': 1,
                      'demoniac': 1,
                      'patiently': 1,
                      'twins': 1,
                      'Slow': 1,
                      'joy': 2,
                      'detective': 1,
                      'gag': 1,
                      'car': 36,
                      'murders': 1,
                      'clambered': 2,
                      'cap': 6,
                      'West': 7,
                      '$30': 1,
                      'A.': 1,
                      'discovered': 3,
                      'present': 5,
                      'galley': 2,
                      'be': 183,
                      'racing': 3,
                      'names': 4,
                      'ready': 14,
                      'corpses': 1,
                      'sodden': 1,
                      'gunslinger': 1,
                      'passiveness': 1,
                      'glistened': 2,
                      'litle': 1,
                      'Ten': 4,
                      'witches': 2,
                      'abundance': 1,
                      'sincerity': 1,
                      'heap': 3,
                      'sparring': 1,
                      'conclusion': 1,
                      'ant': 1,
                      'ceased': 2,
                      'auto': 1,
                      'minutes': 27,
                      'pinch': 1,
                      'Gas': 1,
                      'equipped': 1,
                      'intertwined': 1,
                      'joined': 6,
                      "He's": 15,
                      'hush': 1,
                      'leaping': 1,
                      'load': 5,
                      'convicts': 1,
                      'distributor': 1,
                      'sharpened': 1,
                      "Amelia's": 2,
                      'covering': 5,
                      'afternoon': 9,
                      'scored': 1,
                      'indistinguishable': 1,
                      'drained': 2,
                      'central': 1,
                      'shivered': 1,
                      'Poussin': 1,
                      'brandishing': 1,
                      'fail': 2,
                      'boarding': 1,
                      'burial': 2,
                      'weeds': 1,
                      'suitably': 1,
                      'force': 6,
                      'replaced': 2,
                      'Bonaventure': 2,
                      "Christ's": 1,
                      'prevented': 2,
                      'Raton': 1,
                      'absent-mindedly': 1,
                      'spasms': 1,
                      'cringing': 2,
                      'eastern': 1,
                      "Tilghman's": 4,
                      'plates': 3,
                      'wildly': 7,
                      'Neal': 3,
                      'loin': 1,
                      'Donald': 4,
                      'ate': 2,
                      'alone': 14,
                      'pardon': 2,
                      'trail-worn': 1,
                      'Venice': 1,
                      'beneath': 15,
                      'bullet': 12,
                      'offsaddled': 1,
                      "You'd": 3,
                      'slugs': 1,
                      'backbone': 1,
                      'tongue-tied': 1,
                      'camouflage': 1,
                      'into': 180,
                      'address': 3,
                      'keel': 1,
                      'else': 21,
                      'let': 38,
                      'panted': 1,
                      'Shu-tt': 1,
                      'becoming': 2,
                      'May': 2,
                      'artfully': 1,
                      'vanishing': 1,
                      'brother': 2,
                      'Somebody': 6,
                      'managed': 3,
                      'ruthlessly': 1,
                      'method': 2,
                      'murder': 9,
                      '2': 2,
                      'associates': 1,
                      '400': 1,
                      'blinked': 1,
                      'dispatched': 1,
                      'ferociously': 1,
                      'trigger-happy': 2,
                      'visible': 2,
                      'involved': 4,
                      'detonation': 1,
                      'pupils': 1,
                      'quirt': 8,
                      'straining': 1,
                      'style': 1,
                      'switch': 4,
                      "Cargill's": 1,
                      'electric': 1,
                      'reunion': 3,
                      'shots': 7,
                      "marshal's": 2,
                      'muttered': 3,
                      'honeymoon': 1,
                      "I'm": 50,
                      'forgive': 3,
                      'secrets': 1,
                      'congealed': 1,
                      'corners': 1,
                      'around': 71,
                      'gasped': 1,
                      'Donna': 4,
                      'engages': 3,
                      'edges': 1,
                      'missy': 1,
                      'Otherwise': 1,
                      "I'll": 35,
                      'sling': 1,
                      'slivery': 1,
                      'rockstrewn': 1,
                      'monotonous': 1,
                      'stilts': 1,
                      'naked': 5,
                      'clicks': 1,
                      'clothes': 11,
                      'gripped': 6,
                      'clapped': 2,
                      'gruff': 2,
                      'headwalls': 2,
                      'thankful': 1,
                      'released': 4,
                      'ends': 1,
                      'possibilities': 1,
                      'eromonga': 1,
                      'attracted': 2,
                      'controlled': 2,
                      'hide': 3,
                      'farming': 2,
                      'mid-section': 1,
                      'inadvertently': 1,
                      'self-dictate': 1,
                      'parked': 4,
                      'biggest': 1,
                      'from': 260,
                      'devilish': 1,
                      'Jim': 1,
                      'Hot': 3,
                      'ages': 2,
                      'Fat': 1,
                      'warning': 5,
                      'dejectedly': 1,
                      'Murder': 1,
                      'ball': 10,
                      'Mother': 1,
                      'spade': 1,
                      'catlike': 1,
                      'reality': 1,
                      'stoker': 1,
                      'studied': 7,
                      'twenty-five': 1,
                      "boat's": 1,
                      'appreciate': 2,
                      'prancing': 1,
                      'deeds': 1,
                      'years': 32,
                      'sticks': 3,
                      'probably': 18,
                      'listless': 1,
                      'frequently': 2,
                      'Splendid': 1,
                      'During': 1,
                      'well-brushed': 1,
                      'majuh': 1,
                      'Blackfeet': 2,
                      'Cold': 1,
                      'tore': 3,
                      'cashmere': 1,
                      'Ducking': 1,
                      'drunkenly': 1,
                      'victuals': 1,
                      'or': 115,
                      'madly': 2,
                      "Don't": 14,
                      "Carwood's": 1,
                      'fifteen': 3,
                      'shaky': 1,
                      'destruction': 1,
                      'dives': 3,
                      'blissfully': 1,
                      'ma': 1,
                      'city': 5,
                      'sex': 3,
                      'Looking': 2,
                      'muzzles': 1,
                      'stomach': 9,
                      'sensing': 1,
                      'holstered': 3,
                      'Miraculously': 3,
                      'hosts': 1,
                      'backing': 1,
                      'easier': 1,
                      'consciousness': 1,
                      'plans': 2,
                      'ropes': 3,
                      'pick-up': 2,
                      'sprawled': 2,
                      'Vs.': 1,
                      'heart': 11,
                      'primary': 1,
                      'national': 1,
                      'ah': 2,
                      'tomorrow': 3,
                      'Dealing': 2,
                      'suspicion': 2,
                      'Gloom': 1,
                      'bandits': 1,
                      'tacked': 1,
                      'boiler': 1,
                      'called': 22,
                      'Cabot': 1,
                      'year': 11,
                      'Straits': 1,
                      'exerted': 1,
                      'Too': 6,
                      'courage': 1,
                      'sighted': 2,
                      'painted': 1,
                      'agricultural': 1,
                      'slumped': 5,
                      'savvy': 1,
                      'Resignedly': 1,
                      'Ah-ah': 1,
                      'scraps': 1,
                      'courtliness': 1,
                      'conspicuously': 1,
                      'Hoe-Down': 1,
                      'organized': 2,
                      'wonder': 6,
                      'partial': 1,
                      'revelry': 1,
                      'gestured': 1,
                      'sharply': 4,
                      'makeshift': 1,
                      'chance': 20,
                      'squeaking': 1,
                      'munched': 1,
                      'stampeded': 1,
                      'somnolent': 1,
                      'sourly': 1,
                      'vine-crisscrossed': 1,
                      'fellow': 4,
                      'halfway': 3,
                      'mission': 4,
                      'Hair': 1,
                      'not': 179,
                      'blazed': 1,
                      'sons': 1,
                      'taught': 2,
                      'headquarters': 2,
                      'Brakes': 1,
                      'basket': 1,
                      'suffer': 2,
                      'soared': 3,
                      'bedroom': 6,
                      'growth': 1,
                      'Arm': 1,
                      'Eyes': 8,
                      'shattering': 4,
                      'college': 4,
                      'unoccupied': 1,
                      '15th': 1,
                      'placed': 7,
                      'moaned': 2,
                      'landmarks': 1,
                      'boat': 4,
                      'noble': 4,
                      'zigzagging': 3,
                      'set': 17,
                      'refuse-littered': 1,
                      'sliding': 2,
                      'glanced': 4,
                      'winked': 1,
                      'worried': 4,
                      'hated': 6,
                      'arrangement': 2,
                      'Meaning': 1,
                      'essential': 1,
                      'crowd': 4,
                      'five': 13,
                      'begrudge': 1,
                      'thrilled': 1,
                      'neatly': 1,
                      'less': 13,
                      "Breed's": 1,
                      'movement': 4,
                      'scraping': 3,
                      'dwindling': 1,
                      'united': 1,
                      'application': 2,
                      'smaller': 3,
                      'crimsoning': 1,
                      'surreptitious': 1,
                      'shall': 7,
                      'word': 8,
                      'innocence': 1,
                      'magnificent': 1,
                      'quarter-mile': 1,
                      'detectives': 1,
                      'quieted': 1,
                      'searching': 4,
                      "hadn't": 14,
                      'shells': 1,
                      'ground': 29,
                      "We'd": 2,
                      'Ferguson': 1,
                      'broil': 1,
                      'locate': 3,
                      'bowstring': 1,
                      'Walter': 7,
                      'stir': 2,
                      'liaisons': 1,
                      'mudguard': 1,
                      '7:30': 1,
                      'business': 8,
                      'strength': 5,
                      'claims': 1,
                      'wreck': 1,
                      'adults': 2,
                      'wallet': 1,
                      'oval': 1,
                      'sack': 2,
                      'Start': 3,
                      'unusual': 3,
                      'holster': 8,
                      'clasped': 1,
                      'ostensible': 1,
                      'obedience': 1,
                      'flow': 2,
                      'white': 26,
                      'dreamed': 2,
                      'thimble-sized': 1,
                      'hoof': 1,
                      'desperadoes': 1,
                      'lacked': 1,
                      "Dill's": 3,
                      'task': 1,
                      ...}),
             'belles_lettres': Counter({'Without': 12,
                      'unanswered': 1,
                      'taxing': 1,
                      'More': 8,
                      '23': 1,
                      'festival': 1,
                      'speak': 30,
                      'located': 3,
                      'economics': 3,
                      'Bang-Jensen': 25,
                      'intimidation': 1,
                      'house': 40,
                      'promises': 2,
                      'dispatch': 5,
                      'Corporations': 3,
                      'invulnerability': 1,
                      'building': 11,
                      'arithmetized': 1,
                      'waters': 4,
                      'crocketed': 1,
                      'Between': 2,
                      'restrain': 2,
                      'sieben': 1,
                      'use': 56,
                      'dissatisfactions': 1,
                      'killed': 7,
                      'experience': 92,
                      'endure': 1,
                      'horseman': 1,
                      'served': 20,
                      'Principal': 1,
                      'corroding': 1,
                      'population': 11,
                      'warrants': 1,
                      'titanic': 1,
                      'mana': 2,
                      'Viscount': 1,
                      'gibe': 1,
                      'Sex': 1,
                      'Dreiser': 1,
                      'surrendering': 1,
                      'pleading': 3,
                      "Whitman's": 1,
                      'Tessie': 7,
                      'H.M.': 2,
                      'wages': 1,
                      'zeal': 1,
                      'Continental': 4,
                      'shade': 6,
                      'sunshine': 1,
                      'mercy': 2,
                      'grew': 12,
                      'Convocation': 3,
                      "Peoples'": 1,
                      'chancellor': 2,
                      'beautiful': 15,
                      'Analogously': 1,
                      'marriages': 2,
                      'relay': 1,
                      'Karamazov': 2,
                      'denying': 1,
                      'Eternal': 1,
                      'rheum': 1,
                      'sorted': 1,
                      'Madeleine': 3,
                      'faires': 1,
                      'thrillers': 1,
                      'hardships': 3,
                      'constituents': 1,
                      'chicken': 3,
                      'barrage': 1,
                      'materially': 1,
                      'commentator': 1,
                      'reborn': 1,
                      'glance': 8,
                      'cups': 1,
                      'quiescent': 1,
                      'tangential': 1,
                      'kitchen': 2,
                      'amount': 10,
                      'pursuits': 1,
                      'crack': 1,
                      'Called': 1,
                      'pressure': 14,
                      'Short': 1,
                      'Creator': 3,
                      'legitimacy': 1,
                      'archaic': 3,
                      'evils': 1,
                      "wouldn't": 9,
                      'reinforcing': 1,
                      'through': 135,
                      '$300,000': 1,
                      "James'": 1,
                      'migrate': 1,
                      'Relativism': 1,
                      'renamed': 2,
                      'Just': 11,
                      'specific': 18,
                      'deemed': 2,
                      'Because': 12,
                      'Ah': 2,
                      'ascertain': 2,
                      'Quod': 1,
                      'hug': 2,
                      'Justice': 4,
                      'Sumter': 1,
                      'surreptitiously': 1,
                      'timid': 1,
                      'menaced': 1,
                      'grimmer': 1,
                      'entrepreneur': 6,
                      "yesterday's": 1,
                      'Democrats': 11,
                      'myne': 1,
                      'toil': 1,
                      'cottage': 1,
                      'raised': 11,
                      'stray': 1,
                      'flocking': 1,
                      'meanings': 1,
                      'stream': 9,
                      'revealed': 8,
                      'grave': 7,
                      'Rousseau': 2,
                      'suburbs': 1,
                      'eternal': 5,
                      'epitomized': 1,
                      'experimenter': 1,
                      'cheere': 1,
                      'At': 73,
                      ')': 251,
                      'college-educated': 1,
                      'Boheme': 1,
                      'non-Jew': 2,
                      'stimulated': 3,
                      'mastery': 3,
                      'Gullah': 1,
                      'quoted': 4,
                      'accumulates': 1,
                      'Those': 15,
                      'surprisingly': 4,
                      'enlist': 2,
                      'Nelson': 2,
                      'recurred': 2,
                      'inculcation': 1,
                      'lucky': 2,
                      'prophets': 2,
                      'hesitant': 1,
                      'bet': 1,
                      'English-Scottish-French': 1,
                      'subtly': 2,
                      'Peruvian': 1,
                      'Cruel': 1,
                      'sands': 1,
                      'slip': 2,
                      'negative': 17,
                      "patient's": 1,
                      'small': 48,
                      'doorways': 1,
                      'enemy': 25,
                      'Apologie': 1,
                      'understatement': 1,
                      'quo': 6,
                      'eventfully': 1,
                      'revolutions': 1,
                      'knife': 1,
                      'perceptual': 4,
                      'Verner': 1,
                      'enforced': 5,
                      'Sighting': 1,
                      'consultative': 1,
                      "lover's": 1,
                      'scornfully': 1,
                      'reared': 1,
                      'petitioned': 2,
                      'strike': 7,
                      'four': 30,
                      'Dorado': 1,
                      'e.g.': 10,
                      'really': 36,
                      'peoples': 13,
                      'lends': 1,
                      'deferents': 1,
                      'striving': 1,
                      'signals': 3,
                      'effect': 36,
                      'sports': 9,
                      'shawl': 1,
                      'Anything': 1,
                      'huckster': 1,
                      'four-thirty': 1,
                      'falcon': 1,
                      'King': 16,
                      'artists': 6,
                      'sanctimonious': 1,
                      'good-bye': 2,
                      'confronting': 2,
                      'evening': 22,
                      'prisoners': 7,
                      'Joseph': 12,
                      'relax': 2,
                      "B-52's": 1,
                      'wives': 2,
                      'recalcitrant': 2,
                      'runaway': 3,
                      'regretted': 5,
                      'Cam': 1,
                      'relief': 6,
                      'prettier': 1,
                      'stead': 2,
                      'ask': 12,
                      'gret': 1,
                      'America': 42,
                      'found': 81,
                      'Christie': 2,
                      'stewed': 1,
                      'resurrecting': 1,
                      'barefooted': 1,
                      'wonders': 2,
                      'ethos': 2,
                      'mother': 38,
                      'Garry': 4,
                      'things': 69,
                      'shaven': 1,
                      'severe': 2,
                      'alienated': 2,
                      'Abstract': 1,
                      'capacities': 2,
                      'independent': 12,
                      'nomination': 3,
                      'chow': 1,
                      'Hall': 5,
                      'singularly': 1,
                      'transported': 1,
                      "buildin'": 1,
                      'dextrous-fingered': 1,
                      'appreciative': 1,
                      'inaugural': 2,
                      'stops': 1,
                      "Coleridge's": 2,
                      'gallop': 1,
                      'Plummer': 2,
                      'unprofessional': 1,
                      'constituted': 4,
                      'Ratified': 1,
                      'sombre': 2,
                      'Welfare': 2,
                      'neighbor': 3,
                      'vehicles': 2,
                      '1959': 5,
                      'managing': 1,
                      'False': 1,
                      "Childhood's": 5,
                      'yielded': 1,
                      'Mannerhouse': 1,
                      'shadow': 2,
                      'concerto': 1,
                      'generalized': 2,
                      'fabric': 3,
                      'Hutton': 1,
                      'hogs': 1,
                      'adieu': 1,
                      'indecision': 1,
                      'Bright': 6,
                      'viability': 2,
                      'Tuscany': 1,
                      'outset': 2,
                      'dying': 1,
                      'inability': 6,
                      'allegro': 1,
                      'Merchant': 2,
                      'candid': 1,
                      'Gaieties': 3,
                      '1961': 4,
                      'breadth': 2,
                      '6th': 2,
                      'hunting': 1,
                      'run': 18,
                      'absorbing': 1,
                      'web': 1,
                      'demanded': 4,
                      'entitle': 3,
                      'grandchildren': 1,
                      'songbook': 1,
                      'uncomfortable': 3,
                      'denies': 3,
                      'knack': 1,
                      'themselves': 72,
                      'stronghold': 1,
                      'logically': 2,
                      'Whig': 6,
                      'interludes': 1,
                      'Santa': 5,
                      'Plymouth': 4,
                      'actuality': 3,
                      'Good': 3,
                      'Howard': 2,
                      'S-D': 1,
                      'somnolence': 1,
                      'speaks': 4,
                      'elegant': 2,
                      'lift': 3,
                      'pulled': 5,
                      'sprinkling': 1,
                      'domestic': 6,
                      'pursues': 1,
                      'radio': 5,
                      'university-educated': 1,
                      'coat': 4,
                      'Oscar': 1,
                      'Kitchin': 1,
                      'Frederick': 5,
                      'due': 8,
                      'fiction-writing': 1,
                      'attainment': 1,
                      'demander': 1,
                      'seeming': 2,
                      'semi-catatonic': 1,
                      'business-minded': 1,
                      'problem': 40,
                      'pulse': 2,
                      'infantile': 1,
                      '1930': 2,
                      'Kingsley': 1,
                      'percentage': 4,
                      'Apostles': 1,
                      'shawls': 2,
                      'dares': 1,
                      'Rhine': 6,
                      'looking': 21,
                      'dictate': 2,
                      'Journal': 1,
                      'nuns': 1,
                      'sixth': 1,
                      'Watch': 1,
                      'clever': 2,
                      'humor': 6,
                      'baggy': 1,
                      "Agamemnon's": 1,
                      'Pendleton': 2,
                      'delegation': 5,
                      'sentimental': 2,
                      'introductions': 1,
                      'supervened': 1,
                      "Bradbury's": 2,
                      'ranked': 2,
                      'exemplar': 2,
                      'slowly': 10,
                      'transcendence': 1,
                      'H.L.': 1,
                      'bombers': 4,
                      'filth': 1,
                      'atmosphere': 10,
                      '4,585': 1,
                      '1821': 1,
                      '1859': 1,
                      'announcements': 1,
                      'Eh': 1,
                      'equal': 9,
                      'instrument': 14,
                      'Especially': 1,
                      'Lawrence': 10,
                      'Beauchamps': 1,
                      'stripes': 1,
                      'rich': 12,
                      'Wilson': 4,
                      'Gatsby': 1,
                      'spurious': 1,
                      'pride': 6,
                      'Franklin': 9,
                      'compulsion': 3,
                      'brute': 1,
                      'Reflections': 1,
                      'Makers': 1,
                      'ran': 22,
                      'wagons': 1,
                      'optimism': 3,
                      'habitually': 1,
                      'morals': 3,
                      'tiny': 6,
                      'Reformation': 2,
                      'Tomonggong': 1,
                      'inventory': 2,
                      'socio-political': 1,
                      'Kooning': 1,
                      'A.M.': 2,
                      'besmirching': 1,
                      'adapted': 2,
                      'stairway': 1,
                      'cutthroat': 1,
                      'buildings': 9,
                      'between': 146,
                      'cost': 5,
                      '1643': 1,
                      'courteously': 1,
                      'intimate': 2,
                      'premieres': 2,
                      'outmoded': 1,
                      'solar': 3,
                      "Toscanini's": 1,
                      'smacks': 1,
                      'proletariat': 2,
                      'tombstone': 1,
                      'masterpiece': 5,
                      'Atlas': 5,
                      'Regiment': 2,
                      'improve': 2,
                      'Anderson': 2,
                      'typifying': 1,
                      'Olgivanna': 7,
                      'companions': 5,
                      'higher': 12,
                      'constructively': 1,
                      'Ptolemaists': 1,
                      'foliage': 1,
                      'Stanley': 2,
                      'Sadie': 5,
                      'firmly': 12,
                      'Until': 4,
                      'Hideous': 1,
                      'complection': 1,
                      'obscenity': 1,
                      'planetoid': 1,
                      'braweling': 1,
                      'hospitable': 2,
                      'dismissed': 3,
                      'shabbily': 1,
                      'told': 57,
                      'composers': 3,
                      'discovery': 7,
                      'transcend': 1,
                      'The': 1030,
                      'oratory': 1,
                      'heat': 5,
                      'D.': 4,
                      'facile': 1,
                      'repudiating': 1,
                      'merited': 1,
                      'of': 6289,
                      'wherefores': 1,
                      "Morgan's": 5,
                      'non-interference': 1,
                      'withdrawn': 1,
                      'law': 58,
                      'bankers': 1,
                      '1920s': 1,
                      'flappers': 1,
                      "barn-burner's": 1,
                      'two-room': 1,
                      "Pandora's": 2,
                      'police': 21,
                      'Seebohm': 4,
                      'impending': 2,
                      'Proprietorships': 3,
                      'conception': 12,
                      'bold': 4,
                      'Silk': 1,
                      'Wordsworth': 2,
                      'self-destructive': 1,
                      'wars': 6,
                      'Sierra': 1,
                      'gastronomy': 1,
                      'dispersed': 1,
                      'maid': 6,
                      'facts': 19,
                      'recreation': 1,
                      'unusually': 1,
                      'eschew': 1,
                      'terror-stricken': 1,
                      'peculiar': 8,
                      'Hooghli': 1,
                      'factory': 2,
                      'entry': 2,
                      "America's": 4,
                      'carelessly': 1,
                      'outside': 25,
                      'detain': 1,
                      'arc': 1,
                      'Apology': 1,
                      'book': 58,
                      "Ward's": 1,
                      'heliotrope': 1,
                      'immoralities': 1,
                      'mentions': 3,
                      'jacket': 2,
                      'Resolves': 1,
                      'psychopomp': 1,
                      'sullen': 1,
                      'Brigade': 1,
                      'presents': 6,
                      'pump': 1,
                      'endlessly': 1,
                      'give': 56,
                      'loathing': 1,
                      'Sentiments': 1,
                      'Dashiell': 3,
                      'presaged': 1,
                      'incompatible': 1,
                      'boring': 1,
                      'cheek': 1,
                      'dishes': 1,
                      'treason': 1,
                      'mobility': 1,
                      'infatuation': 2,
                      'hindmost': 1,
                      "Auerbach's": 1,
                      "Czarina's": 3,
                      'tunnel': 1,
                      'blond': 1,
                      'Nineteenth': 4,
                      'waving': 3,
                      'diverse': 2,
                      'strenuous': 3,
                      'protestations': 2,
                      'wholeness': 2,
                      'applying': 2,
                      'critical': 12,
                      'hoodlums': 1,
                      'missing': 2,
                      'strivings': 2,
                      'explanation': 7,
                      'softened': 1,
                      'tablet': 2,
                      'fiendish': 1,
                      'yff': 1,
                      'N': 2,
                      'after': 123,
                      'conjectures': 1,
                      'questioning': 5,
                      'reached': 21,
                      'arrangements': 6,
                      'Riesman': 1,
                      'nectar': 1,
                      'response': 16,
                      'Japan': 2,
                      'bargain': 1,
                      'authority': 9,
                      'gingham': 1,
                      'Perhaps': 19,
                      'grace': 10,
                      'fitness': 2,
                      'punctuated': 2,
                      'missile': 3,
                      'strikingly': 1,
                      'big': 18,
                      'hadd': 1,
                      'Soiree': 1,
                      'centre': 2,
                      'Sidney': 5,
                      'nourished': 1,
                      'Erde': 1,
                      'attempted': 6,
                      'completely': 15,
                      'facaded': 1,
                      'cold': 12,
                      'opened': 9,
                      'left': 48,
                      'radioactive': 1,
                      'scaffoldings': 1,
                      'Willa': 1,
                      'Holy': 7,
                      'blast': 2,
                      'stood': 19,
                      'unfortunately': 2,
                      'deck': 4,
                      'lend': 2,
                      'reminding': 1,
                      'street': 14,
                      'doorknob': 1,
                      'sins': 4,
                      'utmost': 2,
                      'Burma': 7,
                      'clothier': 1,
                      'hookup': 1,
                      'patent': 2,
                      'ludicrous': 1,
                      'priest': 2,
                      '$200': 1,
                      'cares': 1,
                      'heresy': 1,
                      'town': 35,
                      'fulfilled': 6,
                      'compresses': 1,
                      'Lymington': 1,
                      'Knightes': 1,
                      'Manley': 3,
                      'mai': 1,
                      'commonplace': 4,
                      'cab': 2,
                      'colder': 2,
                      'Yankee': 2,
                      'steer': 1,
                      'cheerfulness': 1,
                      'three': 59,
                      'shortsighted': 2,
                      'declaring': 4,
                      'dessier': 1,
                      'alcoholics': 1,
                      'wander': 1,
                      'presidential': 1,
                      'wage': 1,
                      'concerted': 1,
                      'virtue': 12,
                      'pure': 19,
                      'Court-packing': 1,
                      'extended': 8,
                      'sluices': 1,
                      'patiently': 1,
                      'intrigues': 1,
                      'singularity': 1,
                      'joy': 10,
                      'beast': 1,
                      'invasions': 9,
                      'continuity': 10,
                      'sundry': 2,
                      'rehearsed': 1,
                      'complaisance': 1,
                      'A.': 9,
                      'self-analysis': 1,
                      'treasured': 1,
                      'grows': 2,
                      'Merit': 1,
                      'Smithsonian': 3,
                      'receives': 5,
                      'be': 843,
                      'wealth': 8,
                      'names': 11,
                      'informational': 2,
                      'tarnished': 1,
                      'Crawford': 1,
                      'culture': 25,
                      'witches': 2,
                      'abundance': 4,
                      'forget': 13,
                      'heap': 1,
                      'conclusion': 10,
                      'grocers': 1,
                      'reasons': 16,
                      'uncompromising': 3,
                      'invent': 4,
                      'minutes': 11,
                      'remote': 4,
                      'touching': 4,
                      'Head': 1,
                      'joined': 10,
                      "He's": 6,
                      'bullshit': 1,
                      'docile': 1,
                      'converge': 3,
                      "Lilly's": 3,
                      'farfetched': 2,
                      'sharpened': 1,
                      'afternoon': 13,
                      'Presupposed': 1,
                      'Fascism': 2,
                      'barnsful': 1,
                      'entirety': 1,
                      'commercial': 3,
                      'exceptional': 3,
                      'central': 18,
                      'questioners': 1,
                      'soloist': 2,
                      'fail': 2,
                      'Stratford': 6,
                      'burial': 2,
                      'suitably': 1,
                      'force': 36,
                      'employ': 3,
                      'replaced': 5,
                      'youth': 15,
                      'prevented': 3,
                      'obligations': 1,
                      'dinners': 1,
                      'brushwork': 1,
                      "child's": 4,
                      'ate': 2,
                      'alone': 19,
                      'Cumberland': 1,
                      'beneath': 12,
                      'compels': 1,
                      'schoolmaster': 3,
                      'backbone': 1,
                      'into': 246,
                      'principally': 1,
                      'solstice': 1,
                      'monument': 5,
                      'edited': 2,
                      'deviation': 1,
                      'Abraham': 1,
                      'address': 9,
                      'skipped': 1,
                      'let': 53,
                      'lewdly': 1,
                      'sprung': 1,
                      "Paula's": 2,
                      'lodgment': 1,
                      'postman': 1,
                      'inhumane': 1,
                      'necessity': 8,
                      'brother': 23,
                      'unworthy': 2,
                      'managed': 5,
                      'sterile': 1,
                      'grapple': 1,
                      'render': 2,
                      'method': 22,
                      'murder': 9,
                      'shakes': 1,
                      'Travelers': 1,
                      'grown-up': 2,
                      'sweeping': 3,
                      'Lillian': 1,
                      '2': 25,
                      'emigrating': 1,
                      'serene': 3,
                      "Dick's": 1,
                      'blinked': 1,
                      'dispatched': 1,
                      'artist-author': 1,
                      'elaborated': 2,
                      'patter': 1,
                      'bicycles': 1,
                      'short-range': 1,
                      'style': 13,
                      'switch': 3,
                      '1639': 1,
                      'Amra': 1,
                      'Gordon': 1,
                      'electric': 1,
                      'reunion': 1,
                      'jammed': 1,
                      'ruffles': 1,
                      'honeymoon': 1,
                      'hypothesis': 4,
                      'absurd': 6,
                      "I'm": 9,
                      'corners': 3,
                      'convalescence': 1,
                      'volunteered': 1,
                      'engages': 1,
                      'Englishy': 1,
                      'Otherwise': 2,
                      'Date': 1,
                      'pastime': 2,
                      'Containment': 1,
                      'focused': 1,
                      'Land': 6,
                      'overly': 1,
                      'tawny': 2,
                      "Williamson's": 1,
                      'preoccupied': 4,
                      'naked': 7,
                      'commands': 10,
                      'clothes': 3,
                      'Constitution': 30,
                      'qualities': 16,
                      'converse': 1,
                      'gruff': 1,
                      'incestuous': 3,
                      'released': 2,
                      'bench': 1,
                      'aggravated': 1,
                      'alleged': 4,
                      'tortures': 2,
                      'lodging': 1,
                      'chronology': 2,
                      'attracted': 6,
                      'rechartering': 1,
                      'hide': 5,
                      'feudal': 5,
                      'ingeniously': 1,
                      'asinine': 1,
                      'dynasties': 1,
                      'malnourished': 1,
                      'obsolete': 2,
                      'invasion': 4,
                      'picket': 2,
                      'parked': 2,
                      'from': 649,
                      'atypical': 1,
                      'B.C.': 1,
                      "four-o'clock": 1,
                      'Hot': 2,
                      'ages': 1,
                      'libraries': 3,
                      'frame': 11,
                      'happenstance': 1,
                      'atheists': 1,
                      'reprinted': 1,
                      'frail': 1,
                      'modulation': 2,
                      'Mother': 2,
                      'peonies': 1,
                      'unreconstructed': 5,
                      'environing': 1,
                      'possessor': 1,
                      "han'": 4,
                      'Communion': 1,
                      'adjunct': 1,
                      'studied': 12,
                      'Moloch': 1,
                      'recognizable': 3,
                      'dystopias': 8,
                      'sticks': 2,
                      'East': 15,
                      'retainers': 1,
                      'orphans': 1,
                      'frequently': 18,
                      'minutiae': 1,
                      'During': 21,
                      'Danchin': 1,
                      'brittle': 1,
                      'scientifically-trained': 1,
                      'sells': 1,
                      'Obviously': 5,
                      'apocryphal': 1,
                      'larkspur': 1,
                      'estates': 1,
                      'unpacking': 1,
                      'destruction': 8,
                      'Prime': 1,
                      "Washington's": 2,
                      'hub': 1,
                      'city': 36,
                      'attributes': 2,
                      'sex': 32,
                      'wall': 5,
                      'active': 9,
                      'hosts': 1,
                      'keeping': 5,
                      'Bruno': 1,
                      'imbedded': 1,
                      'easier': 4,
                      'consciousness': 10,
                      'plans': 9,
                      'accordance': 3,
                      'Shotwell': 1,
                      'hearsay': 1,
                      "Hammarskjold's": 1,
                      'Sam': 7,
                      'invite': 1,
                      'heart': 22,
                      'primary': 17,
                      'national': 37,
                      'Simon': 1,
                      'enables': 1,
                      'Brownlow': 2,
                      'amenable': 1,
                      'father-brother': 1,
                      'Photographer': 1,
                      'Israelite': 1,
                      'authoritarian': 2,
                      'rage': 5,
                      'conscientious': 1,
                      "lady's": 1,
                      'called': 62,
                      'opinions': 13,
                      'slanderer': 3,
                      '1865': 2,
                      'blues': 4,
                      'fictional': 10,
                      'reverse': 2,
                      'victory': 11,
                      'divinity': 1,
                      'adjusted': 2,
                      'Straits': 1,
                      'merry': 3,
                      'Too': 1,
                      'sighted': 1,
                      'slumped': 1,
                      'G.': 4,
                      'shopping': 2,
                      'deals': 4,
                      'childlike': 3,
                      'Esther': 8,
                      'ripening': 1,
                      'Equator': 1,
                      'organized': 18,
                      'irksome': 1,
                      'solution': 6,
                      'born': 24,
                      'propel': 1,
                      'Shirt': 1,
                      'chance': 20,
                      'spirits': 7,
                      "Alfred's": 3,
                      'compel': 1,
                      'Southern': 68,
                      'fellow': 13,
                      'Market': 3,
                      'pigpens': 1,
                      'patrolling': 1,
                      'Amadee': 4,
                      'commonly': 2,
                      'Bean': 1,
                      'headquarters': 10,
                      'utopia': 3,
                      'mars': 1,
                      'Dwight': 1,
                      'basket': 2,
                      'suffer': 12,
                      'generates': 2,
                      'complained': 6,
                      'arcaded': 1,
                      '19': 3,
                      'output': 2,
                      'relationships': 10,
                      '1632': 4,
                      'sixty-five': 1,
                      'face-lifting': 1,
                      'shattering': 1,
                      'college': 27,
                      'envisioned': 1,
                      'unremitting': 1,
                      'public-school': 1,
                      'Frederik': 1,
                      'represent': 3,
                      'distinct': 8,
                      'intentions': 2,
                      'noble': 3,
                      'Nor': 4,
                      'sharers': 1,
                      'Unlike': 3,
                      'dinosaur': 1,
                      '$20,000,000': 1,
                      'hated': 3,
                      'therewith': 1,
                      'imposed': 1,
                      'campus': 5,
                      'contemptuously': 1,
                      'contrived': 1,
                      'obituaries': 1,
                      'self-employed': 1,
                      'Sut': 1,
                      'five': 27,
                      'repulsed': 1,
                      'snapped': 2,
                      'overtake': 1,
                      'common': 49,
                      'poems': 29,
                      'less': 67,
                      'outgoing': 2,
                      'scraping': 1,
                      'thrusting': 1,
                      'coined': 2,
                      'congruent': 2,
                      'participate': 5,
                      'Hearst': 48,
                      "Banks's": 1,
                      'deacon': 1,
                      'antiquarian': 1,
                      'temperament': 1,
                      'Pomham': 2,
                      'smaller': 5,
                      'ceremonies': 2,
                      'Rousseauan': 1,
                      'volens': 1,
                      'differentiate': 1,
                      'shall': 34,
                      'Greek': 23,
                      'larks': 1,
                      'innocence': 19,
                      'magnificent': 4,
                      'cynically': 1,
                      'unconsciously': 1,
                      'decreed': 1,
                      'Report': 1,
                      'detectives': 1,
                      'thinks': 4,
                      'shells': 1,
                      '$5000': 4,
                      "We'd": 1,
                      'primitive': 14,
                      'exclamations': 2,
                      'intrusive': 1,
                      'lange': 1,
                      'Crittenden': 3,
                      'eclectic': 1,
                      'father-murder': 1,
                      'topped': 2,
                      'confiscating': 1,
                      'claims': 13,
                      'executives': 1,
                      'Van': 12,
                      'scion': 1,
                      'seventeenth': 5,
                      'conductor': 7,
                      'grape-arbor': 3,
                      'wallet': 1,
                      'symbolize': 1,
                      'Cooper': 1,
                      'restitution': 1,
                      '2,000': 2,
                      'Start': 1,
                      'swim': 3,
                      'radius': 1,
                      'U.': 1,
                      'Kurd': 1,
                      'Brother': 1,
                      ...}),
             'editorial': Counter({'Without': 5,
                      'vacuum': 1,
                      'lighted': 1,
                      'contented': 1,
                      'RCA': 1,
                      'Dupont': 1,
                      'taxing': 1,
                      'More': 6,
                      '23': 5,
                      'festival': 1,
                      'speak': 4,
                      'located': 2,
                      'education': 6,
                      'house': 8,
                      'knobby-knuckled': 1,
                      'promises': 4,
                      'skilled': 1,
                      'muscles': 1,
                      'dispatch': 3,
                      'darling': 1,
                      'Wilmette': 1,
                      'epoch-making': 1,
                      'building': 4,
                      'Adams': 2,
                      'infernally': 1,
                      'Spirits': 2,
                      'operating': 2,
                      '$133': 1,
                      'use': 30,
                      'enticing': 1,
                      'Lower': 1,
                      'killed': 5,
                      'handicrafts': 1,
                      'experience': 8,
                      'work-weary': 1,
                      'served': 6,
                      'committee': 10,
                      'bars': 4,
                      'sports': 2,
                      'population': 8,
                      'Escalation': 2,
                      'tempted': 2,
                      'migrating': 1,
                      'thoroughly': 2,
                      'February': 1,
                      'surrendering': 2,
                      'Much': 2,
                      "lyin'": 1,
                      'forum': 2,
                      'enamel': 1,
                      "Mayor's": 2,
                      'certain': 13,
                      'season': 6,
                      'zeal': 2,
                      'demanded': 5,
                      'shade': 2,
                      'others': 19,
                      "Luther's": 1,
                      'budgets': 2,
                      'button-down': 1,
                      'transition': 3,
                      'Laboratory': 1,
                      'Persianesque': 1,
                      'catch': 1,
                      'absurdity': 1,
                      'chancellor': 2,
                      'beautiful': 8,
                      'intimidation': 1,
                      'shorter': 2,
                      'Loon': 1,
                      'Mar.': 2,
                      'Jemela': 1,
                      'dictator': 1,
                      'denying': 1,
                      "Democrats'": 1,
                      'second-class': 1,
                      'retains': 5,
                      'taverns': 1,
                      'grotesquely': 2,
                      'imposition': 1,
                      'tyrant': 1,
                      'Enough': 2,
                      'Stevens': 3,
                      'constituents': 1,
                      'chicken': 1,
                      'determine': 3,
                      'amend': 1,
                      '57': 1,
                      'correct': 3,
                      'toughest': 1,
                      'glance': 1,
                      'kitchen': 1,
                      'amount': 11,
                      'glances': 1,
                      'pressure': 14,
                      'Short': 1,
                      'scientists': 5,
                      'besetting': 1,
                      'adviser': 2,
                      "Times'": 1,
                      "wouldn't": 4,
                      'N.': 2,
                      'scratching': 1,
                      'through': 23,
                      'Volumes': 1,
                      'Just': 7,
                      'specific': 3,
                      'battle-cry': 1,
                      'each': 30,
                      'Because': 2,
                      "Board's": 1,
                      "Lucas's": 1,
                      'Justice': 4,
                      'tons': 4,
                      'France': 7,
                      'ceased': 1,
                      'Democrats': 12,
                      'appropriating': 1,
                      'cottage': 1,
                      'shy': 1,
                      'raised': 9,
                      'stray': 1,
                      'stream': 1,
                      'revealed': 3,
                      'grave': 4,
                      'Fortress': 1,
                      'sprung': 2,
                      'believing': 1,
                      'oversimplified': 1,
                      'fabulous': 1,
                      'eternal': 1,
                      'Kokoschka': 1,
                      'situations': 1,
                      'tacitly': 1,
                      'blackout': 2,
                      'At': 16,
                      ')': 95,
                      'seas': 3,
                      'economizing': 1,
                      'colleagues': 1,
                      'staked': 1,
                      'Those': 1,
                      'surprisingly': 1,
                      'enlist': 1,
                      'Nelson': 1,
                      'Murat': 1,
                      'operational': 1,
                      'unforseen': 1,
                      'career': 1,
                      'idly': 1,
                      'husband': 5,
                      'friends': 12,
                      'Prime': 1,
                      'shouting': 2,
                      'sands': 1,
                      'slip': 1,
                      'negative': 4,
                      'thoroughfare': 1,
                      'small': 27,
                      'disreputable': 1,
                      'jam': 2,
                      'enemy': 5,
                      'live-oak': 1,
                      'competition': 4,
                      'recruits': 1,
                      'sect': 1,
                      'Q3': 1,
                      'recession': 3,
                      'missiles': 1,
                      'recently': 11,
                      'enforced': 1,
                      'unbalanced': 2,
                      'steamship': 1,
                      'legislation': 4,
                      'next': 25,
                      'accelerating': 1,
                      'strike': 4,
                      'dead-end': 1,
                      'shall': 19,
                      'searching': 3,
                      'really': 19,
                      'reputation': 3,
                      'broad': 3,
                      'external': 2,
                      "McCone's": 1,
                      'Please': 1,
                      'comforting': 1,
                      'rushed': 2,
                      'Over': 2,
                      'signals': 1,
                      'effect': 13,
                      'whites': 5,
                      'sufficiency': 1,
                      'hunk': 1,
                      'cell': 3,
                      'King': 10,
                      'Nashville': 4,
                      'Nkrumah': 2,
                      'spending': 4,
                      'Remembering': 1,
                      'outsmarted': 1,
                      'evening': 6,
                      'prisoners': 2,
                      'probation': 1,
                      'survivors': 1,
                      'twelve': 2,
                      'Joseph': 1,
                      'hymn': 1,
                      'Dunes': 1,
                      'tip': 1,
                      'relax': 1,
                      'zoned': 1,
                      'wives': 1,
                      'Possibly': 1,
                      'intends': 2,
                      'ask': 7,
                      'assassination': 1,
                      'found': 18,
                      'Truman': 3,
                      'win': 8,
                      '$1,450,000,000': 1,
                      'Cooch': 2,
                      'domes': 1,
                      'wonders': 2,
                      'mother': 16,
                      'Caribbean': 2,
                      'Chiang': 2,
                      'things': 20,
                      'Now': 13,
                      'thank': 2,
                      'oratorical': 1,
                      'department': 6,
                      'independent': 5,
                      'divided': 2,
                      'suppose': 4,
                      'feed': 6,
                      'vertical': 1,
                      '18': 6,
                      'drawn': 4,
                      'stops': 1,
                      'bat': 1,
                      'Lebanese': 2,
                      'advisers': 3,
                      'half-million': 1,
                      'one-third': 1,
                      'hatred': 1,
                      'hooliganism': 1,
                      'German': 17,
                      'Molly': 1,
                      'compulsive': 1,
                      'Engaged': 1,
                      'signal': 1,
                      '1959': 3,
                      'Pauling': 1,
                      'musicians': 1,
                      'yielded': 1,
                      'tenacious': 1,
                      'enjoyable': 1,
                      'fabric': 1,
                      'painstakingly': 1,
                      "K's": 1,
                      'flights': 1,
                      'Hence': 2,
                      'outset': 2,
                      'dying': 3,
                      'cycles': 1,
                      'required': 5,
                      'front': 2,
                      'conceivable': 3,
                      '1961': 8,
                      'strip': 1,
                      'curiously': 1,
                      'Revised': 2,
                      'God': 11,
                      'run': 11,
                      'discussed': 1,
                      'stores': 4,
                      'I.': 3,
                      'careful': 4,
                      'restudy': 1,
                      'lilting': 1,
                      'barest': 1,
                      'feather': 4,
                      'dimension': 1,
                      'uncomfortable': 1,
                      'themselves': 22,
                      'developing': 2,
                      'academic': 1,
                      'dome': 1,
                      'sunshine': 1,
                      'loads': 1,
                      'manmade': 1,
                      '$8.5': 1,
                      'replacing': 2,
                      'Church': 15,
                      'Dec.': 1,
                      "Young's": 1,
                      'execution': 1,
                      'Howard': 1,
                      'reader': 2,
                      'clean': 3,
                      'sophomore': 1,
                      'pulled': 1,
                      'continuous': 2,
                      'cautioned': 1,
                      'framed': 1,
                      'mercy': 1,
                      'sentences': 2,
                      'methods': 4,
                      'visited': 4,
                      'disposed': 1,
                      'Also': 4,
                      'radio': 8,
                      'defensive': 1,
                      'sadder': 1,
                      'northeast': 1,
                      'Elmer': 1,
                      "they're": 3,
                      'Oscar': 1,
                      'Japanese': 2,
                      'exchange': 3,
                      'Frederick': 3,
                      'due': 12,
                      'seeming': 1,
                      'unabashed': 1,
                      'furniture': 1,
                      'sixty': 1,
                      'manpower': 1,
                      'broad-brimmed': 1,
                      'problem': 18,
                      'curtail': 1,
                      'modification': 1,
                      'winter': 3,
                      'distance': 3,
                      'walking': 2,
                      'percentage': 2,
                      'receive': 9,
                      'specialize': 1,
                      'proffered': 1,
                      'enlightened': 1,
                      'looking': 5,
                      'Journal': 3,
                      'British': 18,
                      'trading': 1,
                      'expressway': 2,
                      'Time': 2,
                      'periods': 1,
                      'spenders': 1,
                      'heyday': 1,
                      'signposts': 1,
                      'lavishing': 1,
                      'collected': 2,
                      'Big': 1,
                      'descending': 2,
                      'pointed': 2,
                      'tentacle': 1,
                      'Usually': 3,
                      'boxed': 1,
                      'believes': 3,
                      'determination': 3,
                      '21-22': 1,
                      'evacuation': 1,
                      'slowly': 2,
                      'place': 16,
                      'piece': 9,
                      'treelike': 1,
                      'committees': 2,
                      'payroll': 1,
                      'underestimate': 1,
                      'first': 66,
                      'atmosphere': 12,
                      'buildup': 1,
                      'union': 7,
                      'melting': 2,
                      "Batista's": 2,
                      '1821': 1,
                      'land': 18,
                      'announcements': 1,
                      '110': 1,
                      'unfairly': 1,
                      'could': 56,
                      'Especially': 1,
                      'enjoying': 2,
                      'Lawrence': 2,
                      'conform': 2,
                      'audience': 1,
                      'rich': 4,
                      'stopped': 4,
                      'Wilson': 3,
                      'related': 4,
                      'Mayor-elect': 1,
                      'appreciation': 3,
                      'developed': 3,
                      'pride': 2,
                      'amalgamated': 1,
                      'Franklin': 3,
                      'margins': 1,
                      'uni-directional': 1,
                      'Juan': 1,
                      'Ind.': 1,
                      'search': 1,
                      'comprising': 1,
                      'ran': 3,
                      'optimism': 1,
                      'meant': 4,
                      'reek': 1,
                      'intent': 2,
                      'tiny': 2,
                      'M-1': 1,
                      'accountability': 3,
                      'picture': 7,
                      'Yorkers': 1,
                      'curious': 1,
                      'unemployment': 8,
                      'adapted': 1,
                      'Interpretation': 2,
                      'buildings': 1,
                      'between': 36,
                      'cost': 10,
                      '1953': 1,
                      'premieres': 1,
                      'beam': 1,
                      'Inter-American': 3,
                      'outmoded': 1,
                      'proletariat': 1,
                      'masterpiece': 1,
                      'recognize': 8,
                      'Montgomery': 1,
                      "En-lai's": 1,
                      'marked': 2,
                      'wagon': 3,
                      'higher': 4,
                      "Lincoln's": 2,
                      'inflicting': 1,
                      'Security': 2,
                      'Traffic': 1,
                      'vopos': 1,
                      'Stanley': 2,
                      'nod': 1,
                      'Heights': 1,
                      'firmly': 3,
                      'Until': 4,
                      'backed': 3,
                      'banner': 2,
                      'handicapped': 7,
                      'unemployed': 3,
                      'endowments': 1,
                      'dismissed': 1,
                      'told': 9,
                      'greatness': 1,
                      'civic': 5,
                      'Transportation': 1,
                      'waits': 1,
                      'The': 453,
                      'promise': 3,
                      'D.': 6,
                      'Hodges': 3,
                      'riders': 2,
                      'of': 1976,
                      'Pets': 1,
                      'AEC': 2,
                      'celebrated': 1,
                      'law': 23,
                      'tackle': 1,
                      'Moon-faced': 1,
                      'Reply': 5,
                      'police': 4,
                      'Snodgrass': 1,
                      "sanctuary's": 1,
                      'impending': 1,
                      'transact': 1,
                      'awkwardly': 1,
                      'corner': 3,
                      'Trujillos': 2,
                      'bold': 1,
                      'contribution': 4,
                      'protest': 3,
                      'embroidery': 4,
                      'resigned': 1,
                      'lo': 1,
                      "ladies'": 1,
                      'numb': 1,
                      'Sp': 2,
                      'facts': 9,
                      'recreation': 2,
                      'Confidence': 1,
                      'Hazlitt': 1,
                      'disastrous': 5,
                      '285': 1,
                      'peculiar': 2,
                      'display': 2,
                      'well': 32,
                      'scale': 1,
                      'funny': 1,
                      'datelined': 4,
                      'factory': 4,
                      'entry': 2,
                      "America's": 4,
                      'outside': 10,
                      'zones': 1,
                      'hung': 1,
                      'relation': 3,
                      'book': 19,
                      "Rogues'": 1,
                      'suggested': 6,
                      'harshly': 1,
                      'employees': 2,
                      'whipping-boys': 1,
                      'jacket': 2,
                      'bound': 2,
                      'conventions': 3,
                      'intend': 1,
                      'presents': 2,
                      'endlessly': 1,
                      'purpose': 10,
                      'broadened': 1,
                      'give': 20,
                      'outgrow': 1,
                      'contorted': 1,
                      'absolute': 2,
                      'membership': 6,
                      'Richards': 7,
                      'housing': 2,
                      'beauty': 4,
                      'gamut': 1,
                      'Calvin': 1,
                      'glove': 1,
                      'dealers': 2,
                      'Richard': 5,
                      'sanction': 1,
                      'allow': 2,
                      'critical': 1,
                      'library': 1,
                      'bifocals': 1,
                      'boost': 3,
                      'missing': 3,
                      'actually': 9,
                      'Several': 5,
                      'explanation': 2,
                      'replied': 1,
                      'after': 26,
                      'questioning': 1,
                      'reached': 9,
                      'M-4': 1,
                      'earthly': 2,
                      'anyway': 1,
                      'response': 3,
                      'authority': 10,
                      'Perhaps': 7,
                      'grace': 3,
                      'confused': 4,
                      'Audubon': 1,
                      'commercials': 2,
                      'football': 3,
                      'gallant': 1,
                      'strikingly': 1,
                      'big': 17,
                      'Sidney': 1,
                      'grenades': 1,
                      'riddled': 1,
                      'fills': 1,
                      '13-16': 1,
                      'Cod': 2,
                      'attempted': 1,
                      'completely': 5,
                      'alike': 3,
                      'cold': 11,
                      'shantung-like': 1,
                      'opened': 9,
                      'augmented': 2,
                      'Elliott': 1,
                      'rookies': 1,
                      'radioactive': 3,
                      'Broxodent': 2,
                      'nurses': 3,
                      'Crossman': 2,
                      'fate': 2,
                      'Holy': 2,
                      'woven': 1,
                      'blast': 2,
                      'stood': 1,
                      'unfortunately': 1,
                      'drinking': 1,
                      'Korea': 6,
                      'street': 5,
                      "CDC's": 2,
                      'bow': 1,
                      'doorknob': 1,
                      'long': 36,
                      'sins': 3,
                      'Immigration': 2,
                      'tooth': 1,
                      'current': 6,
                      'locally': 1,
                      'draftee': 1,
                      'bonus': 1,
                      'Enlargement': 1,
                      '$200': 1,
                      'worst-marked': 1,
                      'wears': 2,
                      'accuse': 1,
                      'town': 9,
                      'dropouts': 2,
                      'now': 76,
                      'Starlings': 1,
                      'dramatic': 4,
                      'taunts': 1,
                      "emperor's": 1,
                      'commonplace': 1,
                      'upsets': 1,
                      'reads': 4,
                      'suddenly': 4,
                      'De': 2,
                      'gets': 3,
                      'turtle': 5,
                      'Kerr': 1,
                      "we'll": 1,
                      'lay': 4,
                      'loss': 8,
                      'bigotry': 1,
                      'three': 15,
                      'declaring': 2,
                      'Ahmad': 1,
                      'automatic': 3,
                      'cage': 1,
                      'demand': 10,
                      'stock': 4,
                      'Inauguration': 1,
                      'Spring': 1,
                      "Griffin's": 1,
                      'downed': 1,
                      'lawmakers': 1,
                      'presidential': 1,
                      'wage': 2,
                      'clearer': 6,
                      'industrial': 18,
                      'perform': 2,
                      'organize': 2,
                      'extended': 2,
                      'pursuit': 1,
                      "Louis's": 1,
                      'patiently': 1,
                      'blossom': 1,
                      'stodgy': 1,
                      'motives': 1,
                      "community's": 1,
                      'forceful': 2,
                      'car': 12,
                      'A.': 10,
                      'crickets': 1,
                      "region's": 2,
                      'salad': 1,
                      'present': 25,
                      'describes': 1,
                      'comest': 1,
                      'receives': 1,
                      'be': 421,
                      'wealth': 2,
                      'alternative': 1,
                      'racing': 1,
                      'names': 4,
                      'ready': 16,
                      'exhaustion': 1,
                      'tarnished': 1,
                      'public-spirited': 1,
                      'Ten': 1,
                      'shares': 2,
                      'cons': 1,
                      'Publications': 1,
                      'engaged': 6,
                      'conclusion': 2,
                      'expected': 13,
                      'uncompromising': 1,
                      'auto': 2,
                      'minutes': 3,
                      'Montevideo': 1,
                      'Treasury': 2,
                      'remote': 1,
                      'joined': 1,
                      'Relative': 1,
                      "He's": 3,
                      'ye': 2,
                      'Organization': 8,
                      'load': 1,
                      'Breeding': 2,
                      'Word': 1,
                      'enforcement': 4,
                      'prisoner': 1,
                      'status': 4,
                      'commercial': 3,
                      'exceptional': 1,
                      'central': 5,
                      'fall-outs': 1,
                      'restaurants': 1,
                      'Vacancy': 1,
                      'fail': 3,
                      'bully': 1,
                      'hazard': 3,
                      'Glenn': 1,
                      'dignity': 1,
                      'force': 15,
                      'employ': 1,
                      'Mekong': 1,
                      'youth': 3,
                      'reassert': 1,
                      'abolition': 1,
                      'Transit': 1,
                      'sizes': 1,
                      'obligations': 5,
                      'first-class': 1,
                      'plates': 2,
                      'blackbirds': 1,
                      "child's": 1,
                      'Antoine': 1,
                      'lighting': 1,
                      'alone': 3,
                      'statutes': 2,
                      'cent': 19,
                      'Probable': 1,
                      'beneath': 1,
                      'parole': 1,
                      "You'd": 1,
                      'breathe': 1,
                      'declaration': 1,
                      'counseling': 1,
                      'into': 78,
                      'principally': 1,
                      'Initial': 1,
                      'confirms': 1,
                      'affiliated': 1,
                      'intriguingly': 1,
                      'edited': 2,
                      'deviation': 1,
                      'haul': 1,
                      'address': 2,
                      'keel': 1,
                      'else': 6,
                      'Robby': 1,
                      'justify': 6,
                      'let': 15,
                      'forecasts': 1,
                      'libertarians': 1,
                      'genuinely': 1,
                      'becoming': 1,
                      'May': 5,
                      'promising': 2,
                      'necessity': 4,
                      'team': 5,
                      'unworthy': 1,
                      'managed': 2,
                      'Hughes': 1,
                      'ruthlessly': 1,
                      'method': 4,
                      'murder': 2,
                      'election': 11,
                      'forgiven': 4,
                      'conceptions': 1,
                      'landscapes': 1,
                      '$1,750,000': 1,
                      '2': 19,
                      'decides': 1,
                      'hold': 6,
                      '400': 1,
                      'violent': 1,
                      'expert': 2,
                      'length': 4,
                      'directors': 1,
                      'revisions': 1,
                      "Russia's": 3,
                      'pupils': 3,
                      'rockets': 4,
                      'canning': 1,
                      'straining': 1,
                      'unmoved': 1,
                      "workers'": 1,
                      'resilience': 1,
                      'bruise': 1,
                      'Legion': 4,
                      'administrator': 1,
                      'wisely': 1,
                      'Communications': 2,
                      'reunion': 4,
                      'shots': 6,
                      'moneys': 1,
                      'jammed': 1,
                      'subjugate': 1,
                      'Atty.': 2,
                      'reservations': 2,
                      'totalitarianism': 2,
                      "I'm": 2,
                      'forgive': 9,
                      'attain': 1,
                      'Single-color': 1,
                      'Parkway': 3,
                      'around': 20,
                      'understands': 1,
                      'microfilm': 1,
                      'Curtain': 1,
                      'textures': 3,
                      'Continental': 2,
                      'Otherwise': 1,
                      'S.': 3,
                      'week-ends': 2,
                      '22,000': 1,
                      'Turkish': 3,
                      'growers': 1,
                      'go-it-alone': 1,
                      'Land': 1,
                      'shockingly': 1,
                      'naked': 1,
                      'standards': 4,
                      'qualities': 1,
                      'Authority': 4,
                      'underpaid': 1,
                      'secondary': 1,
                      'released': 1,
                      'ends': 2,
                      'possibilities': 1,
                      'aggravated': 1,
                      'alleged': 1,
                      'armaments': 3,
                      'Kirk': 1,
                      'Guevara': 1,
                      'Objects': 1,
                      'attracted': 2,
                      'controlled': 2,
                      'hide': 1,
                      'asinine': 1,
                      'Jagan': 2,
                      'interposed': 1,
                      'invasion': 2,
                      'substance': 1,
                      'from': 214,
                      'bogey-symbol': 1,
                      'ages': 3,
                      'warning': 1,
                      'appearance': 1,
                      'Karshilama': 1,
                      'ball': 4,
                      'migrates': 1,
                      'demonstrates': 1,
                      'grants': 1,
                      'donates': 1,
                      '72': 1,
                      'reality': 6,
                      'Taken': 1,
                      'continent': 3,
                      'jealousy': 1,
                      'instruction': 3,
                      'studied': 4,
                      'twenty-five': 2,
                      'Moloch': 2,
                      'appreciate': 1,
                      'fields': 2,
                      'years': 63,
                      'sticks': 1,
                      'East': 52,
                      'probably': 11,
                      'Representative': 1,
                      'Mo.': 1,
                      'frequently': 4,
                      "Sunday's": 1,
                      'During': 6,
                      'cans': 3,
                      'dusty-green': 1,
                      'Cold': 2,
                      'wry': 2,
                      'Obviously': 1,
                      'unsurpassed': 1,
                      'Paint': 1,
                      'graduates': 2,
                      'McCloy': 2,
                      '2-year-old': 3,
                      'or': 194,
                      'Law': 1,
                      "Don't": 3,
                      'states': 12,
                      'fifteen': 2,
                      'destruction': 3,
                      'Skill': 1,
                      "Washington's": 2,
                      'city': 40,
                      'flaming': 1,
                      'sex': 2,
                      'middle-age': 1,
                      'stomach': 1,
                      'folk': 1,
                      'chestnuts': 1,
                      "Congo's": 1,
                      'paramilitary': 1,
                      'lesser': 1,
                      'three-man': 1,
                      'backing': 1,
                      'five-hundred-year-old': 1,
                      'easier': 3,
                      'consciousness': 1,
                      'plans': 9,
                      'accordance': 1,
                      'uplift': 1,
                      'allowing': 3,
                      'general': 11,
                      'sprawled': 1,
                      "Hammarskjold's": 4,
                      'Gore': 1,
                      'presentations': 1,
                      'Sam': 8,
                      'invite': 1,
                      'diplomat': 1,
                      'heart': 12,
                      'primary': 4,
                      'bleak': 2,
                      'national': 25,
                      'tomorrow': 1,
                      'wishful': 2,
                      'top-drawer': 1,
                      'half-city': 1,
                      'expanding': 1,
                      'grasp': 2,
                      'conscientious': 3,
                      'collections': 1,
                      'called': 19,
                      'opinions': 3,
                      'financed': 1,
                      'Felix': 2,
                      'eye-to-eye': 1,
                      'pervades': 1,
                      'year': 52,
                      'Carbondale': 2,
                      'Moluccas': 1,
                      'victory': 3,
                      'Inquirer': 13,
                      'Too': 4,
                      'courage': 4,
                      'sighted': 1,
                      'painted': 4,
                      'agricultural': 5,
                      'Radio': 1,
                      'owes': 1,
                      'G.': 2,
                      'shopping': 2,
                      'year-earlier': 1,
                      'encampment': 1,
                      'organized': 2,
                      'wonder': 4,
                      'ascend': 1,
                      'objective': 7,
                      'solution': 7,
                      'squirms': 1,
                      'directly': 5,
                      'born': 5,
                      'propel': 2,
                      'sharply': 2,
                      'preference': 1,
                      'chance': 7,
                      'kinda': 1,
                      'watchdog': 1,
                      'Methodist': 3,
                      'situation': 15,
                      'Market': 5,
                      'mission': 1,
                      'optimistic': 2,
                      'not': 301,
                      'avail': 1,
                      'hard-liquor': 2,
                      'falsehood': 1,
                      'Morale': 1,
                      'taught': 5,
                      'headquarters': 2,
                      'atone': 1,
                      'Lili': 1,
                      'Dwight': 2,
                      'basket': 1,
                      'isolating': 1,
                      'M': 1,
                      'minorities': 1,
                      'suggesting': 1,
                      '19': 4,
                      'growth': 11,
                      'respective': 5,
                      'ethical': 6,
                      'suspensions': 1,
                      'pray': 3,
                      'college': 16,
                      'inborn': 1,
                      '15th': 2,
                      'placed': 7,
                      'distinct': 2,
                      'intentions': 3,
                      'Marcos': 1,
                      'noble': 3,
                      'illustrated': 2,
                      'retraction': 1,
                      'Nor': 3,
                      'Bell': 1,
                      'sliding': 1,
                      'peace-treaty': 1,
                      'Streets': 1,
                      'Bust': 1,
                      'arrangement': 2,
                      'campus': 2,
                      ...}),
             'fiction': Counter({'Without': 3,
                      'vacuum': 1,
                      'lighted': 4,
                      'laden': 1,
                      'dispatching': 1,
                      'More': 2,
                      'Anthony': 3,
                      'anyhow': 2,
                      'speak': 11,
                      'economics': 1,
                      'infectious': 1,
                      'education': 1,
                      'house': 54,
                      'skilled': 1,
                      "She'd": 2,
                      'building': 7,
                      'Apparel': 1,
                      'waters': 2,
                      'Between': 2,
                      'Ballestre': 2,
                      'junction': 1,
                      'restrain': 1,
                      'highboard': 1,
                      'use': 6,
                      'crags': 1,
                      'killed': 3,
                      'Ludie': 13,
                      'experience': 8,
                      'served': 5,
                      'jubilantly': 1,
                      'sports': 1,
                      'Plunking': 1,
                      'colts': 1,
                      'tempted': 1,
                      'Sex': 1,
                      'nods': 1,
                      'thoroughly': 1,
                      'February': 1,
                      'author': 1,
                      'pleading': 1,
                      'Bishopsgate': 2,
                      'plush': 1,
                      'abrupt': 1,
                      'certain': 8,
                      'wages': 3,
                      'heaved': 1,
                      'Tomorrow': 2,
                      'deal': 6,
                      'demanded': 7,
                      'shade': 1,
                      'others': 16,
                      'Wait': 2,
                      'half-understood': 1,
                      'mercy': 1,
                      'taut': 1,
                      'catch': 2,
                      'nauseated': 1,
                      'beautiful': 5,
                      'amazing': 3,
                      'shorter': 1,
                      'marriages': 1,
                      'truthfulness': 1,
                      'indefinable': 1,
                      'Mystery': 1,
                      'cheer': 1,
                      'cha-chas': 1,
                      'believes': 1,
                      'chicken': 3,
                      'determine': 1,
                      'drab': 1,
                      'slow-moving': 1,
                      'brimmed': 1,
                      'reborn': 1,
                      'cabins': 1,
                      'beadsman': 1,
                      'cups': 2,
                      'kitchen': 21,
                      'damnation': 1,
                      'amount': 2,
                      'Burt': 1,
                      'few': 28,
                      'crack': 3,
                      'Seeing': 2,
                      'cheek': 3,
                      'northward': 2,
                      'pompously': 1,
                      "wouldn't": 9,
                      'cheated': 1,
                      'scratching': 7,
                      'through': 60,
                      'Leona': 5,
                      'Know': 3,
                      'Just': 5,
                      "we'll": 1,
                      "Jessica's": 1,
                      'each': 33,
                      'Because': 1,
                      'hacked': 1,
                      'proclamation': 2,
                      'Coffee': 1,
                      'Sumter': 1,
                      'stakes': 1,
                      'Dimes': 1,
                      'France': 3,
                      "yesterday's": 1,
                      'ye': 2,
                      'urns': 1,
                      'postscript': 1,
                      'cottage': 1,
                      'shy': 2,
                      'raised': 9,
                      'stray': 3,
                      'stream': 4,
                      'ate': 6,
                      'Half': 2,
                      'clubbed': 1,
                      'Joshua': 2,
                      'Rousseau': 17,
                      'buggy': 2,
                      'keyhole': 1,
                      'fabulous': 1,
                      'eternal': 4,
                      'blackout': 1,
                      'fences': 1,
                      'At': 37,
                      "I'm": 28,
                      'apex': 1,
                      'colleagues': 1,
                      'Those': 5,
                      'surprisingly': 2,
                      'citations': 1,
                      'sizzled': 1,
                      'drop': 4,
                      'career': 4,
                      'baggage': 1,
                      'husband': 6,
                      'friends': 11,
                      'bet': 3,
                      'concierge': 2,
                      'shouting': 8,
                      'worse': 1,
                      'full-grown': 1,
                      'slip': 1,
                      'Suddenly': 2,
                      'consented': 2,
                      'small': 31,
                      'bull': 1,
                      'jam': 1,
                      'doorways': 1,
                      'enemy': 6,
                      'amuse': 1,
                      'tidings': 1,
                      'bitters': 1,
                      'knife': 7,
                      'hostess': 3,
                      'recently': 4,
                      'distinct': 1,
                      'darkly': 2,
                      'next': 29,
                      'armchair': 2,
                      'reared': 5,
                      'disagreed': 1,
                      'strike': 4,
                      'four': 14,
                      'searching': 3,
                      'really': 25,
                      'putting': 3,
                      'broad': 7,
                      'securely': 1,
                      'striving': 1,
                      'Styka': 16,
                      'borrowed': 1,
                      'stiffness': 1,
                      'effect': 8,
                      'bunk': 2,
                      'exact': 1,
                      'shawl': 2,
                      'whites': 4,
                      'wraith-like': 2,
                      'oranges': 2,
                      'shift': 3,
                      'giggled': 2,
                      'King': 3,
                      'poultry': 1,
                      'decrees': 1,
                      'spending': 3,
                      'evening': 17,
                      'prisoners': 2,
                      'dripped': 4,
                      'maniacal': 1,
                      'pungent': 1,
                      'Houses': 1,
                      'cannon': 2,
                      'telegraph': 1,
                      'Joseph': 4,
                      'tip': 3,
                      'coward': 1,
                      'relax': 3,
                      'Bureau': 1,
                      'McFeeley': 12,
                      'relief': 7,
                      'sixty-eight': 1,
                      'ask': 16,
                      'invested': 1,
                      'found': 42,
                      'hope': 5,
                      "one's": 5,
                      'Laurel': 1,
                      'mittens': 1,
                      'pretty': 10,
                      "afternoon's": 1,
                      'barges': 1,
                      'mother': 9,
                      'Jorge': 3,
                      'things': 31,
                      'Now': 30,
                      'independent': 1,
                      'squealed': 1,
                      'lewd': 1,
                      'wattles': 2,
                      'permission': 2,
                      'divided': 1,
                      'suppose': 5,
                      'Hall': 1,
                      'lyrics': 1,
                      'vertical': 1,
                      'cocked': 2,
                      'Wised': 1,
                      'drawn': 2,
                      'spur': 1,
                      'mischief': 1,
                      'sir': 6,
                      'Gilkson': 1,
                      'Verdi': 2,
                      'dark-brown': 1,
                      'yellowing': 2,
                      'hatred': 2,
                      'German': 1,
                      'signpost': 1,
                      'needled': 1,
                      'scream': 4,
                      'signal': 2,
                      'stropping': 1,
                      'deportees': 1,
                      'musicians': 4,
                      'Timon': 1,
                      'shoot': 2,
                      'shadow': 4,
                      "Eileen's": 3,
                      'marched': 4,
                      'stiffened': 3,
                      'Standing': 1,
                      'Tuscany': 1,
                      'dying': 10,
                      'required': 2,
                      'Cadillac': 4,
                      'season': 2,
                      'Eyke': 1,
                      'Cat': 1,
                      'Lord': 6,
                      'Double': 1,
                      'betrays': 1,
                      'gritty': 1,
                      'strip': 3,
                      'curiously': 2,
                      'run': 12,
                      'discussed': 5,
                      'merging': 1,
                      'stores': 2,
                      'textiles': 1,
                      'lust': 1,
                      "C'est": 2,
                      'web': 1,
                      'poncho': 3,
                      'effected': 2,
                      'unconcerned': 1,
                      'dimension': 1,
                      'uncomfortable': 2,
                      'themselves': 5,
                      'developing': 1,
                      'Anabaptists': 1,
                      'experiences': 2,
                      'Corault': 1,
                      'floundering': 1,
                      'stumps': 1,
                      'Church': 17,
                      'Santa': 1,
                      'Plymouth': 1,
                      'campmate': 1,
                      'sluggishly': 1,
                      'Concorde': 1,
                      'Howard': 13,
                      'reader': 1,
                      'clean': 12,
                      'sophomore': 1,
                      'pulled': 6,
                      'framed': 2,
                      'sentences': 1,
                      'visited': 2,
                      'loaded': 1,
                      'messieurs': 1,
                      'meticulously': 1,
                      'Also': 1,
                      'wanting': 3,
                      'coat': 3,
                      'delivery': 1,
                      'trees': 17,
                      'northeast': 1,
                      "mother's": 3,
                      "they're": 4,
                      'lecturing': 1,
                      'Japanese': 3,
                      'exchange': 2,
                      'Frederick': 3,
                      'due': 4,
                      'Fletcher': 5,
                      'seeming': 2,
                      'warmed': 1,
                      'furniture': 1,
                      'sixty': 3,
                      'checked': 4,
                      'overcomes': 1,
                      'problem': 2,
                      'curtail': 1,
                      'unashamedly': 1,
                      'winter': 5,
                      'distance': 6,
                      'walking': 12,
                      'crashing': 1,
                      'silly': 2,
                      'insurmountable': 1,
                      'looking': 19,
                      'fourth': 6,
                      'embellished': 1,
                      'Rivers': 1,
                      'nuns': 1,
                      'Murray': 1,
                      'sixth': 1,
                      'suggest': 1,
                      'tasting': 2,
                      'British': 6,
                      'celebrate': 1,
                      'Time': 4,
                      'humor': 4,
                      "John's": 5,
                      'Smiling': 2,
                      'entranceway': 1,
                      "We're": 4,
                      'collected': 1,
                      'ungodly': 2,
                      'Big': 16,
                      'descending': 2,
                      'seahorse': 1,
                      'rebutted': 1,
                      'pointed': 9,
                      'wounded': 6,
                      'stoutly': 1,
                      'hey': 2,
                      'slowly': 14,
                      'unlocked': 1,
                      'place': 36,
                      'piece': 15,
                      'rude': 1,
                      'drunk': 6,
                      'rode': 5,
                      'Carrara': 1,
                      'tasted': 1,
                      'atmosphere': 3,
                      'blessed': 1,
                      'melting': 1,
                      'urgent': 2,
                      'track': 1,
                      'bait': 1,
                      'files': 1,
                      'Over': 3,
                      'concussion': 1,
                      'riffle': 1,
                      'instrument': 1,
                      'Heart': 1,
                      'could': 166,
                      "liar's": 1,
                      'enjoying': 2,
                      'Lawrence': 10,
                      'coffeecup': 1,
                      'Federal': 3,
                      'rich': 7,
                      'tap': 1,
                      'Wilson': 19,
                      'Ernest': 2,
                      'aloud': 5,
                      "Pa'd": 1,
                      'appreciation': 2,
                      'planting': 1,
                      'Along': 1,
                      'pride': 5,
                      'reprovingly': 1,
                      'tapping': 2,
                      'rattle': 1,
                      'Nassau': 1,
                      'rites': 2,
                      'Prickly': 1,
                      'Juan': 1,
                      'search': 2,
                      'berated': 1,
                      'ran': 23,
                      'wagons': 1,
                      'optimism': 2,
                      'meant': 7,
                      'pimples': 1,
                      'tiny': 4,
                      'Christianity': 1,
                      'forbids': 1,
                      'Thirty-four': 1,
                      'picture': 8,
                      "Pilate's": 1,
                      'bassinet': 1,
                      'sympathetic': 1,
                      'adapted': 1,
                      'gaunt': 2,
                      'buildings': 2,
                      'Immediately': 1,
                      'between': 13,
                      'cost': 3,
                      'host': 1,
                      'intimate': 3,
                      'beam': 1,
                      'seed': 2,
                      'converts': 1,
                      'eighth': 1,
                      'diagrams': 1,
                      'recognize': 1,
                      'marked': 1,
                      'Anderson': 1,
                      'wagon': 1,
                      'self-flagellation': 1,
                      'higher': 3,
                      "Lincoln's": 1,
                      'Dutch': 2,
                      'swells': 1,
                      'foliage': 1,
                      'transverse': 1,
                      'hearth': 3,
                      'Heights': 3,
                      'Until': 1,
                      'backed': 1,
                      'banner': 1,
                      'movers': 3,
                      'placid': 1,
                      'dismissed': 1,
                      'wall': 17,
                      'oughta': 1,
                      'told': 48,
                      'velvet': 2,
                      'distinctly': 1,
                      'wiping': 3,
                      'The': 369,
                      'heat': 6,
                      'downstairs': 3,
                      'coincidence': 1,
                      'Boy': 2,
                      'candles': 1,
                      'merited': 1,
                      'of': 1419,
                      'wrapper': 1,
                      'shores': 2,
                      'Defrost': 1,
                      'celebrated': 1,
                      'law': 2,
                      'stained': 1,
                      'Sleepy-eyed': 1,
                      "Gre't": 1,
                      'manservant': 1,
                      'police': 1,
                      'whole': 22,
                      'wrack': 1,
                      'rear': 3,
                      'impending': 1,
                      'metal': 5,
                      'yearned': 1,
                      'corner': 18,
                      'sworn': 1,
                      'wanna': 1,
                      'bold': 4,
                      'lived': 9,
                      'trestle': 1,
                      'protest': 5,
                      'spoiled': 1,
                      "Warren's": 1,
                      'roused': 1,
                      'synagogue': 1,
                      "ladies'": 1,
                      'maid': 6,
                      'numb': 1,
                      'stupidest': 1,
                      'facts': 1,
                      'recreation': 1,
                      'Hudson': 8,
                      'fried': 1,
                      'disastrous': 1,
                      'announcing': 1,
                      'craved': 1,
                      'clearness': 1,
                      'display': 1,
                      'Cover': 1,
                      'straightening': 1,
                      '7:25': 1,
                      'swing': 1,
                      'Supplies': 1,
                      'funny': 5,
                      'belligerently': 1,
                      'factory': 1,
                      'Carlson': 2,
                      'outside': 14,
                      'hung': 14,
                      'Stranger': 1,
                      'book': 5,
                      'balls': 2,
                      'suggested': 5,
                      'Ole': 3,
                      'barrack': 1,
                      'stirred': 1,
                      'jacket': 2,
                      'sullen': 2,
                      'intend': 1,
                      'Brigade': 1,
                      'presents': 1,
                      'tableau': 1,
                      'pump': 3,
                      'purpose': 5,
                      'broadened': 1,
                      'give': 22,
                      'clamped': 1,
                      'absolute': 2,
                      'boring': 1,
                      'dishes': 3,
                      'housing': 1,
                      'beauty': 3,
                      'moralistic': 1,
                      'coop': 2,
                      'ammunition': 3,
                      'flatly': 2,
                      'Calvin': 3,
                      'Coughlin': 5,
                      'protestations': 1,
                      'crest': 2,
                      'applying': 1,
                      'allow': 3,
                      'rag': 1,
                      'critical': 1,
                      'blushed': 3,
                      'library': 3,
                      'hawked': 1,
                      'anticipations': 1,
                      'cancer': 1,
                      'missing': 5,
                      'actually': 2,
                      'languishing': 1,
                      'Several': 1,
                      'rocked': 1,
                      'explanation': 2,
                      'replied': 2,
                      'Billy': 1,
                      'jabbed': 2,
                      'everywhere': 2,
                      'nicotine-choked': 1,
                      'questioning': 1,
                      'percolator': 1,
                      'young': 26,
                      'anyway': 3,
                      'sculpture': 5,
                      'bargain': 1,
                      'procurer': 1,
                      'authority': 2,
                      'Point': 2,
                      'Perhaps': 8,
                      'grace': 4,
                      'confused': 3,
                      'cherries': 1,
                      'football': 1,
                      'big': 36,
                      'Fowler': 1,
                      'midair': 1,
                      'Westwood': 1,
                      'fetid': 1,
                      'inspect': 1,
                      'grenades': 1,
                      'services': 1,
                      'nourished': 1,
                      'completely': 3,
                      'Waited': 1,
                      'C': 1,
                      'wooden': 4,
                      'cold': 15,
                      'opened': 17,
                      'augmented': 1,
                      'left': 41,
                      'spoonful': 1,
                      'Stroked': 1,
                      'handlebars': 1,
                      'Holy': 2,
                      'clip': 1,
                      'blast': 1,
                      'stood': 41,
                      'worst': 4,
                      'dreamin': 1,
                      'lipstick': 1,
                      'Demanded': 1,
                      'drinking': 4,
                      'misanthrope': 1,
                      'roughed': 1,
                      'tar': 1,
                      'street': 17,
                      'bow': 4,
                      'doorknob': 1,
                      'long': 53,
                      'sins': 2,
                      'utmost': 1,
                      'current': 2,
                      'senseless': 1,
                      'Huge': 1,
                      'skullcap': 1,
                      'cares': 1,
                      'valet': 2,
                      "H'all": 1,
                      'Rich': 2,
                      'compresses': 1,
                      'now': 100,
                      'composer': 8,
                      'peeked': 1,
                      'dramatic': 4,
                      'Calmly': 1,
                      'suddenly': 23,
                      'De': 6,
                      'gets': 3,
                      'specific': 1,
                      'wig': 1,
                      'bandaged': 2,
                      'lay': 20,
                      'loss': 3,
                      'entanglement': 1,
                      'three': 27,
                      'declaring': 1,
                      'shut': 7,
                      'Jee-sus': 1,
                      'cage': 2,
                      'demand': 1,
                      'stock': 7,
                      'privy': 1,
                      'Scrub': 1,
                      'downed': 1,
                      'San': 2,
                      'pokerfaced': 1,
                      'clearer': 1,
                      'pat': 1,
                      'pure': 1,
                      'extended': 1,
                      'patiently': 1,
                      'lips': 15,
                      'Rathbone': 2,
                      'joy': 5,
                      'detective': 3,
                      'car': 12,
                      'godless': 1,
                      'clambered': 3,
                      'cap': 4,
                      'grows': 1,
                      'sonofabitch': 1,
                      'science': 2,
                      'mackinaw': 1,
                      'be': 254,
                      'wealth': 1,
                      'names': 6,
                      'ready': 17,
                      'Mutton': 5,
                      'glistened': 1,
                      'Ten': 3,
                      'abundance': 1,
                      'forget': 2,
                      'posing': 1,
                      'heap': 2,
                      'engaged': 4,
                      'Fujimoto': 2,
                      'expected': 5,
                      'janitor': 1,
                      'minutes': 12,
                      'Treasury': 1,
                      'remote': 2,
                      'intertwined': 1,
                      'touching': 3,
                      'joined': 2,
                      "He's": 13,
                      'bullshit': 1,
                      'hush': 1,
                      'stalking': 1,
                      'tyrannis': 1,
                      'lettering': 1,
                      'slovenly': 3,
                      'covering': 2,
                      'afternoon': 17,
                      'teasing': 1,
                      'meditate': 1,
                      'status': 1,
                      'rut': 1,
                      'elapsed': 2,
                      'brandishing': 1,
                      'fail': 3,
                      'boarding': 1,
                      'burial': 3,
                      'weeds': 2,
                      'battling': 1,
                      'dignity': 2,
                      'Discourse': 3,
                      'force': 4,
                      'replaced': 1,
                      'overlook': 1,
                      'pipers': 1,
                      'sizes': 1,
                      'first-class': 1,
                      'hire': 1,
                      'wildly': 4,
                      "child's": 2,
                      'tepid': 1,
                      'lighting': 1,
                      'alone': 16,
                      'worshippers': 1,
                      'cent': 2,
                      'Dalloway': 3,
                      'beneath': 5,
                      'bullet': 2,
                      "You'd": 1,
                      'declaration': 1,
                      'into': 147,
                      'strife': 1,
                      'Sunday-school': 1,
                      'Fooled': 1,
                      'snoring': 5,
                      'countenance': 3,
                      'else': 15,
                      'Kahler': 7,
                      'let': 28,
                      'becoming': 5,
                      'May': 2,
                      'promising': 1,
                      'grovel': 1,
                      'necessity': 1,
                      'brother': 7,
                      'Somebody': 3,
                      'managed': 2,
                      'separating': 1,
                      'coyness': 1,
                      'relationship': 2,
                      'murder': 2,
                      'Rabbi': 4,
                      'hocking': 1,
                      'scarf': 2,
                      'mantrap': 1,
                      'sweeping': 2,
                      'associates': 1,
                      'calamities': 1,
                      'bawling': 1,
                      'cleat': 1,
                      'blinked': 3,
                      'ferociously': 1,
                      'length': 6,
                      'involved': 3,
                      'nothings': 1,
                      'straining': 1,
                      'style': 3,
                      'Soothsayer': 1,
                      'unmoved': 1,
                      'switch': 1,
                      'lentils': 1,
                      "Shakespeare's": 1,
                      'electric': 4,
                      'participated': 1,
                      'outdoors': 1,
                      'muttered': 5,
                      'Bud': 1,
                      'absurd': 2,
                      ')': 23,
                      'secrets': 1,
                      'attain': 2,
                      'hoist': 1,
                      'ledgers': 1,
                      'Zionists': 1,
                      'volunteered': 1,
                      'edges': 2,
                      'Otherwise': 1,
                      "I'll": 15,
                      'mosaic': 1,
                      'Turkish': 1,
                      'default': 1,
                      'substituted': 1,
                      'crunched': 1,
                      'Waldensian': 1,
                      'naked': 3,
                      'indefinitely': 1,
                      'merit': 1,
                      'growling': 1,
                      'clothes': 17,
                      'playful': 1,
                      'shutters': 4,
                      'tail': 1,
                      'released': 1,
                      'housework': 1,
                      'aggravated': 1,
                      'long-shanked': 1,
                      'letter': 11,
                      'hide': 3,
                      'feudal': 1,
                      'blueberry': 1,
                      'Its': 2,
                      'Slice': 2,
                      'picket': 1,
                      'parked': 2,
                      'from': 222,
                      'devilish': 1,
                      'jumble': 1,
                      'Ever': 2,
                      'Fat': 1,
                      'warning': 1,
                      'appearance': 4,
                      'Murder': 1,
                      'ball': 9,
                      'Mother': 2,
                      'peonies': 1,
                      'mantel': 1,
                      'melodies': 3,
                      'reality': 4,
                      'undo': 1,
                      'hamper': 2,
                      'instruction': 1,
                      'studied': 8,
                      'twenty-five': 2,
                      'appreciate': 1,
                      'Cleaned': 1,
                      'years': 44,
                      'sticks': 3,
                      'East': 6,
                      'probably': 6,
                      'voulez': 1,
                      'frequently': 1,
                      "Sunday's": 1,
                      'During': 4,
                      "Heat's": 1,
                      'Drury': 1,
                      'chinked': 1,
                      'belch': 1,
                      'tore': 2,
                      'swamps': 2,
                      'drunkenly': 1,
                      'or': 167,
                      'obscenities': 1,
                      'Law': 2,
                      'estates': 1,
                      "Don't": 10,
                      'fifteen': 3,
                      "Alix's": 1,
                      'likes': 1,
                      'city': 18,
                      'sex': 2,
                      'obstruct': 1,
                      'Looking': 2,
                      "'mon": 1,
                      'active': 2,
                      'sensing': 1,
                      'Heaven': 2,
                      'armful': 1,
                      'verandah': 1,
                      'backing': 2,
                      'Shelley': 5,
                      'easier': 2,
                      'thoughtfully': 1,
                      'drive': 5,
                      'general': 2,
                      'sprawled': 3,
                      'Sam': 1,
                      'Afterwards': 1,
                      'heart': 15,
                      'bleak': 2,
                      'national': 2,
                      'History': 1,
                      'ah': 1,
                      'specters': 1,
                      'tomorrow': 8,
                      'foreman': 1,
                      'excellences': 1,
                      'notes': 2,
                      'Myra': 3,
                      'corsage': 1,
                      'Andruses': 1,
                      'called': 28,
                      'opinions': 1,
                      'financed': 1,
                      'workshop': 1,
                      'boaters': 1,
                      'fictional': 1,
                      'year': 9,
                      'Majdan-Tartarski': 1,
                      "Seward's": 3,
                      'victory': 6,
                      'ginmill': 2,
                      'adjusted': 3,
                      'merry': 1,
                      'Pardon': 1,
                      'Too': 3,
                      'courage': 2,
                      'subdued': 1,
                      'red': 13,
                      'bring': 10,
                      'pitcher': 3,
                      'deals': 1,
                      'matriarch': 1,
                      'uneasy': 5,
                      'wonder': 2,
                      'near-absence': 1,
                      'objective': 3,
                      'solution': 1,
                      'directly': 2,
                      'born': 3,
                      'sharply': 2,
                      'makeshift': 1,
                      'Jew': 9,
                      'chance': 7,
                      'Corcoran': 1,
                      'black-market': 1,
                      'scuff': 1,
                      'spirits': 2,
                      'Methodist': 2,
                      'mutilated': 1,
                      'steeple': 9,
                      'Southern': 2,
                      'fellow': 5,
                      'halfway': 2,
                      'Market': 1,
                      'geneticist': 1,
                      'not': 290,
                      'perturbation': 1,
                      'sons': 1,
                      'taught': 4,
                      'fuss': 1,
                      'crook': 1,
                      'headquarters': 3,
                      'basket': 4,
                      'arthritis': 1,
                      'suffer': 1,
                      'minorities': 1,
                      'expiating': 1,
                      'soared': 1,
                      'Rebel': 4,
                      'Eyes': 1,
                      'herding': 1,
                      'pray': 3,
                      'college': 3,
                      'yawning': 2,
                      'climbed': 6,
                      'placed': 7,
                      "Rheinholdt's": 1,
                      'boat': 6,
                      'noble': 1,
                      'Buckra': 2,
                      'set': 17,
                      'sliding': 1,
                      'calumny': 1,
                      'glanced': 5,
                      'winked': 2,
                      'worried': 6,
                      'hated': 6,
                      'arrangement': 1,
                      'campus': 2,
                      'Rebs': 1,
                      'alongside': 1,
                      'contemptuously': 1,
                      'contrived': 1,
                      'crowd': 6,
                      'five': 14,
                      'Bowman': 1,
                      'draper': 1,
                      'Luisa': 1,
                      'common': 4,
                      'poems': 1,
                      'less': 2,
                      'crowed': 1,
                      "rooster's": 1,
                      'Moccasin': 1,
                      'movement': 8,
                      'decorator': 1,
                      'scraping': 1,
                      'thrusting': 1,
                      'dwindling': 1,
                      'united': 2,
                      'Geneva': 3,
                      'Vienot': 1,
                      'Warmongering': 1,
                      'deacon': 1,
                      'straight-backed': 1,
                      'smaller': 1,
                      'cheerful': 2,
                      'gravity': 1,
                      'protect': 1,
                      ...}),
             'government': Counter({'Without': 2,
                      'read': 4,
                      '$1.8': 2,
                      'taxing': 5,
                      'More': 8,
                      '23': 4,
                      'Anthony': 1,
                      'weekend': 1,
                      'entails': 1,
                      'speak': 3,
                      'located': 14,
                      'relocation': 1,
                      'avoiding': 2,
                      'grow': 3,
                      'education': 12,
                      'house': 4,
                      'promises': 1,
                      '1450': 1,
                      'skilled': 8,
                      'Buy': 1,
                      'thickness': 1,
                      'Covington': 1,
                      'tube': 1,
                      'Ballet': 5,
                      'travelers': 1,
                      'endorsement': 1,
                      '607-608': 1,
                      'automotive': 3,
                      'alleged': 2,
                      'individuals': 4,
                      'Between': 2,
                      'threshold': 1,
                      'OBE': 1,
                      'apparatus': 2,
                      'restrain': 2,
                      'operating': 20,
                      'Swift': 1,
                      'softwood': 1,
                      'use': 85,
                      'Procedure': 1,
                      'antennae': 1,
                      'shifts': 1,
                      'lodging': 1,
                      'experience': 10,
                      'applies': 3,
                      'served': 8,
                      'committee': 10,
                      '139': 1,
                      'uncertified': 1,
                      'sports': 5,
                      'population': 12,
                      'Force': 6,
                      'warrants': 2,
                      'attracted': 2,
                      'has': 153,
                      'controlled': 6,
                      'thoroughly': 4,
                      'February': 4,
                      "day's": 1,
                      'maintenance': 22,
                      'when': 56,
                      'our': 144,
                      'along': 7,
                      'procedural': 4,
                      'economics': 2,
                      'Land-based': 1,
                      'certain': 18,
                      'season': 3,
                      'God': 3,
                      'inward': 1,
                      'Reine': 1,
                      'Specifically': 1,
                      'Budgeting': 1,
                      'others': 20,
                      'republic': 2,
                      'budgets': 1,
                      'editorial': 2,
                      'manufacturers': 9,
                      'more': 118,
                      'anyone': 2,
                      'Stamford': 2,
                      'transition': 12,
                      'Laboratory': 6,
                      'contributes': 2,
                      'resolve': 2,
                      'beautiful': 2,
                      'averaged': 2,
                      'mailing': 1,
                      'Electronic': 2,
                      'marriages': 1,
                      'unabated': 1,
                      'initiating': 1,
                      'first': 58,
                      'subsequent': 4,
                      'self-respect': 1,
                      'wires': 2,
                      'related': 12,
                      'developed': 32,
                      'Christiana': 9,
                      'introducing': 1,
                      'films': 2,
                      'sorted': 1,
                      'adding': 1,
                      'cosponsored': 1,
                      'imposition': 1,
                      'company': 11,
                      'constituents': 1,
                      'Radiation': 2,
                      'wall': 6,
                      'page': 10,
                      'greatness': 1,
                      'determine': 15,
                      'Soccer': 2,
                      'under': 89,
                      'Tumor': 4,
                      'correct': 1,
                      'assigns': 2,
                      'corporation': 12,
                      'close': 21,
                      'metal': 9,
                      'roughly': 2,
                      'microscopes': 1,
                      'kitchen': 1,
                      'amount': 43,
                      'NAIR': 1,
                      'blunt': 2,
                      'orders': 7,
                      'tables': 2,
                      'virile': 1,
                      'sue': 1,
                      'ship': 1,
                      'planes': 3,
                      'arranges': 1,
                      'Surgery': 2,
                      'pressure': 11,
                      'seldom': 1,
                      'Md.': 1,
                      '60%': 1,
                      'scientists': 1,
                      'directed': 15,
                      'diminishes': 1,
                      'amended': 11,
                      'Keynotes': 1,
                      'priority': 5,
                      'man': 12,
                      'evils': 3,
                      '606': 1,
                      'draft': 1,
                      'Valley': 1,
                      'sunspot': 1,
                      'action-oriented': 1,
                      'through': 56,
                      'Section': 33,
                      'Following': 2,
                      'anew': 1,
                      'Mussolinis': 1,
                      'reads': 2,
                      'specific': 9,
                      'each': 71,
                      'undue': 1,
                      'Because': 4,
                      'Delaware': 13,
                      'proclamation': 6,
                      'boon': 1,
                      'motion': 1,
                      'Justice': 11,
                      'waters': 2,
                      '1.8': 2,
                      'plain': 1,
                      'forget': 1,
                      'resolute': 1,
                      'burden': 5,
                      'controls': 4,
                      'vegetation': 1,
                      'molecular': 1,
                      'favored': 1,
                      'balanced': 5,
                      'wisdom': 2,
                      'annually': 3,
                      'excludes': 2,
                      'raised': 3,
                      'love': 1,
                      'rise': 3,
                      'approval': 11,
                      'commemorating': 1,
                      'mileage': 8,
                      'sewing': 5,
                      'repeated': 1,
                      'eventually': 5,
                      'Center': 3,
                      'Players': 2,
                      'eternal': 1,
                      'features': 7,
                      'situations': 2,
                      'thereunder': 1,
                      'service': 68,
                      'Haven': 1,
                      'hard': 3,
                      'At': 18,
                      ')': 345,
                      'seas': 3,
                      'Coordination': 1,
                      'mastery': 1,
                      'spindle': 2,
                      'merit': 3,
                      'vessels': 3,
                      'colleagues': 7,
                      'merges': 1,
                      'april': 1,
                      'exigencies': 1,
                      'wire': 4,
                      'Those': 1,
                      'ought': 1,
                      'enlist': 1,
                      '1868': 2,
                      'rejection': 1,
                      '1776': 1,
                      'facility': 3,
                      '125th': 1,
                      'drop': 1,
                      'career': 7,
                      'idly': 1,
                      'calculated': 1,
                      'cause': 13,
                      'friends': 2,
                      'roll': 1,
                      'integral': 2,
                      "pool's": 2,
                      'battery-powered': 2,
                      'incipient': 1,
                      'correlation': 1,
                      'acceptable': 1,
                      'manage': 1,
                      'treasury': 1,
                      'consented': 1,
                      'purchasing': 6,
                      'small': 54,
                      'removal': 1,
                      'Miss': 7,
                      'Times': 11,
                      '6-inch': 1,
                      'hostility': 1,
                      'performances': 1,
                      'continue': 10,
                      'genuinely': 1,
                      'discharged': 2,
                      'warning': 1,
                      'revolutions': 1,
                      'weather': 2,
                      'forward': 8,
                      'recession': 1,
                      'generates': 1,
                      'recently': 10,
                      'tournament': 1,
                      'Macropathology': 1,
                      'distinct': 3,
                      'built-in': 1,
                      'allocable': 3,
                      'legislation': 4,
                      'next': 13,
                      'Registry': 5,
                      'p.': 1,
                      'incurred': 4,
                      "Brown's": 9,
                      'strike': 1,
                      'revenue': 4,
                      'searching': 1,
                      'examined': 1,
                      'appropriating': 1,
                      'really': 1,
                      'peoples': 2,
                      'broad': 11,
                      'lends': 1,
                      'Tube': 1,
                      'passed': 2,
                      'external': 4,
                      'contention': 2,
                      'understanding': 13,
                      'amazement': 1,
                      'mountains': 1,
                      'rushed': 1,
                      'borrowed': 2,
                      'efforts': 17,
                      'signals': 7,
                      'effect': 15,
                      'mountainous': 1,
                      'shielding': 4,
                      'heretofore': 2,
                      'selected': 9,
                      'clothed': 1,
                      'reception': 4,
                      'Anything': 1,
                      'Uses': 1,
                      'ad': 1,
                      'sciences': 1,
                      'embodying': 1,
                      'shift': 5,
                      'King': 1,
                      'Transition': 1,
                      'refund': 8,
                      'Open-flame': 1,
                      'recommending': 4,
                      'family': 10,
                      'physiological': 1,
                      'Lighting': 2,
                      '64-page': 1,
                      'evening': 1,
                      'characteristically': 1,
                      "nation's": 2,
                      'lessen': 2,
                      'heater': 2,
                      'double-wall': 4,
                      'Joseph': 5,
                      'participates': 4,
                      'merits': 1,
                      'define': 2,
                      'Bureau': 10,
                      'daylight': 1,
                      'wives': 1,
                      '106,500': 1,
                      'destructive': 8,
                      'needed': 20,
                      'late': 6,
                      'open': 12,
                      'Communism': 3,
                      'Waltham': 1,
                      'thyratron': 1,
                      'intends': 1,
                      'ask': 3,
                      'invested': 3,
                      'Mediterranean': 1,
                      'found': 18,
                      'Witnesses': 2,
                      'dissent': 1,
                      'overgrazing': 1,
                      'Cement': 1,
                      'Nevertheless': 3,
                      'Major': 1,
                      'machinist': 1,
                      "one's": 3,
                      ':': 110,
                      'cosponsors': 1,
                      'sixty-one': 7,
                      '72': 1,
                      'Membership': 2,
                      'Manuscript': 2,
                      'schedule': 7,
                      'Ventilation': 2,
                      'scholarships': 1,
                      'supported': 3,
                      'foodstuffs': 1,
                      'things': 5,
                      'mycobacteria': 1,
                      'thank': 2,
                      'can': 117,
                      'oratorical': 1,
                      'department': 8,
                      'independent': 7,
                      'similarly': 2,
                      'trailers': 4,
                      'duces': 1,
                      '348': 4,
                      'permission': 1,
                      'divided': 5,
                      'Hall': 3,
                      'Oral': 2,
                      'poured': 1,
                      'manned': 3,
                      "Churchill's": 1,
                      '18': 5,
                      'fleet': 3,
                      'appreciative': 1,
                      'spur': 1,
                      'skeptically': 1,
                      'maps': 3,
                      'cities': 26,
                      'investigations': 2,
                      "Motors'": 3,
                      'advisers': 1,
                      'assessed': 1,
                      'aspire': 1,
                      'formal': 1,
                      'documents': 2,
                      'feelings': 1,
                      'constituted': 1,
                      'intermolecular': 1,
                      'activities': 35,
                      'Early': 1,
                      'great-grandson': 1,
                      'voting': 3,
                      'twister-coners': 1,
                      'Welfare': 1,
                      'leave': 4,
                      'York': 33,
                      'chairman': 2,
                      'signal': 5,
                      'vehicles': 37,
                      '1959': 34,
                      'importance': 17,
                      'power': 17,
                      'leprae': 1,
                      'isolate': 2,
                      'Practical': 1,
                      'apportion': 1,
                      'Westerly': 1,
                      'classes': 7,
                      'any': 142,
                      'executives': 2,
                      'computes': 2,
                      "attorney's": 2,
                      'tended': 1,
                      'factors': 11,
                      'deductible': 1,
                      'scope': 6,
                      'Hence': 2,
                      'outset': 1,
                      'dying': 1,
                      'observe': 1,
                      'perhaps': 4,
                      'required': 35,
                      'talent': 2,
                      'emphasis': 7,
                      'meaningless': 1,
                      'wages': 8,
                      'gross': 4,
                      'Merchant': 4,
                      'Lord': 9,
                      'Division': 25,
                      'crews': 1,
                      '1961': 55,
                      'breadth': 1,
                      '6th': 2,
                      '1680': 1,
                      'resource': 2,
                      'implementation': 1,
                      'discussed': 2,
                      'persons': 9,
                      'experimental': 1,
                      'East': 3,
                      'I.': 3,
                      'equity': 1,
                      'K': 1,
                      'probably': 13,
                      'Atlantic': 3,
                      'shutter': 5,
                      'ordinary': 5,
                      'Projects': 1,
                      'importantly': 2,
                      'addresses': 1,
                      'effected': 2,
                      'Newark': 1,
                      'compilation': 1,
                      'dimension': 3,
                      'uncomfortable': 1,
                      'thousand': 10,
                      'File': 1,
                      'Planning': 7,
                      'govern': 3,
                      'themselves': 19,
                      'developing': 11,
                      'sketch': 1,
                      'academic': 6,
                      'occur': 1,
                      'Shelter': 1,
                      'music': 6,
                      'visit': 1,
                      'experiences': 2,
                      'moral': 1,
                      'simultaneously': 2,
                      'manmade': 1,
                      'compress': 1,
                      'interference': 31,
                      'fine': 1,
                      'remainder': 2,
                      'trustees': 2,
                      'Banker': 1,
                      'showed': 3,
                      'beachhead': 1,
                      'specially': 4,
                      'dropped': 2,
                      'parties': 3,
                      'Virgil': 1,
                      'energies': 2,
                      'terminal': 1,
                      'Mexico': 2,
                      'newsletter': 3,
                      'Forensic': 6,
                      'semi-conductors': 1,
                      '16.7': 1,
                      'unnecessary': 2,
                      'repaid': 2,
                      'her': 3,
                      'Chapter': 11,
                      'bottom': 2,
                      'dysplasia': 1,
                      'fuel': 8,
                      'continuous': 4,
                      'role': 6,
                      'polls': 1,
                      'longer': 4,
                      '328': 1,
                      'induction': 2,
                      'automobiles': 14,
                      'illustration': 3,
                      'correctly': 1,
                      'cost-billing': 1,
                      'Manchester': 3,
                      'domestic': 16,
                      'band': 2,
                      'trends': 1,
                      'visited': 1,
                      'Manufacturers': 3,
                      'ministries': 1,
                      'exploding-wire': 1,
                      'steam': 1,
                      'Also': 4,
                      'radio': 15,
                      'societies': 3,
                      'delivery': 1,
                      'treaties': 1,
                      "petitioner's": 2,
                      'expansions': 3,
                      'Some': 12,
                      'ill': 1,
                      'brochures': 1,
                      'scales': 2,
                      'exchange': 7,
                      'exploding': 3,
                      'due': 29,
                      'considering': 1,
                      'grand': 1,
                      'Fletcher': 1,
                      'selection': 7,
                      'borne': 1,
                      'devastating': 2,
                      'establishments': 1,
                      'Revise': 1,
                      'contraband': 1,
                      'Russian': 6,
                      'checked': 1,
                      'serve': 19,
                      'manpower': 6,
                      '$18.9': 1,
                      'problem': 34,
                      'curtail': 1,
                      'U.s.': 1,
                      'winter': 1,
                      'distance': 3,
                      'Despite': 2,
                      'continuously': 3,
                      'support': 31,
                      'percentage': 16,
                      'allocation': 9,
                      'stage': 4,
                      'Evening': 1,
                      'Yugoslavia': 2,
                      'Purchases': 1,
                      'Trustees': 7,
                      'Catherine': 1,
                      'approach': 13,
                      'reflect': 5,
                      'looking': 3,
                      'finance': 10,
                      'commission': 2,
                      'Rivers': 1,
                      'silicon': 1,
                      'denied': 11,
                      'sugar': 1,
                      'Journal': 11,
                      'Pestle': 1,
                      'thanks': 3,
                      'atmospheres': 3,
                      'manufacture': 9,
                      'British': 3,
                      'hereby': 6,
                      'trading': 1,
                      'moderately': 1,
                      'Time': 1,
                      'past': 13,
                      'incipiency': 2,
                      'Kaisers': 1,
                      'periods': 8,
                      'picnics': 1,
                      'fibers': 2,
                      'tubing': 1,
                      'qualified': 5,
                      'delegation': 1,
                      'exclusive': 3,
                      'collected': 4,
                      'Varying': 1,
                      'attract': 2,
                      'unexpended': 1,
                      'rebutted': 1,
                      'pointed': 3,
                      'infestation': 1,
                      'realized': 4,
                      'determination': 12,
                      'jeopardize': 1,
                      'same': 51,
                      'Americas': 1,
                      'Clinico-pathologic': 3,
                      'servicing': 1,
                      'place': 30,
                      'piece': 3,
                      'foreknowledge': 1,
                      'Education': 3,
                      'vents': 1,
                      'hardly': 1,
                      'committees': 6,
                      'bombers': 6,
                      'variables': 1,
                      'Stalins': 1,
                      'reform': 1,
                      'substantiates': 1,
                      'buildup': 1,
                      'trend': 3,
                      'Communist': 4,
                      'urgent': 5,
                      'track': 2,
                      'mandamus': 1,
                      'safety': 4,
                      '$184': 1,
                      'state-owned': 12,
                      '162,400': 1,
                      'greater': 22,
                      'A.M.A.': 1,
                      'therefore': 19,
                      'expend': 1,
                      'remain': 5,
                      'files': 1,
                      'Over': 2,
                      'three-dimensional': 1,
                      'instrument': 3,
                      'component': 3,
                      'could': 38,
                      'framing': 1,
                      'forte': 1,
                      'Smith': 2,
                      'Lawrence': 1,
                      'conform': 1,
                      'Federal': 43,
                      'rich': 2,
                      'stopped': 1,
                      '10.4': 1,
                      'elaborate': 2,
                      'planting': 1,
                      'Rusk': 1,
                      'creating': 3,
                      'pride': 3,
                      'sequence': 2,
                      'tapping': 1,
                      'Courses': 2,
                      'N.Y.': 2,
                      'wise': 2,
                      'monopolize': 2,
                      'Soviet': 5,
                      'intereference': 1,
                      '1': 107,
                      'Dr.': 7,
                      'optimism': 2,
                      'Cars': 2,
                      'Heywood': 2,
                      'meant': 1,
                      'Monroe': 5,
                      'Refund': 1,
                      'integration': 1,
                      'Prior': 3,
                      'rupees': 14,
                      'sentenced': 1,
                      'utensils': 1,
                      'retains': 1,
                      'endow': 1,
                      'post-World': 1,
                      'lengthened': 1,
                      'inventory': 14,
                      'Rico': 18,
                      'contain': 3,
                      'underground': 4,
                      'guarding': 1,
                      'prevents': 1,
                      'substantially': 13,
                      'for': 806,
                      'adapted': 1,
                      'performance': 10,
                      'count': 2,
                      'calorimeter': 2,
                      'slipped': 1,
                      'equally': 3,
                      'buildings': 5,
                      'between': 44,
                      'rare': 2,
                      'amici': 2,
                      'cost': 34,
                      'original': 8,
                      'host': 7,
                      'reading': 3,
                      'welding': 1,
                      '185th': 1,
                      'corrosive': 1,
                      'Richmond': 3,
                      'heads': 1,
                      'Inter-American': 1,
                      'External': 2,
                      'employee': 5,
                      'report': 39,
                      'ring': 1,
                      'extinction': 1,
                      'subscription': 1,
                      'dweller': 1,
                      'appear': 3,
                      'Instrument': 1,
                      'Roman': 3,
                      'Atlas': 6,
                      'recognize': 2,
                      'improve': 3,
                      'World': 7,
                      'Body': 2,
                      'inhabitants': 1,
                      '603': 1,
                      '$15': 1,
                      'Greene': 1,
                      'higher': 14,
                      '270,000': 1,
                      'Pakistan': 1,
                      'Vice': 2,
                      'diplomatic': 1,
                      'Embassy': 1,
                      '$10.1': 1,
                      'eyes': 3,
                      'Security': 8,
                      "Sprague's": 1,
                      'filled': 5,
                      'energy': 7,
                      'simple': 5,
                      'Harvard': 1,
                      'Until': 1,
                      'Airpark': 1,
                      'student-directed': 1,
                      'marketing': 7,
                      'handicapped': 2,
                      'rehearsals': 1,
                      'retained': 1,
                      'fruit': 1,
                      'sixties': 1,
                      'dismissed': 2,
                      'oppression': 1,
                      'Wildlife': 1,
                      'composers': 1,
                      'they': 92,
                      'done': 6,
                      'delivered': 4,
                      'discovery': 1,
                      'Provost': 1,
                      'civic': 4,
                      'banks': 4,
                      'may': 153,
                      'The': 478,
                      'casual': 1,
                      'promise': 1,
                      'D.': 6,
                      'gases': 2,
                      'cycled': 1,
                      '$.027': 2,
                      'Boston': 4,
                      'Pooling': 1,
                      'Pageants': 2,
                      'cm': 1,
                      'of': 3031,
                      'Addabbo': 2,
                      'Application': 4,
                      'materially': 1,
                      '542,250': 1,
                      'celebrated': 1,
                      '47': 1,
                      'redistributed': 1,
                      'law': 21,
                      '$.07': 3,
                      'abroad': 19,
                      '1920s': 1,
                      'nearing': 2,
                      'Illustrations': 1,
                      'defense': 17,
                      'quarter': 1,
                      'combined': 3,
                      'Selling': 1,
                      'acting': 2,
                      'observed': 3,
                      'sixteen': 1,
                      'noticeable': 1,
                      'White': 1,
                      'felt': 6,
                      'private': 12,
                      'corner': 3,
                      'contribution': 6,
                      'Forty-seven': 1,
                      'Migs': 1,
                      'Low': 1,
                      'wars': 1,
                      'Rehabilitation': 2,
                      'seized': 1,
                      'spoiled': 1,
                      'directionally': 1,
                      'drivers': 2,
                      'lack': 8,
                      'together': 8,
                      "producers'": 1,
                      'High': 1,
                      'compute': 1,
                      'unchallenged': 1,
                      'low': 5,
                      'facts': 7,
                      'recreation': 8,
                      'microwaves': 1,
                      'musical': 3,
                      'cetera': 1,
                      'assembled': 4,
                      'Subsequent': 1,
                      'survival': 1,
                      'L.': 4,
                      "Armada's": 1,
                      'B-47': 1,
                      'reflects': 3,
                      'pre-1960': 1,
                      'display': 2,
                      'documentary': 1,
                      'faster': 1,
                      'too': 13,
                      'thermoplastic': 1,
                      'jacketed': 1,
                      'quirk': 1,
                      'expended': 3,
                      'exciting': 1,
                      'Theater': 1,
                      'factory': 1,
                      'entry': 2,
                      'overhead': 1,
                      'undigested': 1,
                      'Remarks': 4,
                      'outside': 16,
                      'general': 34,
                      'astrophysics': 2,
                      'arc': 3,
                      'faces': 1,
                      'relation': 7,
                      'affect': 1,
                      'aids': 6,
                      'upheld': 1,
                      'sets': 23,
                      'following': 16,
                      'Adjusting': 1,
                      'Leesona-Holt': 2,
                      'calendar': 20,
                      'Basic': 2,
                      'event': 7,
                      'encouragement': 2,
                      'employees': 25,
                      'stabilization': 2,
                      'Washington': 29,
                      'bearer': 1,
                      'mentions': 1,
                      'subtracting': 2,
                      'decree': 1,
                      'recommendation': 15,
                      'Rural': 2,
                      'how': 16,
                      'publishes': 1,
                      'towns': 29,
                      'International': 5,
                      'Musical': 2,
                      'occupants': 1,
                      'marking': 1,
                      'presents': 1,
                      'watchmaker': 1,
                      'above': 35,
                      'fees': 5,
                      'purpose': 22,
                      'relative': 5,
                      'olim': 1,
                      'broadened': 1,
                      'give': 21,
                      'one-twentieth': 1,
                      'dire': 1,
                      'absolute': 7,
                      'Thus': 9,
                      'membership': 3,
                      'assemble': 1,
                      'successfully': 3,
                      'medieval': 1,
                      'portable': 1,
                      'enormous': 1,
                      'Splinting': 1,
                      'Indirectly': 1,
                      'practice': 11,
                      'housing': 2,
                      'beauty': 1,
                      'minute': 3,
                      'ammunition': 1,
                      'unsuccessful': 1,
                      'tunnel': 1,
                      'dealers': 1,
                      'Amendment': 2,
                      'Richard': 1,
                      'thermometer': 6,
                      'semester': 3,
                      'Continuation': 1,
                      'mapping': 5,
                      'mutual': 2,
                      'solid-fueled': 1,
                      'statistics': 2,
                      'Chambers': 1,
                      'applying': 1,
                      'allow': 3,
                      'critical': 7,
                      'library': 2,
                      'fertilizer': 1,
                      'payments': 27,
                      'invariably': 1,
                      'Leading': 1,
                      'identified': 2,
                      '10-year': 5,
                      'withholding': 5,
                      'inadequacy': 1,
                      'performing': 3,
                      'actually': 2,
                      'advice': 10,
                      'Several': 2,
                      'Paragraph': 1,
                      'explanation': 2,
                      'priced': 1,
                      'coveted': 1,
                      'overload': 1,
                      'Investigation': 5,
                      'sea': 2,
                      'public': 43,
                      'thought': 7,
                      'camera': 3,
                      'employed': 7,
                      'Very': 1,
                      'corrugated': 1,
                      'on': 401,
                      'response': 5,
                      'conviction': 4,
                      'upgrading': 2,
                      'favorable': 6,
                      'authority': 14,
                      'Our': 20,
                      'Perhaps': 3,
                      'confused': 1,
                      'missile': 9,
                      'big': 4,
                      'plant': 30,
                      'commutation': 1,
                      'earnestly': 1,
                      'long-term': 15,
                      'zinc': 4,
                      'quantities': 2,
                      'inspect': 3,
                      'Agreement': 25,
                      'Saddle': 2,
                      'Vapor': 1,
                      'clearly': 5,
                      'completely': 4,
                      'abides': 1,
                      'Paced': 1,
                      'C': 28,
                      'wooden': 2,
                      'civilization': 1,
                      'cold': 2,
                      'financial': 28,
                      'notes': 13,
                      'coverage': 3,
                      'Expenditures': 2,
                      'ration': 1,
                      'Employee': 1,
                      'Cambridge': 1,
                      'normally': 1,
                      'utterly': 2,
                      'repayable': 1,
                      'hip': 1,
                      'twenty-three': 2,
                      'mere': 1,
                      '1833': 1,
                      'identifying': 2,
                      'warm': 2,
                      'proclaim': 7,
                      'stood': 2,
                      'worst': 4,
                      "Railroad's": 1,
                      'transitions': 2,
                      'lend': 1,
                      'reminding': 2,
                      'contracted': 2,
                      'impact': 5,
                      'desks': 1,
                      'physical': 6,
                      'Heritage': 3,
                      'long': 16,
                      'specimens': 2,
                      'tooth': 3,
                      'current': 21,
                      ...}),
             'hobbies': Counter({'Without': 4,
                      'vacuum': 4,
                      'festival': 1,
                      'shimmering': 1,
                      '25-cents': 1,
                      'dig': 1,
                      '23': 2,
                      'Outside': 3,
                      'lighted': 1,
                      'anyhow': 1,
                      'speak': 2,
                      'located': 12,
                      'economics': 1,
                      'infectious': 3,
                      'practising': 1,
                      'education': 13,
                      'house': 47,
                      'displaced': 1,
                      'promises': 1,
                      'skilled': 3,
                      'muscles': 9,
                      'escutcheons': 1,
                      'non-job-connected': 1,
                      'thickness': 13,
                      'darling': 1,
                      'tube': 5,
                      'building': 26,
                      'waters': 3,
                      'helpless': 1,
                      'Minnie': 1,
                      'Between': 2,
                      'More': 5,
                      'threshold': 1,
                      'monomer': 1,
                      'junction': 1,
                      'restrain': 1,
                      'operating': 10,
                      '90-degree': 2,
                      'use': 90,
                      'crags': 1,
                      'Limit': 1,
                      'experience': 20,
                      'served': 10,
                      'Raceway': 2,
                      'octagonal': 1,
                      'equalize': 1,
                      'xenon': 1,
                      'bars': 23,
                      'Girl': 1,
                      'population': 3,
                      'Sr.': 2,
                      'tempted': 1,
                      'overhangs': 2,
                      'thoroughly': 9,
                      'February': 6,
                      'Presto': 1,
                      'gaggle': 1,
                      'Much': 1,
                      'Renaults': 1,
                      'worth-while': 1,
                      'certain': 16,
                      'season': 9,
                      "NRLDA's": 1,
                      'shade': 6,
                      'inspector': 2,
                      'others': 22,
                      'stronghold': 1,
                      'vicinity': 2,
                      'manufacturers': 15,
                      'grow': 9,
                      'strains': 1,
                      'transition': 1,
                      'Only': 5,
                      'catch': 5,
                      'handiest': 1,
                      'beautiful': 10,
                      'amazing': 1,
                      'shorter': 3,
                      "They're": 6,
                      'Greenleaf': 1,
                      'Mar.': 1,
                      'well-designed': 1,
                      'denying': 1,
                      'Frisco': 1,
                      'lapses': 1,
                      'breather': 1,
                      'retains': 1,
                      'Eighty': 1,
                      '15,500-lb.': 1,
                      'releasing': 1,
                      'cosponsored': 1,
                      'company': 31,
                      'Enough': 1,
                      'Stevens': 2,
                      'mon': 1,
                      'chic': 1,
                      'greatness': 2,
                      'determine': 15,
                      'foreleg': 1,
                      'Limits': 1,
                      'correct': 8,
                      'toughest': 1,
                      'corporation': 2,
                      'predetermined': 1,
                      'cups': 5,
                      'kitchen': 9,
                      'Debonnie': 3,
                      'amount': 13,
                      'crack': 1,
                      'Called': 2,
                      'Seeing': 1,
                      'Beccaria': 2,
                      'Md.': 1,
                      'cheek': 1,
                      'Challenge': 1,
                      'lets': 2,
                      'undertow': 1,
                      'serratus': 2,
                      'reinforcing': 1,
                      'through': 77,
                      'Following': 5,
                      'company-paid': 1,
                      'Buckhannon': 1,
                      'Apple': 1,
                      'Just': 7,
                      'specific': 9,
                      'expendable': 1,
                      'each': 82,
                      'loosening': 1,
                      'Because': 11,
                      'Plus': 1,
                      'Coffee': 2,
                      'ink': 1,
                      'tons': 3,
                      'Dennis': 1,
                      'eye-filling': 1,
                      "grocer's": 1,
                      'Physique': 1,
                      'forget': 7,
                      'apparently': 2,
                      'sturgeon': 1,
                      'Shade': 1,
                      'craft': 6,
                      'fernery': 1,
                      'cottage': 3,
                      'tongue-in-cheek': 1,
                      'raised': 6,
                      'magnificence': 1,
                      'stray': 1,
                      'extent': 4,
                      'probability': 3,
                      'stream': 6,
                      'encompasses': 1,
                      'Half': 1,
                      'buggy': 1,
                      'fabulous': 2,
                      'suburbs': 1,
                      'Figs.': 4,
                      'situations': 3,
                      'Haven': 4,
                      'fences': 2,
                      'At': 25,
                      'lb-plus': 1,
                      "I'm": 2,
                      'focused': 1,
                      'economizing': 1,
                      'vessels': 3,
                      'jewels': 1,
                      'balconies': 1,
                      'scoop': 1,
                      'Those': 6,
                      'surprisingly': 2,
                      'treasures': 1,
                      'Nelson': 1,
                      'drop': 4,
                      'thermoforming': 1,
                      'career': 3,
                      'cartilage': 1,
                      'husband': 1,
                      'friends': 8,
                      'chop': 2,
                      'twists': 3,
                      'integral': 1,
                      'shouting': 1,
                      'correlation': 1,
                      'kiloton': 1,
                      'slip': 8,
                      'acetonemia': 3,
                      "patient's": 1,
                      'mattresses': 1,
                      'terry': 1,
                      'small': 55,
                      'bull': 2,
                      'jam': 1,
                      'enemy': 1,
                      'amuse': 2,
                      'lifters': 4,
                      'understatement': 1,
                      'Hanover-Misty': 1,
                      'skyline': 1,
                      'snag': 2,
                      'knife': 3,
                      'drilled': 3,
                      'missiles': 9,
                      'recently': 11,
                      'earthy': 1,
                      'landmarks': 3,
                      'built-in': 1,
                      'sportsmen': 6,
                      'Heel-Holiday': 1,
                      'next': 19,
                      'mustard': 12,
                      'strike': 3,
                      'screens': 1,
                      'shall': 5,
                      'searching': 1,
                      'e.g.': 2,
                      'really': 14,
                      'peoples': 2,
                      'broad': 6,
                      'Foliage': 1,
                      'external': 2,
                      'inflicted': 1,
                      'Eddie': 2,
                      'soup': 3,
                      'signals': 7,
                      'effect': 12,
                      'Pete': 1,
                      'exact': 6,
                      'stitch': 2,
                      'artists': 3,
                      'advantages': 3,
                      'lag': 1,
                      'hammerless': 1,
                      'Jacky': 1,
                      'masculine': 3,
                      'evening': 4,
                      'twelve': 4,
                      'Gables': 1,
                      'impersonated': 1,
                      'furbishing': 1,
                      'Joseph': 2,
                      'Drawn': 1,
                      'keen': 2,
                      'KCs': 1,
                      'relax': 1,
                      'Bureau': 3,
                      'P': 1,
                      'colts': 4,
                      'Health': 2,
                      'three-bedroom': 1,
                      "Prokofieff's": 6,
                      'life-long': 1,
                      '``': 227,
                      'assets': 1,
                      'ask': 2,
                      'pleasures': 1,
                      'found': 33,
                      'pegging': 1,
                      'profile': 1,
                      'owning': 2,
                      'win': 9,
                      'read': 8,
                      'domes': 3,
                      'pretty': 8,
                      'scholarships': 1,
                      'barges': 1,
                      "clown's": 1,
                      'things': 23,
                      'Now': 14,
                      'job': 35,
                      'tranquilizer': 1,
                      'Anders': 2,
                      'department': 13,
                      'provision': 3,
                      'independent': 3,
                      'boatels': 1,
                      'divided': 6,
                      'suppose': 2,
                      'feed': 84,
                      'Bent-Arm': 1,
                      'vertical': 7,
                      'bits': 2,
                      'tops': 2,
                      '18': 3,
                      'darkest': 1,
                      'cocked': 1,
                      'spur': 1,
                      'stops': 1,
                      'sir': 1,
                      'region': 5,
                      'bat': 2,
                      'Acadia': 1,
                      'LP': 1,
                      'pricing': 2,
                      'our': 77,
                      'German': 6,
                      'Molly': 1,
                      'Acorns': 1,
                      'dissolved': 1,
                      'signal': 14,
                      'vehicles': 3,
                      '1959': 4,
                      'Pullover': 2,
                      'managing': 1,
                      'Traditionalist': 1,
                      'Discovery': 2,
                      'taper': 1,
                      'shoot': 6,
                      'improves': 7,
                      'floured': 1,
                      'fabric': 1,
                      'deductible': 1,
                      'spacecraft': 3,
                      'unrifled': 1,
                      'Bright': 1,
                      'employee-contributed': 1,
                      'outset': 2,
                      'unaccountably': 1,
                      'cycles': 1,
                      'required': 15,
                      'Sammy': 1,
                      'wages': 1,
                      'Division': 6,
                      '1961': 9,
                      'Beautiful': 1,
                      'strip': 8,
                      'breadth': 3,
                      'gloss': 1,
                      'hunting': 17,
                      'Beach': 2,
                      'run': 23,
                      'discussed': 1,
                      'stores': 2,
                      'sentimentality': 1,
                      'pressure-sensing': 1,
                      'diarrhea': 7,
                      'wooded': 3,
                      'importantly': 1,
                      'runabout': 1,
                      'wire': 5,
                      'diameters': 1,
                      'feather': 1,
                      'dimension': 2,
                      'install': 5,
                      'handyman': 2,
                      'themselves': 17,
                      'developing': 5,
                      'academic': 10,
                      'dome': 7,
                      'timbre': 1,
                      'valves': 2,
                      'experiences': 3,
                      'eligible': 3,
                      'stumps': 1,
                      'positive': 1,
                      'Santa': 10,
                      'lawsuits': 1,
                      'now-famous': 1,
                      'provoke': 1,
                      'horse-radish': 2,
                      'Nina': 1,
                      'sizzling': 1,
                      'speaks': 1,
                      'anyone': 8,
                      'vies': 1,
                      'reader': 2,
                      'shank': 1,
                      'four-sided': 1,
                      '13/16-inch': 2,
                      'pulled': 7,
                      'background': 12,
                      'dual-ladder': 1,
                      'continuous': 4,
                      'sprinkling': 3,
                      'framed': 1,
                      'buns': 8,
                      'methods': 11,
                      'novelty': 2,
                      'soften': 1,
                      'Manufacturers': 3,
                      'loaded': 5,
                      'meticulously': 2,
                      'Greentree': 1,
                      'Also': 10,
                      'radio': 2,
                      'coat': 7,
                      'delivery': 6,
                      'seacoast': 1,
                      'trees': 15,
                      'Made': 3,
                      'biology': 1,
                      'lecturing': 2,
                      'scales': 1,
                      'Japanese': 1,
                      'exchange': 1,
                      'Frederick': 1,
                      'due': 13,
                      'attainment': 1,
                      'congratulated': 1,
                      'Turnpike': 2,
                      'unabashed': 1,
                      'sixty': 1,
                      'checked': 5,
                      'manpower': 2,
                      'midwestern': 2,
                      'problem': 29,
                      'curtail': 1,
                      'pulse': 2,
                      'winter': 13,
                      'distance': 14,
                      '5000': 2,
                      'percentage': 4,
                      'bevels': 1,
                      'tidbits': 1,
                      'dares': 1,
                      'midweek': 1,
                      'floorboards': 1,
                      'informal': 1,
                      'reflect': 5,
                      'looking': 7,
                      'Journal': 2,
                      'pleas': 1,
                      'sangaree': 1,
                      'British': 1,
                      'Unlimited': 1,
                      'Well-stretched': 1,
                      'clever': 3,
                      'niceties': 1,
                      'humor': 2,
                      'anchovy': 1,
                      'fibers': 1,
                      'collected': 2,
                      'Big': 1,
                      'bottle': 3,
                      'roofed': 1,
                      'vastly': 1,
                      'recoil': 3,
                      'donating': 1,
                      'Usually': 3,
                      'Position': 2,
                      'microscope': 4,
                      'slowly': 9,
                      'place': 48,
                      'piece': 42,
                      'Itasca': 1,
                      'Sturdy': 1,
                      'rode': 4,
                      'payroll': 1,
                      'Vases': 2,
                      'bombers': 6,
                      '1861': 1,
                      'tasted': 1,
                      'atmosphere': 2,
                      'electronically': 1,
                      'union': 4,
                      'melting': 2,
                      '1821': 2,
                      'land': 18,
                      'galvanism': 1,
                      'railroad': 2,
                      'files': 1,
                      'Over': 2,
                      'instrument': 1,
                      'think': 11,
                      'could': 58,
                      'enjoying': 1,
                      'Federal': 1,
                      'rich': 7,
                      'stopped': 1,
                      'Wilson': 1,
                      'Ernest': 2,
                      'tappet': 15,
                      'spaciousness': 1,
                      'elaborate': 2,
                      'Along': 1,
                      'Bottoms': 1,
                      'twisting': 2,
                      'tapping': 1,
                      'brute': 1,
                      'metered': 2,
                      'customs': 1,
                      'ran': 2,
                      'wagons': 4,
                      'meant': 3,
                      'grease': 2,
                      'rocket': 2,
                      'intent': 1,
                      'tiny': 4,
                      'showmanship': 1,
                      'Displacement': 1,
                      'inventory': 1,
                      '1882': 1,
                      'picture': 11,
                      'Rubber': 2,
                      'gussets': 1,
                      'curious': 2,
                      'sympathetic': 1,
                      'watercolorist': 2,
                      'buildings': 14,
                      'between': 41,
                      'cost': 49,
                      'ordinarius': 1,
                      'Cumhuriyet': 1,
                      'host': 4,
                      'intimate': 1,
                      'premieres': 1,
                      'beam': 3,
                      'butyrate': 9,
                      'outmoded': 1,
                      'diagrams': 1,
                      'masterpiece': 1,
                      'buckle-on': 1,
                      'Atlas': 1,
                      'recognize': 3,
                      'improve': 10,
                      'irreparable': 1,
                      'marked': 7,
                      'wagon': 4,
                      'plastisols': 1,
                      'higher': 13,
                      'Dutch': 1,
                      'cabinets': 2,
                      'foliage': 2,
                      'piston': 7,
                      'convenient-type': 1,
                      'firmly': 7,
                      'Until': 2,
                      'marketing': 30,
                      'backed': 1,
                      'kob': 1,
                      'gymnast': 1,
                      'extras': 1,
                      'amplified': 2,
                      'fruit': 8,
                      '$.50': 2,
                      'endowments': 1,
                      'Factors': 1,
                      'Trackdown': 1,
                      'composers': 2,
                      '1-ton': 1,
                      '2:22': 2,
                      'distinctly': 1,
                      'wiping': 1,
                      '$310': 1,
                      'Simple': 2,
                      'Transportation': 1,
                      'The': 458,
                      'cowboys': 1,
                      'promise': 4,
                      'D.': 2,
                      'vinyl': 2,
                      'sportsman': 1,
                      'rascal': 1,
                      'candles': 1,
                      'of': 2390,
                      'under': 52,
                      'screwed': 10,
                      '47': 2,
                      '$10,000': 1,
                      'law': 5,
                      'tackle': 1,
                      'decimal': 3,
                      'Rim-Fire': 1,
                      'displacement': 12,
                      'nearing': 1,
                      'stained': 1,
                      'inboard': 1,
                      'rear': 9,
                      'Creed': 1,
                      'chimiques': 1,
                      'perky': 1,
                      'conception': 1,
                      'bold': 1,
                      'intra-company': 1,
                      'upland': 2,
                      'contribution': 4,
                      'Pump': 1,
                      'gallonage': 1,
                      'Information': 1,
                      'fractions': 4,
                      'Not': 8,
                      'fecundity': 1,
                      'Juniors': 29,
                      'Bridges': 1,
                      'rehearing': 1,
                      "Dexter's": 1,
                      'sleek': 1,
                      'Finals': 7,
                      'Ritter': 9,
                      'facts': 3,
                      'recreation': 21,
                      'Hudson': 3,
                      "70's": 1,
                      'assembled': 2,
                      'tempos': 1,
                      'Showmanship': 8,
                      'Cover': 1,
                      'policing': 1,
                      'straightening': 1,
                      'jacketed': 2,
                      'Outdoor': 3,
                      'warehousing': 1,
                      'blacked-in': 1,
                      'factory': 1,
                      'entry': 3,
                      "America's": 4,
                      'outside': 15,
                      'tall-growing': 1,
                      'gloves': 2,
                      'arc': 1,
                      'relation': 9,
                      'Timothy': 8,
                      'book': 3,
                      'droves': 1,
                      'suggested': 6,
                      'minimal': 5,
                      'Basic': 1,
                      'fame': 1,
                      'employees': 15,
                      'velvety': 2,
                      'refrigeration': 2,
                      '3-1/2': 1,
                      'Mechanized': 1,
                      'pressure': 19,
                      'bound': 3,
                      'intend': 2,
                      'heartbeat': 2,
                      'old-time': 1,
                      'presents': 3,
                      'pump': 6,
                      'Reducing': 1,
                      'purpose': 11,
                      'Caution': 5,
                      'give': 23,
                      'town': 13,
                      'clamped': 2,
                      'one-twentieth': 1,
                      'absolute': 3,
                      'membership': 9,
                      'Sugar': 1,
                      'boring': 2,
                      'scientists': 2,
                      'dishes': 2,
                      'mobility': 2,
                      'beauty': 6,
                      'gamut': 1,
                      'ammunition': 6,
                      "Henri's": 2,
                      'inhibition': 1,
                      'soldering': 1,
                      'rooting': 2,
                      '1777': 1,
                      'dealers': 11,
                      'Richard': 1,
                      'directed': 2,
                      'multiplication': 3,
                      'aesthetic': 3,
                      'strenuous': 1,
                      'applying': 3,
                      'allow': 12,
                      'rag': 1,
                      'Boissoneault': 1,
                      'similarity': 1,
                      'Aj': 16,
                      'chutney': 1,
                      'Simca': 1,
                      'travelogue': 1,
                      'actually': 11,
                      'Several': 5,
                      'rocked': 1,
                      'Westinghouse': 1,
                      'softened': 1,
                      'juice': 2,
                      'Billy': 1,
                      'Glazed': 1,
                      'Gardens': 5,
                      'after': 49,
                      'young': 14,
                      'vertical-takeoff-and-landing': 1,
                      'Very': 1,
                      'Arms': 2,
                      'corrugated': 2,
                      'man': 16,
                      'response': 1,
                      'article': 3,
                      'Trim': 4,
                      'pool-side': 2,
                      'authority': 6,
                      'Point': 2,
                      'Perhaps': 1,
                      'grace': 1,
                      'confused': 1,
                      'Audubon': 1,
                      'coverings': 1,
                      'missile': 19,
                      'big': 21,
                      'long-term': 2,
                      'unbelievable': 1,
                      'inspect': 1,
                      'E': 3,
                      '260': 2,
                      'trajectory': 1,
                      'Cod': 1,
                      'attempted': 2,
                      'completely': 13,
                      'C': 7,
                      'wooden': 17,
                      'cold': 10,
                      'blanks': 2,
                      'opened': 6,
                      'left': 39,
                      'Abilene': 2,
                      '$.10-a-minute': 1,
                      'Simmer': 2,
                      'fortified': 1,
                      'Charlottesville': 1,
                      'Holy': 1,
                      'clip': 3,
                      'stood': 5,
                      'worst': 1,
                      'rounds': 1,
                      'deck': 1,
                      'lend': 1,
                      'drinking': 2,
                      'toasted': 1,
                      'AAA': 1,
                      'street': 8,
                      'ex-President': 1,
                      'bow': 2,
                      'patterns': 10,
                      'Hanover-Bertie': 1,
                      'Konzerthaus': 1,
                      'Strongheart': 1,
                      'adjoined': 1,
                      'locally': 2,
                      'award': 3,
                      'fill': 7,
                      'modernism': 1,
                      'rusting': 1,
                      '$200': 2,
                      'wears': 1,
                      'transom': 13,
                      'Harlan-Marcia': 1,
                      'now': 61,
                      'Column': 4,
                      'electrocardiograph': 2,
                      'dramatic': 2,
                      'radiant': 1,
                      'taunts': 1,
                      'calibers': 3,
                      "'round": 2,
                      "emperor's": 1,
                      'commonplace': 2,
                      'upsets': 1,
                      'colder': 1,
                      'De': 4,
                      'Sporting': 2,
                      'Yankee': 1,
                      'Karet': 1,
                      '2:06.1': 1,
                      'reckon': 1,
                      'five-cent': 1,
                      'loss': 4,
                      'three': 44,
                      'shortsighted': 1,
                      'automatic': 3,
                      'Latest': 1,
                      'demand': 14,
                      'batch': 1,
                      'stock': 14,
                      'Spring': 3,
                      'allot': 1,
                      'planks': 1,
                      'San': 4,
                      'bridge': 40,
                      'clearer': 2,
                      'industrial': 4,
                      'pure': 3,
                      'organize': 1,
                      'extended': 3,
                      'timers': 1,
                      'refers': 1,
                      'handstand': 1,
                      '06-05': 2,
                      'accorded': 1,
                      'blossom': 4,
                      'Cylinder': 2,
                      'suave': 1,
                      'joy': 1,
                      "community's": 1,
                      'continuity': 1,
                      "Wouldn't": 1,
                      'apartment': 2,
                      'motion': 3,
                      'car': 37,
                      'Analyzer': 1,
                      'miscalculations': 1,
                      'expanse': 1,
                      'Quintet': 1,
                      'A.': 1,
                      'finely': 1,
                      'Methyl': 1,
                      'salad': 5,
                      'present': 26,
                      'describes': 1,
                      'throws': 1,
                      'galley': 1,
                      'science': 8,
                      'persistence': 1,
                      'be': 508,
                      'wealth': 3,
                      'respiratory': 3,
                      'racing': 1,
                      'device': 2,
                      'ready': 8,
                      'reconsidered': 1,
                      'public-spirited': 1,
                      'formulated': 1,
                      'glistened': 1,
                      'culture': 2,
                      'abundance': 1,
                      'shares': 3,
                      'Hawaii': 2,
                      'Hackmann': 1,
                      'Plastics': 1,
                      'conclusion': 1,
                      'ant': 1,
                      'reasons': 5,
                      'auto': 8,
                      'essay': 3,
                      'minutes': 17,
                      'Missiles': 1,
                      'Gas': 2,
                      'Treasury': 1,
                      'equipped': 11,
                      'peerless': 1,
                      'joined': 4,
                      "He's": 2,
                      'grower': 1,
                      'purchasers': 1,
                      'Saracens': 1,
                      'injection-molded': 1,
                      'locale': 1,
                      'circus': 1,
                      'load': 12,
                      'distributor': 6,
                      'favored': 1,
                      'enforcement': 1,
                      'afternoon': 3,
                      'side-looking': 1,
                      'scored': 6,
                      'indistinguishable': 1,
                      'Dream': 2,
                      'drained': 1,
                      'Fascism': 1,
                      'concentrates': 1,
                      'status': 12,
                      'commercial': 7,
                      'specify': 2,
                      'exceptional': 3,
                      'central': 12,
                      'scooping': 1,
                      'tablespoons': 7,
                      'grand-looking': 1,
                      'hazard': 2,
                      'weeds': 1,
                      'vacuuming': 2,
                      'force': 23,
                      'wing-shooting': 1,
                      'scenic': 6,
                      'replaced': 12,
                      'youth': 1,
                      'prevented': 2,
                      'Dean': 2,
                      'sizes': 5,
                      'obligations': 1,
                      'cringing': 1,
                      'Aureomycin': 5,
                      'eastern': 1,
                      'Pansies': 4,
                      'RCA': 2,
                      'vacationers': 1,
                      'blood-filled': 1,
                      'Eating': 1,
                      'Naval': 1,
                      "child's": 2,
                      'lighting': 6,
                      'alone': 16,
                      'Buries': 1,
                      'pickle': 1,
                      'bullet': 4,
                      '989': 1,
                      'handgun': 3,
                      '70-mile-long': 1,
                      'into': 108,
                      'principally': 2,
                      'Initial': 1,
                      'Oranges': 2,
                      'denouncing': 1,
                      'affiliated': 1,
                      'millimeter': 1,
                      'haul': 2,
                      'Abraham': 1,
                      'Jones-Imboden': 1,
                      'keel': 4,
                      'else': 6,
                      'skipped': 1,
                      'justify': 1,
                      'let': 13,
                      'lengthwise': 2,
                      'forecasts': 1,
                      'checking': 1,
                      'localities': 1,
                      'May': 12,
                      'Laguna': 1,
                      'promising': 5,
                      'outraged': 1,
                      'encourages': 2,
                      'discovery': 5,
                      'excelsior': 1,
                      'team': 8,
                      'managed': 2,
                      'thirty-foot': 1,
                      'method': 16,
                      'accommodating': 1,
                      'election': 1,
                      'sweeping': 1,
                      'landscapes': 1,
                      'Years': 1,
                      '2': 37,
                      'hold': 15,
                      'trimming': 2,
                      'seven-shot': 1,
                      '400': 3,
                      'Chlortetracycline': 1,
                      'violent': 1,
                      'length': 14,
                      'mayonnaise': 1,
                      "Russia's": 2,
                      'pupils': 2,
                      'rockets': 4,
                      'rolls': 1,
                      'Bring': 1,
                      'switch': 5,
                      "workers'": 1,
                      'chowders': 1,
                      'SD': 1,
                      'Rodney-The': 1,
                      'entourage': 1,
                      'administrator': 1,
                      'simplified': 5,
                      'all-purpose': 1,
                      'liners': 1,
                      'electric': 9,
                      'shots': 2,
                      'participated': 3,
                      'outdoors': 2,
                      'S': 5,
                      'secretaries': 1,
                      ')': 280,
                      'garlic': 2,
                      'Contrary': 2,
                      'attain': 1,
                      'lecture': 2,
                      'corners': 4,
                      'around': 41,
                      'sayonara': 1,
                      'shakers': 1,
                      'understands': 1,
                      'over-all': 5,
                      'Topography': 2,
                      'edges': 19,
                      'Otherwise': 1,
                      'verisimilitude': 1,
                      'S.': 6,
                      'germinate': 2,
                      'Turkish': 2,
                      'verandas': 1,
                      'smoothest': 1,
                      'Sweepstakes': 1,
                      'high-velocity': 1,
                      'Push-ups': 2,
                      'Land': 1,
                      'startling': 1,
                      'clicks': 1,
                      'clothes': 3,
                      'rockbound': 1,
                      'playful': 1,
                      'relative': 4,
                      'qualities': 3,
                      'forums': 1,
                      'tail': 1,
                      'Shortage': 1,
                      'secondary': 1,
                      'ceramic': 3,
                      'released': 4,
                      'ends': 16,
                      'possibilities': 6,
                      'scanning': 3,
                      ...}),
             'humor': Counter({'Without': 1,
                      'vacuum': 1,
                      'read': 8,
                      'dig': 1,
                      'anyhow': 1,
                      'speak': 1,
                      'grow': 2,
                      'house': 13,
                      'company': 3,
                      'barrage': 1,
                      'monies': 1,
                      'darling': 1,
                      'theater': 2,
                      'building': 1,
                      'Maria': 1,
                      'bangish': 1,
                      'generation': 1,
                      'Between': 1,
                      'More': 1,
                      'summoned': 1,
                      'walked': 2,
                      'use': 10,
                      'worse': 2,
                      'killed': 1,
                      'contraptions': 1,
                      'deigned': 1,
                      'experience': 1,
                      'disorderly': 1,
                      'belonged': 2,
                      'served': 3,
                      'vase': 1,
                      'Eddie': 2,
                      'bars': 1,
                      'odious': 1,
                      'tonal': 1,
                      'gardening': 1,
                      'titanic': 1,
                      'trip': 2,
                      'Oddly': 1,
                      'has': 26,
                      'thwarted': 1,
                      "That'll": 1,
                      'thoroughly': 1,
                      "Roylott's": 1,
                      'instances': 1,
                      'pleading': 1,
                      'boosting': 1,
                      'our': 30,
                      'along': 5,
                      'forum': 1,
                      'certain': 8,
                      'God': 1,
                      'Couple': 1,
                      'demanded': 3,
                      'shade': 2,
                      'maximal': 1,
                      'others': 4,
                      'thespians': 1,
                      'editorial': 1,
                      'prowlers': 1,
                      'anyone': 6,
                      'entertain': 1,
                      'tangled': 1,
                      'grew': 2,
                      'imperilled': 1,
                      'scenario': 1,
                      'treacherous': 1,
                      'contributes': 1,
                      'sleeve': 1,
                      'beautiful': 3,
                      'marriages': 1,
                      'incident': 1,
                      'armadillo': 2,
                      'first': 13,
                      'dictator': 1,
                      'admit': 1,
                      'developed': 1,
                      'Mystery': 1,
                      'lapses': 1,
                      'updated': 1,
                      'cadenza': 1,
                      'messenger': 1,
                      'films': 1,
                      'taverns': 1,
                      'adding': 2,
                      'Reichstag': 1,
                      'promises': 1,
                      'constituents': 1,
                      'chortled': 1,
                      'wall': 6,
                      'railway': 1,
                      'chicken': 3,
                      'determine': 2,
                      'Spector': 1,
                      'under': 8,
                      'chaperon': 1,
                      'glance': 2,
                      'kitchen': 10,
                      'amount': 1,
                      'ship': 1,
                      'pressure': 1,
                      'cheek': 1,
                      "you'll": 1,
                      'directed': 1,
                      'Berlin': 1,
                      'insanity': 1,
                      'hear': 8,
                      'man': 21,
                      'martinis': 1,
                      "wouldn't": 5,
                      'Valley': 1,
                      'through': 18,
                      'Yvette': 1,
                      "victim's": 1,
                      'instigation': 1,
                      'Kodaks': 1,
                      'Just': 4,
                      'square': 2,
                      'each': 9,
                      'Downfall': 1,
                      'fashionable': 2,
                      'battle': 2,
                      'Ah': 1,
                      'boon': 1,
                      'motion': 2,
                      'Justice': 1,
                      'so-so': 1,
                      'pavement': 1,
                      "grocer's": 1,
                      'forget': 2,
                      'it': 162,
                      'burden': 1,
                      'humiliated': 1,
                      'cottage': 1,
                      'tongue-in-cheek': 1,
                      'raised': 2,
                      'misty-eyed': 1,
                      'love': 4,
                      'meanings': 1,
                      'hoping': 2,
                      'snapshots': 1,
                      'repeated': 2,
                      'eventually': 3,
                      'Readers': 1,
                      'apron': 1,
                      'Expressionism': 1,
                      'depressed': 2,
                      'Doris': 2,
                      'hard': 4,
                      "Shakespeare's": 1,
                      'At': 7,
                      'eludes': 1,
                      "I'm": 11,
                      'garlic': 1,
                      'focused': 1,
                      'Station': 1,
                      'merit': 1,
                      'Take': 1,
                      'ridiculed': 1,
                      'Those': 3,
                      'entertainments': 1,
                      'merely': 3,
                      'career': 3,
                      'husband': 8,
                      'friends': 1,
                      'Mesta': 1,
                      'slip': 1,
                      'manage': 1,
                      'vindictive': 1,
                      'small': 9,
                      'fuse': 1,
                      'Apollo': 1,
                      'bump': 1,
                      'red': 4,
                      'performances': 1,
                      'understatement': 1,
                      'commonest': 1,
                      'retracted': 1,
                      'hostess': 2,
                      'recently': 4,
                      'earthy': 1,
                      'Scandal': 2,
                      'distinct': 1,
                      'lenient': 1,
                      "Rossilini's": 1,
                      'babes': 1,
                      'peas': 1,
                      'snapped': 1,
                      'next': 8,
                      'crowed': 1,
                      'screens': 1,
                      'four': 8,
                      'badly': 1,
                      'searching': 2,
                      'really': 6,
                      'slightly': 2,
                      'predictable': 1,
                      'please': 2,
                      'crudely': 1,
                      'rushed': 2,
                      'female': 1,
                      'efforts': 1,
                      'Serbantian': 1,
                      'effect': 5,
                      'selected': 1,
                      'thin': 3,
                      'ad': 3,
                      'falcon': 1,
                      'cell': 3,
                      'cereal': 1,
                      'artists': 1,
                      'family': 6,
                      'imploring': 1,
                      'evening': 5,
                      'Spain': 1,
                      'contradiction': 1,
                      'Tschilwyk': 1,
                      'Replied': 1,
                      'fear': 5,
                      'keen': 1,
                      'rice': 2,
                      'These': 2,
                      'Cal': 1,
                      'late': 5,
                      'indefinity': 1,
                      'Communism': 1,
                      'Possibly': 1,
                      'personalities': 2,
                      'ask': 2,
                      'lemon-meringue': 1,
                      'flautist': 1,
                      'found': 12,
                      'rebuffed': 1,
                      'triumph': 1,
                      'symptoms': 1,
                      'demonstrable': 1,
                      'Nevertheless': 1,
                      'Major': 2,
                      "one's": 1,
                      ':': 40,
                      'pretty': 3,
                      'coverlet': 1,
                      'tangos': 1,
                      'mother': 12,
                      'things': 27,
                      'Now': 14,
                      "mayor's": 1,
                      'Abstract': 1,
                      'can': 16,
                      'department': 3,
                      'sees': 1,
                      'defines': 1,
                      'divided': 1,
                      'suppose': 3,
                      'charging': 1,
                      'scandal': 1,
                      'afghan': 1,
                      'sir': 2,
                      'underwear': 1,
                      'Yorker': 1,
                      'Don': 1,
                      'larder': 1,
                      'constituted': 1,
                      'activities': 1,
                      'German': 1,
                      'leave': 1,
                      'York': 3,
                      'scream': 1,
                      'reporters': 1,
                      'importance': 1,
                      'managing': 1,
                      'power': 7,
                      'gushed': 1,
                      'till': 2,
                      'cheers': 1,
                      'any': 18,
                      'stiffened': 1,
                      'painstakingly': 1,
                      'ordinarily': 1,
                      'reliably': 1,
                      'Fing': 2,
                      'observe': 1,
                      'required': 1,
                      'talent': 1,
                      'toes': 2,
                      'Lord': 1,
                      'housewives': 1,
                      'years': 21,
                      'curiously': 1,
                      'drink': 5,
                      'committing': 1,
                      'Coopers': 1,
                      'run': 4,
                      'discussed': 2,
                      'persons': 2,
                      'entitle': 1,
                      'Barrymores': 1,
                      'indelible': 1,
                      'haunted': 1,
                      'impatience': 1,
                      'playwright': 1,
                      'poised': 1,
                      'themselves': 4,
                      'guest': 1,
                      'send': 4,
                      'music': 4,
                      'visit': 3,
                      'moral': 1,
                      'reactionary': 1,
                      'fallen': 1,
                      'fine': 2,
                      'remainder': 1,
                      'replacing': 1,
                      'Santa': 1,
                      'dropped': 1,
                      'disabuse': 1,
                      'Good': 1,
                      'victim': 1,
                      'speaks': 1,
                      'berth': 1,
                      'reader': 1,
                      'insinuation': 1,
                      'clean': 2,
                      'symbolizing': 2,
                      'her': 62,
                      'pulled': 1,
                      'continuous': 1,
                      'role': 4,
                      'longer': 2,
                      'sing': 1,
                      'strange': 1,
                      'framed': 1,
                      'accents': 1,
                      'sentences': 1,
                      'band': 1,
                      'visited': 3,
                      'loaded': 1,
                      'steam': 2,
                      'disposed': 2,
                      'Also': 2,
                      'radio': 3,
                      'myriad': 1,
                      'ruse': 1,
                      'engagingly': 1,
                      "mother's": 3,
                      'Some': 1,
                      'dove': 1,
                      'Oscar': 1,
                      'due': 1,
                      'grand': 1,
                      'guys': 1,
                      'cut': 4,
                      'formation': 1,
                      'shrewdly': 1,
                      'plea': 2,
                      'appearing': 1,
                      'precious': 1,
                      'Russian': 1,
                      'checked': 1,
                      'serve': 1,
                      'meadow': 1,
                      'problem': 5,
                      'It-wit': 1,
                      'distance': 2,
                      'chancellor': 1,
                      'pegboard': 6,
                      'stage': 8,
                      'tongue-twister': 1,
                      "bee's": 1,
                      'looking': 2,
                      'iniquitous': 1,
                      'hubris': 1,
                      'sugar': 1,
                      'sandpaper': 1,
                      'stack': 1,
                      'British': 2,
                      'Time': 1,
                      'yapping': 1,
                      'livelihood': 1,
                      'heyday': 1,
                      'astounding': 1,
                      'plumber': 1,
                      'exclusive': 1,
                      'sentimental': 1,
                      'Minute': 1,
                      'bottle': 2,
                      'farmed': 1,
                      'thrills': 1,
                      'Usually': 1,
                      'realized': 2,
                      'same': 7,
                      'dragged': 2,
                      'possess': 3,
                      'finances': 1,
                      'piece': 4,
                      'relict': 1,
                      'hardly': 3,
                      'jimmied': 1,
                      'committees': 1,
                      'Marquess': 1,
                      'Belle': 1,
                      "What's": 2,
                      'pangs': 1,
                      'urgent': 1,
                      'inspiring': 1,
                      'greater': 2,
                      'therefore': 2,
                      'remain': 1,
                      'Eh': 1,
                      'Pickfair': 1,
                      'single-handedly': 1,
                      'think': 14,
                      'could': 30,
                      'Smith': 1,
                      'winking': 2,
                      'stripes': 1,
                      'rich': 2,
                      'stopped': 2,
                      'repressed': 1,
                      'blissfully': 1,
                      'pride': 2,
                      'Franklin': 1,
                      'compulsion': 2,
                      'remarkably': 1,
                      'tapping': 1,
                      'darkened': 1,
                      'reread': 1,
                      'Juan': 1,
                      'reflexes': 1,
                      'ran': 1,
                      'littered': 1,
                      'Heywood': 1,
                      'meant': 2,
                      'objets': 1,
                      'ivory': 5,
                      'inventory': 1,
                      "devil's-food": 1,
                      'picture': 7,
                      'Multiple': 1,
                      'deities': 1,
                      'readable': 1,
                      'performance': 5,
                      'count': 4,
                      'equally': 4,
                      'between': 3,
                      'rare': 2,
                      'woodwork': 1,
                      'blonde': 1,
                      'collapsing': 1,
                      'host': 5,
                      'Mad': 2,
                      'Gold': 1,
                      'intimate': 1,
                      'collation': 1,
                      'report': 1,
                      'solar': 1,
                      'subscription': 1,
                      'appear': 2,
                      'metal': 1,
                      'masterpiece': 1,
                      'Regiment': 1,
                      'World': 1,
                      'inhabitants': 1,
                      'wagon': 2,
                      "Bradley's": 1,
                      'Fascio-Communist': 1,
                      'matrimonial': 1,
                      'eyes': 10,
                      'filled': 1,
                      'simple': 1,
                      'petting': 1,
                      'Soak': 1,
                      'culprit': 1,
                      'Hastening': 1,
                      'Senator': 2,
                      'fruit': 2,
                      'remind': 1,
                      'Tasti-Freeze': 1,
                      'lit': 1,
                      'favorite': 1,
                      'oughta': 1,
                      'wither': 1,
                      'told': 20,
                      'they': 70,
                      'done': 13,
                      'panels': 1,
                      'instructed': 1,
                      'banks': 1,
                      'may': 8,
                      'paraphrase': 2,
                      'Best': 1,
                      'maxim': 1,
                      'portion': 1,
                      'bowl': 1,
                      'existentialist': 1,
                      'Pioneers': 1,
                      'convertible': 1,
                      'law': 2,
                      'abroad': 1,
                      'walrus': 1,
                      'autos': 1,
                      'twofold': 1,
                      'stained': 1,
                      'charming': 2,
                      'quarter': 2,
                      'police': 7,
                      'visiting': 2,
                      'acting': 1,
                      'observed': 1,
                      'state': 5,
                      'White': 1,
                      'felt': 7,
                      'private': 3,
                      'beaten': 1,
                      'separate': 1,
                      'bold': 1,
                      'actors': 1,
                      'lived': 2,
                      'climbed': 1,
                      'Horror': 1,
                      'applejack': 1,
                      'autumn': 1,
                      'lack': 2,
                      "ladies'": 2,
                      'maid': 2,
                      'over-spent': 1,
                      'facts': 1,
                      'Travancore': 1,
                      'musical': 1,
                      'antic': 1,
                      'Tibetan': 1,
                      'L.': 1,
                      "Letch's": 2,
                      'peculiar': 1,
                      'stumbles': 1,
                      'too': 19,
                      'balkiness': 1,
                      'Siberia': 1,
                      'bulwark': 1,
                      'bathtubs': 1,
                      'aspect': 1,
                      'funny': 17,
                      'homosexual': 1,
                      'staircase': 1,
                      'entry': 1,
                      "America's": 1,
                      'outside': 3,
                      'hung': 4,
                      'faces': 1,
                      'language': 1,
                      'book': 3,
                      'sprue': 1,
                      'following': 2,
                      'suggested': 2,
                      'helped': 2,
                      'harshly': 1,
                      'unidentified': 1,
                      'bearer': 1,
                      'instantly': 1,
                      'bound': 1,
                      'homesteads': 1,
                      'aided': 1,
                      'occupants': 1,
                      'above': 2,
                      'endlessly': 1,
                      'heartily': 2,
                      'give': 2,
                      'clumsy': 1,
                      'Hubba': 1,
                      'misfortune': 1,
                      'Thus': 2,
                      'Evidently': 1,
                      'implications': 1,
                      'enormous': 1,
                      'upset': 1,
                      'dishes': 3,
                      'practice': 1,
                      'minute': 1,
                      'comedians': 1,
                      'Coughlin': 1,
                      'bothering': 1,
                      'Richard': 1,
                      'operator': 2,
                      'horns': 1,
                      'obsessed': 1,
                      'Sanitary': 1,
                      'tells': 2,
                      'identified': 2,
                      'boost': 1,
                      'fragment': 1,
                      'wander': 1,
                      'missing': 2,
                      'actually': 2,
                      'Shapes': 1,
                      'precocious': 1,
                      'screenland': 1,
                      'explanation': 1,
                      'replied': 3,
                      'preserve': 1,
                      'barbarian': 1,
                      'N': 2,
                      'after': 19,
                      'questioning': 1,
                      'thought': 14,
                      'Diet': 1,
                      'employed': 1,
                      'world-renowned': 1,
                      'vanishes': 1,
                      'Hollywood': 5,
                      'sculpture': 2,
                      'Musee': 1,
                      'cousin': 1,
                      'Our': 2,
                      'Perhaps': 4,
                      'confused': 3,
                      'big': 3,
                      'unwittingly': 1,
                      'slips': 3,
                      'inspect': 2,
                      'clearly': 4,
                      'attempted': 1,
                      'wooden': 3,
                      'funnier': 1,
                      'cold': 3,
                      'ambiguities': 1,
                      'financial': 2,
                      'opened': 2,
                      'left': 8,
                      'hats': 1,
                      'illegitimate': 1,
                      'utterly': 1,
                      'appealed': 1,
                      'mere': 2,
                      'fate': 2,
                      'insults': 1,
                      'warm': 2,
                      'return': 3,
                      'clergyman': 1,
                      'stood': 1,
                      'unfortunately': 1,
                      'drinking': 2,
                      'jest': 1,
                      'glasses': 1,
                      'street': 2,
                      'saner': 1,
                      'sins': 1,
                      'current': 1,
                      'View': 1,
                      'bitten': 1,
                      'slob': 2,
                      'worth': 1,
                      'fill': 2,
                      'scimitar': 1,
                      'squat': 1,
                      'called': 15,
                      'town': 3,
                      'now': 14,
                      'non-time': 1,
                      'puzzled': 2,
                      'robber': 1,
                      'smart': 1,
                      'Rattzhenfuut': 1,
                      'dramatic': 1,
                      'vacated': 1,
                      'faint': 2,
                      'Meinckian': 1,
                      'suddenly': 1,
                      'Interlude': 1,
                      'Sponge': 1,
                      'gets': 2,
                      'bilharziasis': 1,
                      'lay': 2,
                      'workshop': 4,
                      'definite': 1,
                      'loss': 2,
                      'three': 6,
                      'shut': 1,
                      'alas': 1,
                      'demand': 1,
                      'stock': 1,
                      'musicals': 1,
                      'rooms': 2,
                      'bout-de-souffle': 1,
                      'San': 2,
                      'industrial': 1,
                      'pure': 1,
                      'Anna': 1,
                      'nervous': 3,
                      'choice': 1,
                      'Fredrico': 1,
                      'reluctant': 1,
                      'stretched': 1,
                      'creamed': 1,
                      'stands': 1,
                      'detective': 1,
                      'cafeteria': 1,
                      'typed': 1,
                      'gag': 2,
                      'curvaceously': 1,
                      'interweaving': 1,
                      'murders': 1,
                      'personal': 4,
                      'difficulty': 2,
                      'Carroll': 1,
                      'remarking': 1,
                      'recorded': 1,
                      'seriously': 2,
                      'Nebraska': 1,
                      'Margo': 1,
                      'afternoons': 1,
                      'present': 3,
                      'describes': 1,
                      'leitmotiv': 1,
                      'statue': 1,
                      'science': 1,
                      'resting': 1,
                      'Noel': 1,
                      'hit': 1,
                      'be': 78,
                      'beaches': 2,
                      'must': 9,
                      'names': 5,
                      'helpful': 1,
                      'ready': 3,
                      'kissing': 1,
                      'guided': 1,
                      'acknowledged': 1,
                      'outfit': 1,
                      'an': 75,
                      'abundance': 1,
                      'France': 1,
                      'enthusiasm': 1,
                      'heap': 1,
                      'questioned': 1,
                      'jail': 3,
                      'expected': 1,
                      'garlanded': 1,
                      'minutes': 4,
                      'Fogg': 1,
                      "He's": 2,
                      'Saracens': 1,
                      'leaping': 1,
                      'friendly': 1,
                      'afternoon': 3,
                      'indistinguishable': 1,
                      'central': 1,
                      'carefully': 2,
                      'cartons': 1,
                      'soloist': 1,
                      'jocular': 2,
                      'Salamander': 1,
                      'residence': 1,
                      'bantered': 1,
                      'dignity': 1,
                      'equivocal': 1,
                      'force': 1,
                      'scenic': 1,
                      'youth': 2,
                      'proper': 2,
                      'throw': 4,
                      'blossomed': 1,
                      'middle-Gaelic': 1,
                      'recapitulate': 1,
                      'collector': 1,
                      'dinners': 2,
                      'trail': 1,
                      'catch': 1,
                      'alone': 2,
                      'humans': 1,
                      'commercially': 2,
                      'rackets': 1,
                      'dumping': 1,
                      'troops': 1,
                      'unrelieved': 1,
                      'ransack': 1,
                      'breathe': 2,
                      'Haumd': 1,
                      'into': 35,
                      'vowed': 1,
                      'exterminate': 1,
                      'question': 4,
                      'rostrum': 1,
                      'haul': 1,
                      'policemen': 2,
                      'address': 2,
                      'else': 7,
                      'V': 1,
                      'cult': 1,
                      'local': 2,
                      'Nixon': 1,
                      'let': 5,
                      'doweling': 1,
                      'unborn': 1,
                      'devout': 1,
                      'gave': 5,
                      'becoming': 1,
                      'apparent': 2,
                      'inhumane': 1,
                      'seeks': 1,
                      'grown': 2,
                      'managed': 1,
                      'Hughes': 1,
                      'coyness': 1,
                      'murder': 5,
                      'existed': 1,
                      'because': 16,
                      'abstract': 1,
                      'sweeping': 1,
                      'cryptic': 1,
                      'pink': 1,
                      'hours': 3,
                      'gagline': 1,
                      'Bill': 2,
                      'variety': 2,
                      'Doors': 1,
                      'involved': 2,
                      '``': 343,
                      'short': 3,
                      'apprehended': 1,
                      'glad': 1,
                      'tragicomic': 1,
                      'arranged': 1,
                      'wait': 3,
                      'chair': 2,
                      'stardom': 2,
                      "Crombie's": 1,
                      'electric': 2,
                      'shots': 1,
                      'second': 6,
                      'absurd': 1,
                      ')': 31,
                      'forgive': 1,
                      'pinball': 1,
                      'rediscovery': 1,
                      'gage': 1,
                      'many': 15,
                      'corners': 1,
                      'around': 15,
                      'comprises': 1,
                      'spite': 2,
                      'followed': 3,
                      'reached': 4,
                      'genius': 1,
                      'gourmet': 2,
                      "I'll": 4,
                      'unreal': 1,
                      'stupefying': 1,
                      'According': 1,
                      'derision': 1,
                      'Furiouser': 1,
                      'neurotic': 1,
                      'secretary': 2,
                      'refused': 4,
                      'about': 36,
                      'suspended': 1,
                      'indefinitely': 1,
                      'Kline': 1,
                      'clothes': 1,
                      'laughingly': 1,
                      'indefinite': 4,
                      'playful': 1,
                      'unpaintable': 1,
                      'Callas': 1,
                      'dies': 1,
                      'dispose': 1,
                      'feat': 1,
                      'somehow': 1,
                      'argument': 1,
                      'released': 1,
                      'ends': 1,
                      'convulsed': 1,
                      'outer': 2,
                      'Such': 5,
                      'attracted': 1,
                      'letter': 4,
                      'Excuse': 1,
                      'proportions': 1,
                      'Ye': 1,
                      'make': 13,
                      'another': 11,
                      'disappearing': 1,
                      'from': 59,
                      'devilish': 1,
                      'sweet': 2,
                      'collapse': 1,
                      'sensed': 1,
                      'built': 3,
                      'speaker': 1,
                      'Ever': 1,
                      'Toynbee': 1,
                      'park': 3,
                      'affronted': 1,
                      'Pollock': 1,
                      'availed': 2,
                      'area': 4,
                      'Mother': 12,
                      'systems': 1,
                      'naturally': 1,
                      'studied': 1,
                      'twenty-five': 2,
                      'philosophic': 1,
                      'appreciate': 2,
                      'parent-teacher': 1,
                      'prancing': 1,
                      'Steuben': 1,
                      'forgotten': 3,
                      'fellow': 3,
                      'much-needed': 1,
                      'probably': 5,
                      'frequently': 1,
                      'During': 3,
                      'porch': 1,
                      'tension': 1,
                      'ahead': 1,
                      'speech': 1,
                      'sculptures': 2,
                      'tasks': 1,
                      'wry': 1,
                      'borrow': 1,
                      'danced': 1,
                      'speeches': 1,
                      'drunkenly': 1,
                      'coral-colored': 1,
                      'or': 66,
                      'Law': 1,
                      'states': 1,
                      'romance': 2,
                      'destruction': 2,
                      'Khrushchev': 1,
                      'English': 13,
                      'playing': 5,
                      'sexy': 1,
                      'city': 3,
                      'balding': 1,
                      'chemistry': 1,
                      'sex': 1,
                      'fat': 1,
                      'increased': 2,
                      'original': 2,
                      'actress': 2,
                      'stomach': 3,
                      'fairy-tale': 1,
                      'revolt': 1,
                      'dentist': 1,
                      'shops': 1,
                      'mud': 2,
                      'deal': 2,
                      'Vera': 1,
                      'consciousness': 1,
                      'plans': 1,
                      'duly': 1,
                      'reaches': 1,
                      'Skolkau': 1,
                      'lumbar': 1,
                      'arenas': 1,
                      'general': 2,
                      'unearthed': 1,
                      'sidle': 1,
                      'invite': 1,
                      'kicking': 1,
                      'heart': 14,
                      'saleslady': 3,
                      'shop': 1,
                      'French': 6,
                      'artistically': 1,
                      'anywhere': 1,
                      'combat': 1,
                      'notes': 1,
                      'Miss': 8,
                      'skillful': 1,
                      'conscientious': 1,
                      'Chalmers': 1,
                      'inexorable': 1,
                      'opinions': 1,
                      'dismay': 1,
                      'amid': 2,
                      'urges': 1,
                      'came': 12,
                      'snobbishly': 1,
                      'Felix': 1,
                      'plumbed': 1,
                      'year': 8,
                      'independence': 1,
                      'reverse': 1,
                      'victory': 1,
                      'divinity': 1,
                      'adjusted': 1,
                      'Too': 1,
                      'both': 9,
                      'painted': 1,
                      'Radio': 1,
                      'vary': 1,
                      'somersaulting': 1,
                      "murderer's": 1,
                      'window': 3,
                      'Classified': 1,
                      'jaded': 1,
                      'co-star': 1,
                      "Marshall's": 1,
                      'organized': 1,
                      ...}),
             'learned': Counter({'laden': 1,
                      'elaborates': 1,
                      'builders': 6,
                      'dig': 1,
                      '23': 10,
                      'Homeric': 15,
                      'speak': 13,
                      'located': 14,
                      'economics': 2,
                      'house': 20,
                      'Abelson': 1,
                      'polymeric': 2,
                      'promises': 2,
                      'adultery': 1,
                      'indispensible': 1,
                      'building': 18,
                      'waters': 4,
                      'Between': 1,
                      'More': 22,
                      'monomer': 1,
                      'suds': 6,
                      'restrain': 1,
                      'use': 120,
                      'suffixes': 1,
                      'killed': 1,
                      'experience': 53,
                      'Cumulative': 1,
                      'endure': 2,
                      'served': 17,
                      'appendages': 1,
                      'irradiation': 9,
                      'population': 64,
                      'warrants': 2,
                      'initiator': 2,
                      'all-over': 1,
                      'nomination': 1,
                      'pleading': 2,
                      'Systems': 3,
                      'Entry': 1,
                      'forum': 1,
                      'fluctuating': 1,
                      'wages': 9,
                      'zeal': 1,
                      'K': 7,
                      'demanded': 5,
                      'vicinity': 1,
                      'Wait': 5,
                      'mercy': 3,
                      'Laboratory': 3,
                      'chancellor': 1,
                      'beautiful': 2,
                      'averaged': 10,
                      'sixth': 5,
                      'cerebellum': 1,
                      'marriages': 9,
                      'programming': 4,
                      'denying': 1,
                      'Believing': 1,
                      'sorted': 1,
                      "supervisor's": 1,
                      'intrapulmonary': 1,
                      'constituents': 4,
                      'chicken': 6,
                      'materially': 1,
                      'shrapnel': 2,
                      'cups': 1,
                      'quiescent': 1,
                      'amount': 52,
                      'paleoexplosion': 1,
                      'pressure': 68,
                      'Short': 1,
                      'amended': 1,
                      '76.7': 1,
                      'legitimacy': 1,
                      'Input/Output': 4,
                      'evils': 1,
                      "wouldn't": 1,
                      'proportionally': 1,
                      'through': 141,
                      '10-foot': 1,
                      'accomplishes': 1,
                      'Just': 6,
                      'specific': 51,
                      'deemed': 5,
                      'Because': 13,
                      'ascertain': 1,
                      'Justice': 13,
                      'emission': 31,
                      'expanse': 1,
                      'surreptitiously': 1,
                      'vegetable': 1,
                      'lookup': 7,
                      'resolute': 1,
                      'W.G.': 1,
                      "yesterday's": 1,
                      'Democrats': 1,
                      'behave': 3,
                      'cottage': 1,
                      'raised': 17,
                      'fertility': 8,
                      'meanings': 14,
                      'stream': 18,
                      'revealed': 9,
                      'Ernst': 3,
                      'grave': 6,
                      'pathogenic': 2,
                      'sprung': 3,
                      'eternal': 4,
                      'epitomized': 1,
                      'tacitly': 1,
                      'experimenter': 3,
                      'Fractions': 1,
                      'At': 48,
                      'roundup': 1,
                      ')': 800,
                      'Poynting-Robertson': 5,
                      'stimulated': 1,
                      'mastery': 1,
                      'Morphophonemic': 1,
                      'accumulates': 1,
                      'exigencies': 1,
                      'Those': 6,
                      'surprisingly': 3,
                      'fibrous': 4,
                      'recurred': 1,
                      'prophets': 1,
                      'hesitant': 2,
                      'subdivisions': 1,
                      'thyrotoxic': 1,
                      'sands': 3,
                      'slip': 2,
                      'negative': 23,
                      "patient's": 13,
                      'indexes': 2,
                      'underlie': 2,
                      'small': 107,
                      'Ruling': 4,
                      'centrifuged': 7,
                      'enemy': 14,
                      'Typically': 1,
                      'directed': 11,
                      'quo': 2,
                      'revolutions': 2,
                      'knife': 36,
                      'perceptual': 3,
                      'Polaroid': 1,
                      'tournament': 2,
                      'enforced': 4,
                      'Removal': 1,
                      'strike': 10,
                      "Congress'": 2,
                      'four': 46,
                      'e.g.': 14,
                      'really': 18,
                      'Laying': 1,
                      'singled': 2,
                      'substrate': 23,
                      'lends': 1,
                      'Geiger': 1,
                      'comforting': 1,
                      'Finnsburg': 1,
                      'signals': 2,
                      'effect': 67,
                      'shielding': 1,
                      "growers'": 1,
                      'DUF': 5,
                      'smoother': 2,
                      'memory-images': 2,
                      'non-existent': 1,
                      'King': 6,
                      'lanthanum': 1,
                      'artists': 17,
                      'refund': 12,
                      'constant': 26,
                      'Stoics': 1,
                      'confronting': 1,
                      'evening': 2,
                      'Thyroglobulin': 1,
                      'hymn': 3,
                      'relax': 9,
                      '43*0C.': 2,
                      'Bureau': 3,
                      'wives': 2,
                      'retrieved': 7,
                      'tangents': 6,
                      'Annual': 1,
                      'assets': 9,
                      'ask': 5,
                      'found': 117,
                      'contingent-fee': 2,
                      'Nakamura': 3,
                      'statutory': 7,
                      'carbonates': 1,
                      'excretion': 1,
                      'sixty-one': 1,
                      'ethos': 1,
                      'mother': 6,
                      'Caribbean': 2,
                      'Chiang': 1,
                      'things': 36,
                      'powders': 4,
                      'Abstract': 2,
                      'capacities': 2,
                      'independent': 28,
                      'corporeal': 1,
                      'abrogated': 1,
                      'Boeing': 1,
                      'introjects': 1,
                      'feed': 13,
                      'nickel-iron': 1,
                      'bits': 3,
                      'transported': 1,
                      "Post's": 1,
                      '18': 13,
                      'hr.': 4,
                      'x': 2,
                      'stops': 1,
                      'operand': 16,
                      'gadgetry': 1,
                      'molten': 2,
                      'Verdi': 1,
                      'academics': 2,
                      'pricing': 2,
                      'chap.': 3,
                      'confabulation': 2,
                      'German': 2,
                      'survive': 4,
                      'antithyroid': 5,
                      'vehicles': 3,
                      '1959': 31,
                      'nucleated': 1,
                      'Routine': 2,
                      'yielded': 6,
                      'pierced': 2,
                      'shadow': 1,
                      'generalized': 7,
                      'fabric': 8,
                      'dramatizing': 1,
                      'Loomis': 4,
                      'Exceptional': 1,
                      'hogs': 1,
                      'narrowing': 2,
                      'Cox': 1,
                      'tropocollagen': 1,
                      'viability': 1,
                      'outset': 3,
                      'dying': 2,
                      'automatically': 10,
                      'milestone': 1,
                      'constructing': 2,
                      'Merchant': 2,
                      'Grabbing': 1,
                      'conceivable': 3,
                      '463-degrees-C': 1,
                      '1961': 9,
                      'strip': 3,
                      'breadth': 1,
                      'hunting': 1,
                      'Resolved': 4,
                      'run': 13,
                      'cocao': 1,
                      'Saturn': 3,
                      'equity': 1,
                      'abdominis': 1,
                      'importantly': 2,
                      'entitle': 1,
                      'unconcerned': 1,
                      'accelerometer': 17,
                      'high-positive': 1,
                      'themselves': 35,
                      'audited': 1,
                      'Niccolo': 1,
                      'Supervisor': 1,
                      'Woven': 2,
                      'diminish': 3,
                      'humanist': 2,
                      'speaks': 3,
                      'pulled': 3,
                      'thousand-fold': 1,
                      'domestic': 15,
                      'novelty': 2,
                      'soften': 2,
                      'loaded': 4,
                      'radio': 28,
                      'coat': 4,
                      'treaties': 2,
                      'scales': 2,
                      'Frederick': 1,
                      'due': 45,
                      'attainment': 1,
                      'seeming': 2,
                      'vocalic': 1,
                      'marry': 2,
                      'proviso': 1,
                      'lexicostatistics': 4,
                      'relativistic': 1,
                      'problem': 82,
                      'turrets': 1,
                      'pulse': 1,
                      '1930': 3,
                      'percentage': 6,
                      'Rhine': 1,
                      'decompose': 1,
                      'looking': 7,
                      'dictate': 1,
                      'Journal': 13,
                      'Belgium': 1,
                      'amphibious': 1,
                      'moderately': 3,
                      'clever': 1,
                      'Time-servers': 1,
                      'picnics': 1,
                      'delegation': 1,
                      'sentimental': 1,
                      'bolster': 1,
                      '22%': 1,
                      "beef's": 1,
                      'determination': 10,
                      'microscope': 2,
                      'slowly': 10,
                      '2.26': 1,
                      'transcendence': 1,
                      'neuroses': 4,
                      'frail': 1,
                      "'?'": 1,
                      'atmosphere': 20,
                      'gulf': 1,
                      '1859': 8,
                      'announcements': 1,
                      'equal': 29,
                      'instrument': 15,
                      'pork': 6,
                      'Especially': 1,
                      'unleveled': 1,
                      'Lawrence': 3,
                      'triumphant': 1,
                      'censuses': 3,
                      'rich': 6,
                      'spurious': 1,
                      'pride': 5,
                      'Franklin': 2,
                      'advocates': 1,
                      'semi-major': 2,
                      'rites': 1,
                      'inflexible': 1,
                      'monopolize': 1,
                      'reflexes': 3,
                      'ran': 3,
                      'optimism': 1,
                      '2-degrees-C': 1,
                      'unmanageably': 1,
                      'Refund': 1,
                      'tiny': 1,
                      'Rottger': 2,
                      'nemesis': 2,
                      'inventory': 1,
                      'mechanochemically': 1,
                      'phosphors': 1,
                      'Yorkers': 1,
                      'butyl-lithium': 1,
                      'indium': 1,
                      'A.M.': 1,
                      'adapted': 3,
                      'hypothalamus': 18,
                      'colossal': 1,
                      'buildings': 8,
                      'between': 191,
                      'woodwork': 2,
                      'operands': 1,
                      'monks': 3,
                      'lifeless': 1,
                      'solar': 8,
                      'sucess': 1,
                      'involutorial': 3,
                      'Jackman': 1,
                      'improve': 8,
                      'Tasmania': 1,
                      'cut-and-dried': 1,
                      'trigonal': 2,
                      'companions': 1,
                      'higher': 42,
                      'all-Negro': 6,
                      'willow': 7,
                      'foliage': 1,
                      'Alpha': 5,
                      'Stanley': 1,
                      'residences': 1,
                      'decomposing': 1,
                      'firmly': 7,
                      'Until': 6,
                      'deplorable': 1,
                      'Achievement': 1,
                      'meets': 23,
                      'movers': 1,
                      'amplified': 4,
                      'paradoxically': 1,
                      'dismissed': 1,
                      'universality': 2,
                      'told': 14,
                      'composers': 1,
                      's-values': 1,
                      'greatness': 1,
                      'Transportation': 1,
                      'The': 1458,
                      'heat': 27,
                      'D.': 10,
                      'gases': 1,
                      'Hoijer': 3,
                      'Raymond': 2,
                      'spray-dried': 1,
                      'of': 7418,
                      'restrictions': 5,
                      'law': 85,
                      'legibility': 1,
                      'Asher': 1,
                      '10.6%': 1,
                      'red-bellied': 1,
                      "2'": 1,
                      'Gertrude': 1,
                      'police': 5,
                      'visiting': 4,
                      'alienates': 1,
                      'corner': 15,
                      'vocalize': 1,
                      'directivity': 1,
                      'pregnancy': 4,
                      'circumstance': 2,
                      'protest': 1,
                      'self-confidence': 1,
                      'dispersed': 5,
                      'totality': 1,
                      "ladies'": 1,
                      'monograph': 1,
                      'facts': 12,
                      'thyrotrophin': 1,
                      'unusually': 1,
                      'announcing': 1,
                      'peculiar': 5,
                      'well': 122,
                      'linguists': 6,
                      'inciting': 1,
                      'factory': 9,
                      'entry': 9,
                      'overhead': 3,
                      'two-step': 1,
                      'outside': 23,
                      '$230,000': 1,
                      'arc': 31,
                      'interruptions': 1,
                      'book': 13,
                      'unrest': 1,
                      'linguistics': 2,
                      'Towards': 2,
                      'mentions': 1,
                      'refrigeration': 10,
                      'nightly': 1,
                      'blame': 2,
                      'EQU': 5,
                      'knowingly': 1,
                      'precaution': 1,
                      'aligned': 5,
                      'presents': 6,
                      'optics': 1,
                      'give': 56,
                      'membership': 4,
                      'dishes': 1,
                      'treason': 3,
                      'tape': 16,
                      'bushels': 1,
                      'Hollingshead': 1,
                      'over-hand': 1,
                      'inhibition': 5,
                      'Emission': 1,
                      'formalities': 1,
                      'drugstore': 5,
                      'diverse': 5,
                      'applying': 12,
                      'critical': 17,
                      'anhydrously': 1,
                      '1159': 1,
                      'anticipations': 1,
                      'withholding': 1,
                      'missing': 3,
                      'economist': 1,
                      'explanation': 17,
                      'softened': 1,
                      'N': 27,
                      'after': 95,
                      'conjectures': 1,
                      'questioning': 2,
                      'reached': 25,
                      'nectar': 2,
                      'response': 30,
                      'homemaker': 2,
                      'bargain': 2,
                      'authority': 16,
                      'Perhaps': 4,
                      '1844': 2,
                      'fitness': 3,
                      'punctuated': 1,
                      'football': 2,
                      'strikingly': 4,
                      'big': 12,
                      'Ghoreyeb': 2,
                      'Fowler': 1,
                      'long-term': 11,
                      'quantities': 5,
                      'arresting': 2,
                      '180-degrees': 1,
                      'fills': 2,
                      'attempted': 7,
                      'completely': 23,
                      'C': 74,
                      'cores': 3,
                      'cold': 16,
                      'opened': 6,
                      'Cambridge': 2,
                      'radioactive': 4,
                      'illegitimate': 2,
                      'ambitiously': 1,
                      'twenty-three': 1,
                      'Cattle': 1,
                      'prechlorination': 1,
                      'stood': 7,
                      'unfortunately': 4,
                      'deck': 2,
                      'lend': 2,
                      'street': 5,
                      'diagonalizable': 14,
                      'utmost': 1,
                      '8-mm': 1,
                      'adjectives': 3,
                      'Redevelopment': 1,
                      'mV': 1,
                      'proliferation': 1,
                      'town': 15,
                      '10-degrees-C': 1,
                      'Taft-Hartley': 2,
                      'laxness': 1,
                      'multipactor': 2,
                      'Column': 7,
                      'Boris': 10,
                      'commonplace': 3,
                      "Street's": 1,
                      'Maser': 1,
                      "l'Independance": 1,
                      'gets': 11,
                      'high-density': 1,
                      'inserts': 1,
                      'three': 103,
                      'shortsighted': 2,
                      'declaring': 1,
                      'wage': 45,
                      'concerted': 2,
                      'virtue': 4,
                      'pure': 14,
                      'organize': 6,
                      'extended': 22,
                      'recount': 1,
                      'JEDEC': 1,
                      'pursuit': 6,
                      'patiently': 1,
                      'motives': 1,
                      'beast': 1,
                      'resignations': 1,
                      'continuity': 4,
                      'forceful': 1,
                      'forge': 4,
                      'oceanography': 2,
                      'murders': 1,
                      'eerie': 1,
                      'pavement': 1,
                      'over-simplification': 1,
                      'A.': 17,
                      'finely': 2,
                      'airmail': 3,
                      'Martinique': 1,
                      'grows': 3,
                      "Whipple's": 1,
                      'carrots': 1,
                      'receives': 2,
                      'be': 1363,
                      'wealth': 1,
                      'names': 23,
                      'tarnished': 1,
                      '1875': 4,
                      'Ten': 3,
                      'transference': 2,
                      'transforms': 3,
                      'France': 1,
                      'posing': 1,
                      'Plastics': 1,
                      'orderliness': 1,
                      'conclusion': 26,
                      'reasons': 21,
                      'uncompromising': 1,
                      'invent': 1,
                      'minutes': 30,
                      'Middle-South': 3,
                      'Agriculture': 1,
                      'remote': 12,
                      'intertwined': 1,
                      'confabulations': 1,
                      'Temperatures': 1,
                      'joined': 4,
                      'loan': 4,
                      'DiGiorgio': 1,
                      'photomicrograph': 1,
                      'farfetched': 1,
                      'sharpened': 2,
                      'afternoon': 1,
                      'Money': 3,
                      'attrition': 3,
                      'entirety': 4,
                      'commercial': 25,
                      'pitfalls': 1,
                      'exceptional': 8,
                      'central': 28,
                      'steam-generation': 1,
                      'psychosomatic': 1,
                      'probes': 3,
                      'bully': 1,
                      'mergers': 1,
                      '362': 1,
                      'force': 63,
                      'demineralization': 1,
                      'employ': 3,
                      'replaced': 11,
                      'youth': 9,
                      'prevented': 7,
                      'coincide': 7,
                      'Transit': 1,
                      "N'": 6,
                      'obligations': 6,
                      'Burford': 1,
                      'dinners': 1,
                      'hardness': 1,
                      "child's": 10,
                      'refractory': 1,
                      'alone': 26,
                      'beneath': 4,
                      'compels': 1,
                      'Aerospace': 4,
                      'slugs': 1,
                      'counteract': 3,
                      'into': 222,
                      'principally': 1,
                      'confirms': 2,
                      'affiliated': 1,
                      'deviation': 11,
                      'address': 34,
                      'skipped': 1,
                      'foamed-in-place': 1,
                      'let': 23,
                      'lengthwise': 2,
                      'pertains': 5,
                      'Oder': 3,
                      'localities': 2,
                      'Dufresne': 1,
                      'ingredient': 1,
                      'ascertainable': 1,
                      'necessity': 5,
                      'unworthy': 1,
                      'depredations': 1,
                      'render': 2,
                      'method': 59,
                      'murder': 5,
                      'shakes': 2,
                      'Statistique': 1,
                      'sanctified': 1,
                      'repressive': 1,
                      'appetizing': 2,
                      'unfertilized': 1,
                      'supervisors': 1,
                      '2': 197,
                      'hold': 19,
                      'thiouracil': 3,
                      'violent': 1,
                      'elaborated': 1,
                      'pupils': 10,
                      'rockets': 2,
                      'appraisals': 1,
                      'switch': 17,
                      'incline': 1,
                      'delineating': 2,
                      'administrator': 3,
                      'profit-maximizing': 3,
                      'liners': 1,
                      'electric': 16,
                      'reunion': 1,
                      'shots': 7,
                      'lower-middle': 1,
                      'hypothesis': 11,
                      'absurd': 1,
                      'corners': 4,
                      'volunteered': 1,
                      'force-rate': 1,
                      'wage-rate': 1,
                      'RDW': 2,
                      'infer': 1,
                      'mosaic': 1,
                      'polynomial': 28,
                      'growers': 2,
                      'focused': 4,
                      'Land': 2,
                      'overly': 1,
                      'scorned': 1,
                      'DA': 6,
                      'naked': 1,
                      'commands': 1,
                      '1915': 1,
                      'integrates': 1,
                      'clothes': 4,
                      '0.3M': 1,
                      'amoral': 1,
                      'merest': 1,
                      'Constitution': 2,
                      'qualities': 11,
                      'custody': 1,
                      'converse': 1,
                      'non-pathogenic': 1,
                      'released': 5,
                      'gyro-stabilized': 6,
                      'alleged': 1,
                      'ninth': 5,
                      'chronology': 1,
                      'attracted': 3,
                      'letter': 11,
                      'hide': 3,
                      'Elaborate': 1,
                      'efficiencies': 2,
                      'Junkerdom': 1,
                      'picket': 1,
                      'Meats': 1,
                      'Yearbook': 1,
                      'from': 768,
                      'pseudophloem': 6,
                      'predominantly': 5,
                      'B.C.': 7,
                      'jumble': 2,
                      'ages': 18,
                      'stabilizing-conserving': 1,
                      'demonstrates': 3,
                      'grants': 1,
                      'Catskill': 4,
                      'Corrections': 1,
                      '72': 2,
                      'adjunct': 3,
                      'Humanism': 1,
                      'studied': 16,
                      'mg.': 11,
                      '0.24': 1,
                      're': 1,
                      'recognizable': 2,
                      'years': 87,
                      'sticks': 1,
                      'East': 14,
                      'Aces': 1,
                      'compilation': 8,
                      'burning': 8,
                      'frequently': 18,
                      'During': 32,
                      'brittle': 1,
                      'left-justified': 1,
                      'belch': 1,
                      'tore': 1,
                      'voluntary': 7,
                      'Obviously': 10,
                      'donors': 5,
                      "voice's": 1,
                      'obtrudes': 1,
                      "l'": 2,
                      'unhappy': 3,
                      'destruction': 6,
                      'Wind': 1,
                      'prototypical': 1,
                      'spiraling': 1,
                      'attributes': 7,
                      'sex': 9,
                      'active': 25,
                      'cuckoo-bumblebee': 1,
                      'hosts': 1,
                      'keeping': 7,
                      'imbedded': 1,
                      'easier': 14,
                      'consciousness': 3,
                      'plans': 13,
                      'accordance': 7,
                      'locality': 2,
                      'median': 1,
                      'heart': 7,
                      'primary': 24,
                      'bleak': 2,
                      'national': 36,
                      'wakefulness': 3,
                      'enables': 3,
                      'amenable': 2,
                      'formulae': 3,
                      'authoritarian': 2,
                      'Julio': 1,
                      "lady's": 1,
                      'called': 36,
                      'opinions': 13,
                      '1865': 1,
                      '2500': 1,
                      'reverse': 6,
                      'victory': 1,
                      'p': 6,
                      'merry': 2,
                      'AH6': 1,
                      'Too': 1,
                      'sighted': 2,
                      'agricultural': 7,
                      'susceptible': 1,
                      'G.': 9,
                      'shopping': 8,
                      'evade': 1,
                      'accelerometers': 8,
                      'deals': 7,
                      'subparagraph': 1,
                      'non-wage': 1,
                      'grindings': 1,
                      'organized': 9,
                      'partial': 4,
                      'solution': 18,
                      'demonstrations': 1,
                      'born': 10,
                      'propel': 1,
                      'Viator': 1,
                      'substructure': 1,
                      'chance': 9,
                      'roleplaying': 13,
                      'spirits': 2,
                      'dimethylglyoxime': 2,
                      'semiquantitative': 1,
                      'resonance': 10,
                      'Southern': 2,
                      'fellow': 3,
                      'halfway': 1,
                      'salty': 1,
                      'Preparation': 6,
                      'W-region': 5,
                      'Attributes': 1,
                      'pioneering': 1,
                      'taught': 1,
                      'headquarters': 3,
                      '**zq': 2,
                      'Dwight': 1,
                      'basket': 3,
                      'isolating': 3,
                      'M': 8,
                      'generates': 1,
                      'complained': 3,
                      '19': 8,
                      'legislate': 1,
                      'space-time': 1,
                      'Extensive': 1,
                      'constrictors': 1,
                      'intentions': 3,
                      'noble': 2,
                      'Nor': 4,
                      'hated': 3,
                      'imposed': 6,
                      'campus': 5,
                      'Meaning': 2,
                      'Phonemes': 1,
                      'manages': 1,
                      'five': 33,
                      'audacity': 1,
                      'dividends': 3,
                      'common': 61,
                      'Celso': 1,
                      'less': 109,
                      'outgoing': 1,
                      '153': 2,
                      'upturn': 1,
                      'participate': 1,
                      'graphed': 1,
                      'chronologically': 1,
                      'pantheon': 1,
                      'smaller': 23,
                      'purification': 2,
                      'gravity': 2,
                      'thyroxine-binding': 1,
                      'shall': 40,
                      'Greek': 17,
                      'innocence': 3,
                      'magnificent': 2,
                      'retailing': 4,
                      'circumscriptions': 1,
                      'Report': 11,
                      'thinks': 2,
                      'low-duty': 1,
                      'electrophoresis': 5,
                      'instrumentalities': 1,
                      'intrusive': 1,
                      'forward-moving': 1,
                      'usages': 2,
                      'Literal': 1,
                      'peoples': 4,
                      'accusingly': 1,
                      'Tucson': 1,
                      'claims': 22,
                      'executives': 1,
                      'Van': 4,
                      'seventy-two': 1,
                      'charred': 1,
                      'seventeenth': 1,
                      'justification': 3,
                      'symbolize': 4,
                      'Groth': 4,
                      'rationally': 1,
                      'oval': 1,
                      '2,000': 1,
                      'Start': 1,
                      'radius': 5,
                      'U.': 9,
                      'deep-tendon': 1,
                      'dioxide': 1,
                      'flow': 29,
                      'nuisances': 1,
                      'witnessing': 1,
                      'yielding': 1,
                      'concealing': 1,
                      'race': 7,
                      'task': 12,
                      'culminated': 1,
                      'bales': 1,
                      'overshoots': 1,
                      'optimization': 1,
                      'unsuitable': 1,
                      'sort': 21,
                      'hypothesized': 2,
                      'PQ': 1,
                      'sorption-desorption': 1,
                      '-': 67,
                      'uncluttered': 1,
                      'kind': 30,
                      'questionable': 1,
                      'mid-1948': 1,
                      'constitutional': 3,
                      'disbelief': 1,
                      'thermopile': 1,
                      'potential': 23,
                      'argon': 4,
                      'carboxymethyl': 1,
                      'Student': 1,
                      'recalled': 2,
                      'Byron': 1,
                      'core-jacket': 1,
                      'casually': 1,
                      'Rothko': 1,
                      'Sympathy': 1,
                      'Substantive': 1,
                      'visitors': 4,
                      'Hotel': 1,
                      'canyon': 1,
                      'swift': 1,
                      "Erasmus's": 1,
                      'Clemenceau': 1,
                      'shearing': 4,
                      'countryside': 1,
                      'anti-Soviet': 2,
                      'represents': 17,
                      'traffic': 7,
                      'ear': 1,
                      'Amaral': 1,
                      'dramatically': 2,
                      'pins': 2,
                      'ring-labeled': 1,
                      'lower': 48,
                      'nostalgically': 1,
                      'intensifiers': 6,
                      'consanguineous': 1,
                      '325-degrees-C': 1,
                      'neocortical-hypothalamic': 2,
                      'though': 57,
                      'architect': 1,
                      'successful': 15,
                      'causing': 6,
                      'AAb/': 2,
                      "Homer's": 1,
                      'defeat': 2,
                      'anorthic': 1,
                      'motif': 3,
                      'adsorbed': 3,
                      'misrepresenting': 1,
                      'saved': 9,
                      'antics': 1,
                      'kinetic': 8,
                      'grade-constructed': 1,
                      'princess': 2,
                      "aren't": 3,
                      'NC': 1,
                      'permitting': 3,
                      'designation': 1,
                      'unhealthy': 3,
                      'past': 44,
                      'non-competitive': 1,
                      'twisted': 2,
                      ...}),
             'lore': Counter({'Without': 1,
                      'vacuum': 1,
                      'lighted': 1,
                      'Parish': 1,
                      'laden': 1,
                      'judgments': 1,
                      'More': 11,
                      'tribunals': 1,
                      '23': 1,
                      'Anthony': 2,
                      'assent': 1,
                      'speak': 8,
                      'located': 7,
                      'economics': 1,
                      'Ai': 1,
                      'education': 24,
                      'house': 42,
                      'promises': 1,
                      '1906': 1,
                      'skilled': 3,
                      'conspirators': 1,
                      'overdeveloped': 1,
                      'owned': 5,
                      'tube': 1,
                      'building': 15,
                      'Adams': 1,
                      'waters': 7,
                      'preoccupation': 2,
                      'Between': 1,
                      'dig': 1,
                      'twenty-nine': 1,
                      'skirted': 1,
                      'Plaster': 1,
                      'junction': 1,
                      'roundup': 1,
                      'intrinsic': 1,
                      'Pagans': 1,
                      'use': 68,
                      'Sleight': 1,
                      'killed': 13,
                      'experience': 30,
                      'incurably': 1,
                      'served': 24,
                      'committee': 8,
                      'Doria': 1,
                      'ailments': 3,
                      'bars': 1,
                      'sports': 4,
                      'population': 9,
                      "you'uns": 1,
                      'tempted': 1,
                      'stared': 1,
                      'Sex': 1,
                      'Deck': 1,
                      'thoroughly': 4,
                      'nomination': 2,
                      'plunder': 1,
                      'procedural': 1,
                      'fluctuating': 1,
                      'certain': 31,
                      'season': 9,
                      'zeal': 2,
                      'Jenny': 1,
                      'deal': 17,
                      'demanded': 4,
                      'others': 31,
                      'child-bearing': 1,
                      'sunshine': 1,
                      'psycho-physiology': 1,
                      'manufacturers': 3,
                      'Wait': 1,
                      'tarry': 1,
                      'strains': 2,
                      'button-down': 1,
                      'transition': 2,
                      'Fuji': 1,
                      'Funari': 1,
                      'British-American': 1,
                      'catch': 3,
                      'beautiful': 17,
                      'gesture': 1,
                      'sou': 1,
                      'intimidation': 2,
                      'shorter': 3,
                      'marriages': 7,
                      'browsing': 1,
                      'programming': 1,
                      'self-respect': 1,
                      'Soukhouma': 1,
                      'denying': 2,
                      'Eternal': 1,
                      'dilapidated': 1,
                      'own': 80,
                      'Maier': 1,
                      'cheer': 2,
                      'retains': 1,
                      "countin'": 3,
                      'grotesquely': 1,
                      'imposition': 1,
                      'crasher': 1,
                      'indiscriminate': 1,
                      'Enough': 1,
                      'socked': 1,
                      'constituents': 1,
                      'paradoxically': 3,
                      'chicken': 2,
                      'determine': 4,
                      'navigation': 2,
                      'barrage': 1,
                      'materially': 1,
                      'correct': 8,
                      'volcanic': 2,
                      'Drake': 1,
                      'cabins': 4,
                      "Sorrentine's": 1,
                      'sage': 1,
                      "Side's": 1,
                      'kitchen': 6,
                      'amount': 7,
                      'colleague': 1,
                      'pursuits': 2,
                      '51': 1,
                      'Rake': 1,
                      'crack': 3,
                      'Called': 1,
                      'pressure': 26,
                      'winded': 1,
                      'Creator': 1,
                      'Grands': 3,
                      'Staffe': 4,
                      'northward': 3,
                      'way': 86,
                      'mud': 4,
                      'evils': 1,
                      'expressions': 1,
                      "wouldn't": 4,
                      'munch': 1,
                      'cheated': 1,
                      'Warfield': 1,
                      'through': 97,
                      'Following': 1,
                      "denyin'": 1,
                      'picturesque': 1,
                      'Rugged': 1,
                      'renamed': 1,
                      'Just': 11,
                      'preferring': 1,
                      'specific': 6,
                      'Sicilians': 2,
                      'each': 64,
                      'Because': 7,
                      'hacked': 1,
                      'proclamation': 2,
                      'boon': 1,
                      'Justice': 5,
                      'ink': 3,
                      'tons': 10,
                      'lift': 1,
                      'Huston': 1,
                      'vegetable': 3,
                      'France': 10,
                      'ceased': 2,
                      'Democrats': 3,
                      'craft': 1,
                      'postscript': 1,
                      'cottage': 1,
                      'shy': 1,
                      'raised': 8,
                      'fertility': 1,
                      'stray': 2,
                      'embraces': 1,
                      'stream': 5,
                      'revealed': 9,
                      'grave': 3,
                      'smell': 4,
                      'pastoral': 4,
                      'believing': 3,
                      'keyhole': 1,
                      'fabulous': 2,
                      'hustle': 1,
                      'situations': 3,
                      'irreconcilable': 1,
                      'experimenter': 1,
                      'Quacks': 1,
                      'Related': 1,
                      'Doris': 1,
                      'At': 42,
                      "I'm": 5,
                      'seas': 2,
                      'prolongation': 1,
                      'stimulated': 1,
                      'mastery': 2,
                      'operating': 3,
                      'custody': 1,
                      'mauling': 1,
                      'Messenger': 1,
                      'staked': 1,
                      'scoop': 1,
                      'Those': 4,
                      'surprisingly': 2,
                      'enlist': 1,
                      'fibrous': 1,
                      'rejection': 3,
                      'rights': 5,
                      'lucky': 3,
                      'career': 6,
                      'baggage': 1,
                      'husband': 36,
                      'friends': 15,
                      'chop': 1,
                      'bet': 1,
                      'sleet': 1,
                      'integral': 3,
                      'Prime': 4,
                      'shouting': 3,
                      'worse': 7,
                      'full-grown': 1,
                      'poark': 1,
                      'negative': 1,
                      "patient's": 5,
                      'suns': 1,
                      'small': 37,
                      'bull': 4,
                      'jam': 1,
                      'enemy': 9,
                      'Typically': 2,
                      'competition': 1,
                      'recruits': 2,
                      '$5-8,000': 1,
                      'Thevenow': 1,
                      'knife': 3,
                      'drilled': 2,
                      'chaperone': 1,
                      'generates': 1,
                      'recently': 8,
                      'earthy': 2,
                      'reined': 1,
                      'landmarks': 1,
                      'soda': 2,
                      'peas': 5,
                      'legislation': 3,
                      'next': 28,
                      'armchair': 1,
                      'reared': 1,
                      'mustard': 6,
                      'gravity': 1,
                      'shall': 12,
                      'reinstated': 1,
                      'e.g.': 2,
                      'really': 29,
                      'peoples': 9,
                      'broad': 4,
                      'lends': 1,
                      'idling': 1,
                      'Barrow': 1,
                      'external': 1,
                      'Brevet': 1,
                      'drinker': 1,
                      'disdain': 1,
                      'overloaded': 1,
                      'Please': 1,
                      'striving': 1,
                      'slated': 1,
                      'rushed': 4,
                      'borrowed': 1,
                      'uncorked': 1,
                      'fragrances': 1,
                      'signals': 4,
                      'effect': 10,
                      'rafts': 1,
                      'exact': 4,
                      'huckster': 1,
                      'attractive': 5,
                      'hunk': 1,
                      'Sibley': 1,
                      'peridontal': 1,
                      'officio': 1,
                      'shift': 4,
                      'accounted': 1,
                      'cereal': 12,
                      'fastened': 1,
                      'artists': 3,
                      'masculine': 1,
                      'confronting': 1,
                      'Force': 2,
                      'Amateur': 1,
                      'survivors': 3,
                      'twelve': 4,
                      'cannon': 2,
                      'telegraph': 1,
                      'Joseph': 5,
                      'Drawn': 1,
                      'keen': 1,
                      'chastity': 1,
                      'relax': 1,
                      'eyewitness': 1,
                      'Bureau': 5,
                      'terribly': 3,
                      'wives': 9,
                      'muddy-tasting': 1,
                      "cowhand'd": 1,
                      'runaway': 1,
                      'psychotherapy': 1,
                      'palms': 1,
                      'stead': 2,
                      'ask': 7,
                      'orgasm': 1,
                      'retrovision': 1,
                      'found': 51,
                      'detachment': 1,
                      'banditos': 1,
                      'pronouns': 1,
                      'owning': 1,
                      "one's": 14,
                      'muck': 1,
                      'cultivating': 2,
                      'mittens': 1,
                      'inflame': 1,
                      'pretty': 6,
                      'mother': 19,
                      'Caribbean': 1,
                      'Garry': 7,
                      'things': 31,
                      'Now': 18,
                      'thank': 1,
                      'Bethlehem': 1,
                      'department': 5,
                      'colorful': 3,
                      'independent': 3,
                      'lewd': 1,
                      'exported': 1,
                      'yardstick': 1,
                      'permission': 3,
                      'divided': 6,
                      'suppose': 2,
                      'feed': 7,
                      'fodder': 1,
                      'ambush': 1,
                      'prompt': 3,
                      'bits': 1,
                      'transported': 2,
                      '4,000-foot': 1,
                      '18': 2,
                      'drawn': 4,
                      'inaugural': 1,
                      'ecclesiastical': 2,
                      'Smoke': 1,
                      'rebelled': 2,
                      'bat': 1,
                      'vainly': 1,
                      'euphoric': 1,
                      'investigations': 2,
                      'gallop': 1,
                      'chiropractor': 1,
                      'one-third': 3,
                      'Anti-Americanism': 1,
                      'hatred': 4,
                      'snack': 3,
                      'reflects': 6,
                      'Doubtful': 1,
                      'Wright': 2,
                      'German': 13,
                      'obliterate': 1,
                      'well-modulated': 1,
                      'crofters': 1,
                      'dissolved': 3,
                      'neighbor': 2,
                      'scream': 2,
                      'signal': 3,
                      'vehicles': 1,
                      '1959': 2,
                      "rustlin'": 1,
                      'musicians': 2,
                      'managing': 2,
                      'objectiveness': 1,
                      'Ramsperger': 1,
                      'gushed': 1,
                      'parts-suppliers': 1,
                      'Fretting': 1,
                      'Discovery': 10,
                      'Census': 1,
                      'shoot': 5,
                      'shadow': 2,
                      'prints': 1,
                      'marched': 1,
                      'stiffened': 1,
                      'writer-turned-painter': 1,
                      'rearguard': 1,
                      'Kamchatka': 1,
                      'evidence': 20,
                      'narrowing': 1,
                      'flights': 1,
                      'virginity': 3,
                      'outset': 1,
                      'Scholar': 2,
                      'required': 13,
                      'one-reel': 1,
                      'milestone': 1,
                      'Massachusetts': 9,
                      'wages': 4,
                      'Merchant': 1,
                      'Lord': 4,
                      'Division': 2,
                      'institutionalized': 1,
                      'sticky': 1,
                      'housewives': 1,
                      'bullyboys': 1,
                      'strip': 1,
                      '6th': 1,
                      'hunting': 2,
                      'overran': 1,
                      'run': 20,
                      'discussed': 5,
                      'speedy': 1,
                      'stores': 9,
                      'equity': 2,
                      'veering': 1,
                      'manly': 2,
                      'careful': 3,
                      'Continental': 1,
                      'wooded': 1,
                      'importantly': 2,
                      'fronting': 1,
                      'ricocheted': 1,
                      'unconcerned': 2,
                      'feather': 1,
                      'dimension': 2,
                      "Sande's": 1,
                      'not-less-deadly': 1,
                      'themselves': 26,
                      'complaint': 2,
                      'academic': 14,
                      'dome': 3,
                      'Deputy': 1,
                      'phis': 1,
                      'experiences': 15,
                      'wicker': 1,
                      'Hopedale': 1,
                      'stumps': 1,
                      'reconciliation': 1,
                      'positive': 10,
                      'Santa': 4,
                      'actuality': 3,
                      'Cheyennes': 1,
                      'provoke': 1,
                      'Good': 3,
                      'Howard': 2,
                      'flier': 1,
                      'speaks': 4,
                      'Pincian': 2,
                      'reader': 1,
                      'aviator': 2,
                      'radar-controlled': 3,
                      'entertain': 2,
                      'appended': 1,
                      'pulled': 3,
                      'blacked': 1,
                      'continuous': 3,
                      'framed': 2,
                      'mercy': 2,
                      'sentences': 1,
                      'methods': 15,
                      'visited': 9,
                      'Manufacturers': 1,
                      'loaded': 3,
                      'busiest': 1,
                      'disposed': 4,
                      'Also': 6,
                      'radio': 4,
                      'myriad': 2,
                      'Thiot': 1,
                      'delivery': 2,
                      'engraved': 2,
                      'trees': 4,
                      "mother's": 5,
                      'Japanese': 17,
                      'Daer': 5,
                      'Frederick': 1,
                      'due': 5,
                      'flattening': 1,
                      'Sheldon': 1,
                      'seeming': 1,
                      'shrewdly': 1,
                      'plea': 2,
                      'warmed': 1,
                      'precious': 1,
                      'sixty': 1,
                      'checked': 2,
                      'racketeer': 1,
                      'midwestern': 2,
                      'rejoin': 1,
                      'problem': 25,
                      'freer': 1,
                      'winter': 17,
                      'distance': 10,
                      '5000': 1,
                      'crashing': 2,
                      'percentage': 5,
                      'ever-existent': 1,
                      'Hockett': 1,
                      'Portico': 1,
                      'Trustees': 2,
                      'sameness': 1,
                      'immensity': 1,
                      'reflect': 1,
                      'splattered': 1,
                      'looking': 6,
                      'fourth': 6,
                      'Rivers': 1,
                      'Journal': 1,
                      'Fall': 6,
                      'sixth': 1,
                      'sheik': 4,
                      'Semiramis': 1,
                      'British': 16,
                      'machine-gunned': 1,
                      'seismic': 1,
                      'trading': 10,
                      'moderately': 1,
                      'Time': 4,
                      'past': 24,
                      'humor': 3,
                      'periods': 4,
                      'describing': 3,
                      'fibers': 1,
                      'delegation': 1,
                      'sentimental': 4,
                      'Big': 6,
                      'fostering': 1,
                      'justice': 11,
                      'corroborated': 1,
                      'marketable': 4,
                      'Sibling': 1,
                      'remedy': 6,
                      'pointed': 7,
                      'wounded': 8,
                      'Usually': 5,
                      'boxed': 1,
                      'determination': 3,
                      'evacuation': 2,
                      'slowly': 6,
                      'Caring': 1,
                      'unlocked': 3,
                      'place': 70,
                      'piece': 3,
                      'automobiles': 2,
                      'rode': 5,
                      'truthfulness': 1,
                      'committees': 1,
                      'payroll': 1,
                      'Parks': 1,
                      'bombers': 4,
                      'Flint': 3,
                      'tasted': 1,
                      'Tombigbee': 2,
                      'first': 145,
                      'Steichen': 1,
                      'atmosphere': 5,
                      'teetering': 1,
                      'Bantus': 3,
                      'union': 9,
                      'Saratoga': 1,
                      'urgent': 1,
                      'land': 28,
                      'fade-in': 1,
                      "appearin'": 1,
                      'I.B.M.': 1,
                      'carreer': 1,
                      'blocking': 1,
                      'announcements': 1,
                      'files': 2,
                      'Natural': 1,
                      '1825': 1,
                      'instrument': 5,
                      'pork': 2,
                      'could': 141,
                      'Especially': 1,
                      'dissimilar': 1,
                      'rusty': 1,
                      'conform': 3,
                      'audience': 11,
                      'rich': 9,
                      'tap': 2,
                      'Wilson': 7,
                      "Holt's": 1,
                      'spaciousness': 1,
                      'Patton': 1,
                      'planting': 2,
                      'Gravesend': 1,
                      'Fife': 1,
                      'pride': 2,
                      'hailstorm': 1,
                      'Franklin': 6,
                      'commended': 1,
                      'sequence': 5,
                      'twisting': 1,
                      'Jean-Honore': 1,
                      'Maxentius': 1,
                      'rattle': 1,
                      'Dag': 1,
                      'brute': 2,
                      'mortally': 1,
                      'customs': 4,
                      'berated': 1,
                      'ran': 12,
                      'wagons': 2,
                      'meant': 13,
                      'Monroe': 1,
                      'felt': 17,
                      'cloudburst': 1,
                      'sentenced': 1,
                      'forbids': 1,
                      'runing': 1,
                      'inventory': 1,
                      'unsafe': 1,
                      'picture': 14,
                      'guarding': 2,
                      'sympathetic': 2,
                      'adapted': 5,
                      'Interpretation': 2,
                      'convention': 3,
                      'healthier': 1,
                      'buildings': 8,
                      'Immediately': 3,
                      'Allies': 1,
                      "weeks'": 1,
                      'woodwork': 1,
                      'Arenula': 1,
                      'exempt': 1,
                      'collapsing': 1,
                      '1953': 2,
                      'welding': 1,
                      'intimate': 2,
                      'corrosive': 1,
                      'beam': 1,
                      'cramps': 2,
                      'braces': 2,
                      'eighth': 4,
                      'counselor': 1,
                      'Loy': 1,
                      'modest': 2,
                      'presto': 1,
                      'Frank': 2,
                      'recognize': 7,
                      'improve': 4,
                      'cheetah': 1,
                      'unimpressed': 1,
                      'marked': 11,
                      'alginates': 1,
                      'companions': 1,
                      'higher': 21,
                      'constructively': 1,
                      'Dutch': 6,
                      'deplore': 1,
                      'cabinets': 1,
                      'Security': 1,
                      'Stanley': 1,
                      'nod': 2,
                      'Harvard': 7,
                      'hardships': 1,
                      'firmly': 3,
                      'Nutrition': 1,
                      'recollections': 1,
                      'backed': 2,
                      'skunks': 1,
                      'whizzing': 1,
                      'fruit': 8,
                      'Gunfire': 1,
                      'northerly': 1,
                      'planters': 2,
                      'yelps': 1,
                      'passivity': 1,
                      'triad': 1,
                      'wall': 9,
                      'told': 42,
                      'assassinated': 1,
                      'question': 25,
                      'life-like': 2,
                      'discovery': 8,
                      'civic': 1,
                      'waits': 1,
                      'The': 647,
                      'oratory': 1,
                      'promise': 2,
                      'D.': 9,
                      'non-college': 1,
                      'coincidence': 1,
                      'gases': 1,
                      'Ocean': 1,
                      'candles': 3,
                      'tuxedoed': 1,
                      'of': 3668,
                      'shores': 2,
                      'warms': 1,
                      'confiding': 1,
                      'brashness': 1,
                      '$10,000': 2,
                      'withdrawn': 1,
                      'law': 20,
                      'slugger': 1,
                      'untold': 1,
                      'police': 26,
                      'visiting': 2,
                      'inboard': 1,
                      'virility': 2,
                      'narrative': 7,
                      'rear': 6,
                      "Henry's": 1,
                      'Spokesmen': 1,
                      'metal': 2,
                      'corner': 6,
                      'sworn': 2,
                      'cups': 2,
                      'lived': 12,
                      'Ivies': 2,
                      'protest': 5,
                      'Information': 4,
                      'Anticipating': 1,
                      'self-confidence': 1,
                      'erupt': 2,
                      'resigned': 2,
                      "Knowlton's": 1,
                      'watching': 6,
                      'synagogue': 2,
                      'express': 3,
                      'maid': 3,
                      'Tapley': 1,
                      'numb': 1,
                      'conservationist': 1,
                      'investigating': 1,
                      'facts': 17,
                      'recreation': 2,
                      'Hudson': 34,
                      'unusually': 1,
                      'carts': 3,
                      'assembled': 3,
                      'plump': 1,
                      'disastrous': 4,
                      "suite's": 1,
                      'peculiar': 3,
                      'display': 4,
                      "Burlington's": 1,
                      'Dugan': 2,
                      'policing': 2,
                      'straightening': 2,
                      'linguists': 3,
                      'Outdoor': 1,
                      'homosexual': 1,
                      'factory': 1,
                      "America's": 2,
                      'poisonous': 3,
                      'hung': 2,
                      'arc': 1,
                      'relation': 9,
                      'book': 13,
                      'fragmentary': 1,
                      'Italo-American': 1,
                      'sympathetically': 1,
                      'egocentric': 1,
                      'unrest': 1,
                      'saps': 1,
                      'suggested': 11,
                      'linguistics': 3,
                      'minimal': 2,
                      'fame': 1,
                      'employees': 3,
                      'mentions': 1,
                      'Renaissance': 4,
                      'jacket': 1,
                      'nightly': 1,
                      'Interruptions': 1,
                      'bound': 7,
                      'sullen': 1,
                      'knowingly': 2,
                      'intend': 2,
                      'platted': 1,
                      'old-time': 2,
                      'presents': 3,
                      'post-Inaugural': 1,
                      'purpose': 17,
                      'give': 31,
                      'outgrow': 1,
                      'laundering': 1,
                      'puckering': 1,
                      'fluid': 2,
                      'membership': 6,
                      'surprises': 1,
                      'incompatible': 1,
                      'seismograph': 1,
                      'cheek': 1,
                      'horrors': 2,
                      'sensibilities': 2,
                      'tape': 4,
                      'bushels': 3,
                      'beauty': 9,
                      'question-and-answer': 1,
                      'coop': 1,
                      'trouser': 1,
                      'flatly': 1,
                      'idea': 14,
                      'dealers': 3,
                      "seizin'": 1,
                      'Richard': 2,
                      'directed': 6,
                      'waving': 2,
                      'graybeards': 1,
                      'premature': 1,
                      'aesthetic': 2,
                      'sanction': 1,
                      'Bey': 6,
                      'diverse': 4,
                      'strenuous': 2,
                      'Arrack': 1,
                      'crest': 7,
                      'hawker': 1,
                      'allow': 7,
                      'rag': 3,
                      'critical': 4,
                      'monumentally': 1,
                      'library': 3,
                      'diminishes': 2,
                      'soldier-masters': 1,
                      'cancer': 8,
                      'boost': 1,
                      'wander': 1,
                      'missing': 3,
                      'actually': 26,
                      'flank': 1,
                      'Several': 2,
                      'rocked': 1,
                      'strivings': 1,
                      'explanation': 3,
                      'replied': 9,
                      'juice': 1,
                      'vital': 5,
                      'after': 85,
                      'Thirdly': 1,
                      'questioning': 2,
                      'ward': 4,
                      'Kipling': 1,
                      'madstones': 1,
                      'corrugated': 1,
                      'cornmeal': 1,
                      'response': 5,
                      'barrel': 2,
                      'homemaker': 1,
                      'T.B.': 1,
                      'procurer': 2,
                      'authority': 10,
                      'Point': 1,
                      'contouring': 1,
                      '1844': 3,
                      'confused': 3,
                      'bricklaying': 1,
                      'missile': 2,
                      'schooner': 2,
                      'big': 15,
                      'ill-conceived': 2,
                      'long-term': 1,
                      'quantities': 1,
                      'armament': 1,
                      'inspect': 2,
                      'E': 1,
                      'grenades': 2,
                      'attempted': 1,
                      'completely': 16,
                      'C': 1,
                      'wooden': 4,
                      'cold': 18,
                      'opened': 6,
                      'Valley': 2,
                      'rookies': 1,
                      'stain': 1,
                      'Opportunities': 1,
                      'wary': 1,
                      'twenty-three': 1,
                      'Nathan': 1,
                      'fate': 1,
                      'Holy': 2,
                      'blast': 1,
                      'negociant': 2,
                      'unfortunately': 2,
                      'deck': 2,
                      'lend': 1,
                      'reminding': 1,
                      'Korea': 4,
                      'street': 15,
                      'regarding': 2,
                      'passable': 1,
                      'long': 71,
                      'utmost': 1,
                      'tooth': 12,
                      'current': 4,
                      'page': 5,
                      'brand-new': 1,
                      'locally': 1,
                      'cops': 2,
                      'hookup': 1,
                      'priest': 7,
                      'expectations': 1,
                      'cares': 1,
                      'dislike': 2,
                      'accuse': 2,
                      'town': 18,
                      'fulfilled': 1,
                      'dormitories': 1,
                      'compresses': 1,
                      'now': 84,
                      'peeked': 1,
                      'dramatic': 3,
                      'depositions': 3,
                      'plain-spoken': 1,
                      'nationalism': 7,
                      'commonplace': 3,
                      'rival': 4,
                      'suddenly': 11,
                      'heterogeneous': 3,
                      'De': 4,
                      '694': 1,
                      'gets': 8,
                      "we'll": 1,
                      'flat-bottomed': 1,
                      'inaudible': 1,
                      'casino': 1,
                      'loss': 13,
                      'boat': 5,
                      'three': 57,
                      'geologists': 2,
                      'automatic': 1,
                      'smolders': 1,
                      'cage': 4,
                      'demand': 8,
                      'stock': 12,
                      'Inauguration': 1,
                      'Spring': 3,
                      'Humor': 1,
                      'downed': 1,
                      'half-acceptance': 1,
                      'San': 3,
                      'drafty': 1,
                      'wage': 3,
                      'industrial': 15,
                      'virtue': 2,
                      'organizational': 2,
                      'pure': 3,
                      'organize': 1,
                      'extended': 3,
                      'twenty-nine-foot-wide': 1,
                      'patiently': 1,
                      'College': 21,
                      'Coffee': 1,
                      'odd': 1,
                      'twins': 1,
                      'frowns': 1,
                      'intrigues': 1,
                      'motives': 4,
                      'detective': 3,
                      'continuity': 6,
                      'gag': 1,
                      'forceful': 3,
                      'car': 7,
                      'murders': 9,
                      'rehearsed': 3,
                      'manipulated': 1,
                      'monastic': 1,
                      'shine': 1,
                      'cap': 1,
                      'smokescreen': 1,
                      'A.': 12,
                      'jibes': 1,
                      'Psychotherapy': 1,
                      'faker': 1,
                      'present': 21,
                      'describes': 1,
                      'sonofabitch': 1,
                      'science': 8,
                      'interdependence': 5,
                      'ranch': 4,
                      'carrots': 2,
                      'receives': 4,
                      'be': 570,
                      'wealth': 1,
                      'alternative': 2,
                      'racing': 1,
                      'device': 16,
                      'frolic': 1,
                      'ready': 16,
                      'corpses': 1,
                      'speckles': 1,
                      'pistol-whipped': 1,
                      'Ought': 1,
                      'culture': 13,
                      'rutabagas': 1,
                      'abundance': 2,
                      'shares': 1,
                      'heap': 3,
                      'engaged': 2,
                      'Plastics': 1,
                      'conclusion': 5,
                      'jail': 2,
                      ...}),
             'mystery': Counter({'Without': 3,
                      'vacuum': 1,
                      'lighted': 11,
                      'contented': 1,
                      'laden': 1,
                      'read': 6,
                      'verve': 1,
                      'dig': 1,
                      'high-pitched': 1,
                      'Anthony': 3,
                      'aboard': 1,
                      'anyhow': 4,
                      'speak': 2,
                      'unaware': 1,
                      'located': 2,
                      'plush': 1,
                      'avoiding': 1,
                      'education': 1,
                      'house': 41,
                      'desolate': 2,
                      'promises': 1,
                      'muscles': 3,
                      "She'd": 7,
                      'darling': 1,
                      'burrowing': 1,
                      'registrar': 1,
                      'building': 21,
                      'Adams': 1,
                      'helpless': 3,
                      'forearm': 2,
                      'hoping': 6,
                      'smell': 2,
                      'summoned': 1,
                      'crib': 2,
                      'walked': 31,
                      'hemorrhages': 1,
                      'Seaton': 1,
                      'dock': 2,
                      'use': 13,
                      'worse': 4,
                      'Carruthers': 3,
                      'killed': 12,
                      'experience': 2,
                      'Escape': 1,
                      'blood': 6,
                      'jarred': 1,
                      'sports': 1,
                      'outsmarted': 1,
                      'trip': 2,
                      'typewriter': 3,
                      'has': 17,
                      'Pretty': 5,
                      'thoroughly': 3,
                      'transposition': 1,
                      'pleading': 1,
                      'maintenance': 1,
                      'brushfire': 1,
                      'our': 14,
                      'along': 24,
                      'beau': 1,
                      'suspiciously': 2,
                      'hamburgers': 3,
                      'certain': 9,
                      'wages': 2,
                      'heaved': 1,
                      'sticky': 1,
                      'Gardner': 2,
                      'Beach': 10,
                      'sped': 3,
                      'demanded': 2,
                      'shade': 2,
                      'inspector': 3,
                      'others': 10,
                      'moulding': 1,
                      'half-hour': 2,
                      'shaved': 1,
                      'anyone': 9,
                      'mercy': 1,
                      'grew': 3,
                      'Jail': 1,
                      'groves': 1,
                      'taut': 1,
                      'bellboy': 4,
                      'catch': 3,
                      'sleeve': 1,
                      'nauseated': 1,
                      'beautiful': 3,
                      'Twenty-four': 1,
                      'marriages': 1,
                      'rafters': 1,
                      "Glendora's": 1,
                      'first': 37,
                      'candle': 6,
                      'hallucinating': 1,
                      'refuse': 2,
                      'Forties': 1,
                      'dilapidated': 1,
                      'developed': 1,
                      'Frisco': 2,
                      'lapses': 1,
                      'grease': 1,
                      'brushing': 1,
                      'Tracking': 1,
                      'taverns': 1,
                      'releasing': 1,
                      "supervisor's": 1,
                      'company': 5,
                      'wall': 19,
                      'convey': 1,
                      'chicken': 2,
                      'necktie': 1,
                      'under': 27,
                      'Quebec': 1,
                      'Pontchartrain': 1,
                      'correct': 3,
                      'bitch': 2,
                      'glance': 9,
                      "Joan's": 1,
                      'kitchen': 14,
                      'amount': 3,
                      'Maude': 21,
                      'blunt': 1,
                      'orders': 1,
                      'whorls': 1,
                      'parked': 19,
                      'Timothy': 3,
                      'schoolgirl': 1,
                      'omitted': 1,
                      'Aunt': 1,
                      'pressure': 3,
                      'overrun': 1,
                      'cheek': 2,
                      "you'll": 7,
                      'skylights': 2,
                      'busily': 1,
                      'directed': 1,
                      'anagram': 1,
                      'hear': 14,
                      'way': 61,
                      'long-stemmed': 1,
                      'man': 106,
                      "Ingleside's": 1,
                      'martinis': 1,
                      'unbelievable': 1,
                      "wouldn't": 32,
                      'draft': 1,
                      'hurry': 9,
                      'cheated': 1,
                      'provision': 1,
                      'bang': 2,
                      'through': 57,
                      'Following': 1,
                      'panes': 1,
                      'Ball': 1,
                      'No-o-o': 1,
                      'Just': 8,
                      'precaution': 1,
                      "we'll": 8,
                      'each': 18,
                      'Because': 4,
                      'battle': 1,
                      'That-a-way': 1,
                      'Rossi': 1,
                      'wastrel': 1,
                      'pavement': 2,
                      'Pamela': 1,
                      'stakes': 1,
                      'plain': 1,
                      'forget': 4,
                      'apparently': 7,
                      'burden': 2,
                      'ye': 1,
                      'cottage': 2,
                      'shy': 1,
                      'raised': 9,
                      'fingerprints': 4,
                      'stray': 2,
                      'love': 7,
                      'embraces': 1,
                      'ate': 1,
                      'grave': 1,
                      'mileage': 1,
                      'repeated': 5,
                      'eventually': 2,
                      'apron': 3,
                      'believing': 1,
                      'Trinidad': 1,
                      'suburbs': 1,
                      'situations': 1,
                      "Pam's": 1,
                      'service': 4,
                      'depressed': 2,
                      'Doris': 1,
                      'hard': 15,
                      'fences': 2,
                      'At': 28,
                      "I'm": 53,
                      'focused': 2,
                      'patrolman': 2,
                      'Station': 2,
                      'blustery': 1,
                      'prune': 1,
                      'Take': 5,
                      'None': 3,
                      "moraine's": 1,
                      'hurtled': 1,
                      'baby': 3,
                      'Those': 4,
                      'onlookers': 1,
                      'Nelson': 6,
                      'negligent': 1,
                      'drop': 5,
                      'career': 2,
                      'idly': 1,
                      'husband': 11,
                      'friends': 9,
                      'Quickly': 1,
                      'pulling': 5,
                      'dully': 2,
                      'wiped': 3,
                      'scanned': 1,
                      'slip': 1,
                      'manage': 3,
                      'mud-caked': 1,
                      'Brian': 5,
                      'tenuous': 1,
                      'small': 28,
                      'Miss': 11,
                      'park': 1,
                      'Petey': 1,
                      'red': 7,
                      'reverberated': 1,
                      'expressed': 1,
                      'powerful': 3,
                      'doubtfully': 1,
                      'inferred': 1,
                      'sprawl': 1,
                      'appearance': 1,
                      'forward': 8,
                      'brim': 1,
                      'bins': 1,
                      'decent': 2,
                      'Andy': 29,
                      "Harris'": 1,
                      'undulating': 1,
                      'dispel': 1,
                      'thunder': 3,
                      'peas': 1,
                      'snapped': 1,
                      'next': 29,
                      'Palmer': 1,
                      'p.': 2,
                      'reared': 1,
                      'strike': 1,
                      'screens': 2,
                      'four': 13,
                      'badly': 2,
                      'Dorado': 3,
                      'chomp': 1,
                      'waist': 2,
                      'examined': 3,
                      'advance': 1,
                      'really': 21,
                      'Murder': 1,
                      'broad': 2,
                      'predictable': 1,
                      'external': 1,
                      'scent': 1,
                      'please': 4,
                      'grandfather-father-to-son': 1,
                      'Please': 2,
                      'comforting': 1,
                      'rushed': 1,
                      'efforts': 2,
                      'signals': 2,
                      'effect': 3,
                      'bars': 1,
                      'Pete': 9,
                      'Lanesville': 1,
                      'exact': 2,
                      'clothed': 1,
                      'Anything': 5,
                      'thin': 12,
                      'Dumpty': 1,
                      'returning': 4,
                      'conspiratorial': 1,
                      'King': 7,
                      'family': 7,
                      'swimming': 1,
                      'scrutinizing': 1,
                      'alarmed': 2,
                      'evening': 11,
                      'prisoners': 2,
                      'twelve': 3,
                      'pungent': 1,
                      'heater': 1,
                      'fear': 5,
                      'tip': 4,
                      'Bureau': 2,
                      'unperceived': 1,
                      'wives': 1,
                      'retrieved': 1,
                      'complying': 1,
                      'regretted': 2,
                      'dust': 6,
                      'blaring': 1,
                      'late': 5,
                      'a.m.': 3,
                      '``': 740,
                      'relief': 4,
                      'prettier': 2,
                      'grillework': 1,
                      'ask': 17,
                      'forms': 2,
                      'found': 42,
                      'Reverse': 1,
                      'Forgot': 1,
                      'doubted': 2,
                      'pedals': 1,
                      'adventure': 1,
                      "one's": 1,
                      ':': 27,
                      'pretty': 8,
                      'telescopic': 1,
                      'bell': 1,
                      'mother': 11,
                      'things': 26,
                      'Now': 31,
                      'N-no': 1,
                      'thank': 2,
                      'can': 42,
                      'mailboxes': 4,
                      'negligence': 1,
                      'muddy': 1,
                      'dismal': 1,
                      'divided': 2,
                      'suppose': 9,
                      'reconstruction': 1,
                      'poured': 2,
                      'storehouse': 3,
                      'brown': 4,
                      'cocked': 1,
                      'drawn': 3,
                      'spur': 2,
                      'Reporters': 1,
                      'mauve': 1,
                      'sir': 8,
                      'unscrewed': 1,
                      'commences': 1,
                      'raked': 1,
                      'dialed': 2,
                      'upstairs': 3,
                      "Pops's": 1,
                      'studied': 2,
                      'feelings': 1,
                      'activities': 2,
                      'Eliminating': 1,
                      'staginess': 1,
                      'leave': 26,
                      'York': 8,
                      'signal': 11,
                      'vehicles': 1,
                      'grabbed': 4,
                      'power': 2,
                      'paunchy': 2,
                      'hafta': 1,
                      'yielded': 1,
                      'till': 8,
                      'shoot': 1,
                      'shadow': 8,
                      'prints': 3,
                      'any': 64,
                      'stiffened': 2,
                      'Revere': 1,
                      'stake-out': 3,
                      'bore': 2,
                      'brake': 1,
                      'dying': 2,
                      'observe': 2,
                      'automatically': 3,
                      'season': 1,
                      'gross': 1,
                      'odors': 2,
                      'Road': 4,
                      'deeds': 1,
                      'Joan': 2,
                      'foot': 6,
                      'pausing': 1,
                      'curiously': 3,
                      'drink': 16,
                      'mouthing': 1,
                      'God': 13,
                      'run': 18,
                      'discussed': 3,
                      'poisoned': 1,
                      'merging': 1,
                      'stores': 1,
                      'dedicated': 1,
                      'cobra': 1,
                      'eyebrows': 4,
                      'Regulars': 1,
                      'Sir': 2,
                      'Gibby': 7,
                      'flat': 3,
                      'barest': 1,
                      'unheeding': 1,
                      'uniform': 2,
                      'uncomfortable': 1,
                      'Came': 1,
                      'thousand': 14,
                      'poised': 3,
                      'obsession': 1,
                      'themselves': 8,
                      'guest': 4,
                      'sketch': 1,
                      'send': 10,
                      'slab': 2,
                      'dome': 1,
                      'music': 5,
                      'visit': 1,
                      'fifty-four': 1,
                      'experiences': 1,
                      'moral': 1,
                      'fallen': 3,
                      'fine': 4,
                      'trustees': 1,
                      'positive': 2,
                      'sweetness': 1,
                      'Plymouth': 6,
                      'actuality': 2,
                      'Dammit': 2,
                      'Birdwood': 1,
                      'execution': 1,
                      'terminal': 1,
                      'disappears': 1,
                      'victim': 2,
                      'two-line': 1,
                      'Wait': 7,
                      'so-called': 1,
                      'roof': 8,
                      'progressing': 1,
                      'Prettier': 1,
                      'her': 296,
                      'scenes': 1,
                      'bottom': 5,
                      'pulled': 15,
                      'frizzled': 1,
                      'Carl': 1,
                      'awfully': 1,
                      'setup': 1,
                      'Past': 1,
                      'longer': 11,
                      'sing': 2,
                      'strange': 10,
                      'automobiles': 1,
                      'framed': 1,
                      'Though': 1,
                      'Kee-reist': 1,
                      'domestic': 3,
                      'visited': 1,
                      'scowled': 1,
                      'loaded': 1,
                      'terrified': 1,
                      'steam': 1,
                      'Also': 7,
                      'radio': 5,
                      'coat': 5,
                      'delivery': 1,
                      'ruse': 1,
                      'trees': 4,
                      "mother's": 1,
                      "they're": 6,
                      'Some': 8,
                      'water-filled': 1,
                      'Carmody': 2,
                      'Emotionally': 1,
                      'fastidious': 1,
                      'Oscar': 1,
                      'due': 1,
                      'attainment': 1,
                      'snowed': 2,
                      'Sheldon': 2,
                      'cut': 15,
                      'preamble': 1,
                      'crossed': 5,
                      'establishments': 1,
                      'warmed': 2,
                      'sixty': 2,
                      'checked': 9,
                      'proviso': 1,
                      'serve': 1,
                      'manpower': 1,
                      'broad-brimmed': 1,
                      'butane': 1,
                      'problem': 6,
                      'pulse': 1,
                      'winter': 1,
                      'distance': 7,
                      'Quintana': 1,
                      'continuously': 1,
                      'walking': 5,
                      'commanded': 1,
                      'receive': 1,
                      'floorboards': 2,
                      'informal': 1,
                      'mesmerized': 1,
                      "money's": 1,
                      'looking': 24,
                      'fourth': 2,
                      'denied': 1,
                      'sugar': 4,
                      "Harris's": 1,
                      'thanks': 2,
                      'striking': 1,
                      'British': 2,
                      'loft': 1,
                      'Time': 1,
                      'humor': 1,
                      'Bast': 1,
                      'grooms': 1,
                      'Norberg': 6,
                      'ribs': 4,
                      'hallway': 3,
                      "We're": 6,
                      'Big': 1,
                      'bottle': 20,
                      "lieutenant's": 3,
                      'eying': 1,
                      'Examiner': 1,
                      'overlapped': 1,
                      'scientist': 1,
                      'grunted': 2,
                      'pointed': 6,
                      'Usually': 1,
                      'Expecting': 1,
                      'same': 41,
                      'dragged': 2,
                      'slowly': 11,
                      'unlocked': 6,
                      'place': 33,
                      'piece': 5,
                      'eerily': 1,
                      'gasping': 2,
                      'rode': 1,
                      'Parks': 1,
                      'tact': 1,
                      'tasted': 3,
                      'Club': 3,
                      'aroma': 1,
                      "What's": 5,
                      'drive-in': 2,
                      'apprehension': 1,
                      'Alison': 1,
                      'coldly': 2,
                      'urgent': 1,
                      'land': 2,
                      'growled': 3,
                      'Much': 2,
                      'therefore': 3,
                      'kicking': 1,
                      'fifty-five': 1,
                      'files': 4,
                      'emptier': 1,
                      'wires': 1,
                      'swamp': 1,
                      'could': 141,
                      'enjoying': 2,
                      'turkey': 2,
                      'audience': 6,
                      '275': 1,
                      'slippers': 2,
                      'stopped': 16,
                      'by-product': 1,
                      'admit': 4,
                      'ribbing': 1,
                      'delusion': 1,
                      'armpit': 1,
                      'registrations': 1,
                      'sequence': 1,
                      'okay': 3,
                      'Lila': 2,
                      'ointment': 1,
                      'wise': 2,
                      'search': 4,
                      'delinquency': 1,
                      'optimism': 1,
                      'interested': 5,
                      'wagged': 1,
                      'meant': 9,
                      'reek': 1,
                      'intent': 1,
                      'tiny': 5,
                      'seepage': 1,
                      'Jerry': 4,
                      'Dumont': 8,
                      'picture': 14,
                      'doomed': 1,
                      "shouldn't": 1,
                      'bake-oven': 1,
                      'conceive': 1,
                      'full': 12,
                      'performance': 2,
                      'count': 2,
                      'stairway': 1,
                      'slipped': 5,
                      'equally': 1,
                      'buildings': 6,
                      'between': 26,
                      'cost': 4,
                      'blonde': 1,
                      'widowed': 1,
                      'Chicagoans': 1,
                      'Gold': 1,
                      'Kidnapper': 1,
                      'beam': 1,
                      'report': 8,
                      'ring': 1,
                      'co-workers': 1,
                      'Espanol': 1,
                      'appear': 1,
                      'metal': 7,
                      'fine-boned': 1,
                      'Roman': 1,
                      'recognize': 1,
                      'improve': 1,
                      'volley': 1,
                      'marked': 5,
                      'Anderson': 2,
                      'wagon': 10,
                      'highboy': 1,
                      'active': 3,
                      'moves': 2,
                      'higher': 2,
                      'rousing': 1,
                      'deviate': 1,
                      'cabinets': 1,
                      'eyes': 48,
                      'Security': 1,
                      'Chapter': 1,
                      'filled': 5,
                      'energy': 2,
                      'Stanley': 26,
                      'nod': 1,
                      'firmly': 2,
                      'Until': 1,
                      'idiosyncrasies': 1,
                      'Soak': 1,
                      'gallstone': 1,
                      'backed': 8,
                      'heredity': 1,
                      'bedsprings': 1,
                      'Domokous': 2,
                      'sorrows': 1,
                      'fruit': 1,
                      'cameras': 1,
                      'Acourse': 1,
                      'chill': 1,
                      'nowhere': 1,
                      'dictating': 1,
                      'brakes': 2,
                      'Diets': 1,
                      'they': 106,
                      'done': 11,
                      'distinctly': 2,
                      'wiping': 1,
                      'instructed': 2,
                      'crests': 1,
                      'banks': 1,
                      'may': 13,
                      'The': 244,
                      'casual': 2,
                      'promise': 1,
                      'downstairs': 3,
                      'coincidence': 2,
                      'unjustified': 1,
                      'Boston': 1,
                      'specialist': 1,
                      'of': 903,
                      'psychopathic': 2,
                      'noisily': 1,
                      'bowl': 1,
                      'excruciating': 1,
                      'chickens': 3,
                      'trays': 1,
                      'law': 4,
                      'trickling': 1,
                      'trapped': 2,
                      'autos': 1,
                      'nearing': 1,
                      'stained': 2,
                      'Larry': 2,
                      'defense': 1,
                      'quarter': 3,
                      'appearances': 1,
                      'superstitious': 1,
                      'guess': 11,
                      'punishes': 1,
                      'Beauclerk': 8,
                      'rear': 4,
                      'bloody': 1,
                      'Kinda': 1,
                      'Croydon': 1,
                      'possum': 1,
                      'noticeable': 1,
                      "landlord's": 1,
                      'felt': 40,
                      'awkwardly': 1,
                      'beaten': 1,
                      'separate': 2,
                      'lived': 15,
                      'climbed': 8,
                      'Deal': 1,
                      'disliking': 1,
                      'seized': 3,
                      'vet': 1,
                      'mingled': 1,
                      'performer': 1,
                      'lack': 4,
                      'reprehensible': 1,
                      'Alfredo': 2,
                      'mentioned': 3,
                      'malice': 1,
                      "ladies'": 1,
                      'enviously': 1,
                      'maid': 2,
                      'Hall-Mills': 1,
                      'hate': 2,
                      'landlord': 8,
                      'bride': 7,
                      'enjoyed': 2,
                      'musical': 1,
                      'cetera': 1,
                      'assembled': 1,
                      'L.': 1,
                      'impulse': 1,
                      'crashed': 1,
                      'peculiar': 3,
                      'display': 3,
                      'too': 72,
                      'advisability': 1,
                      'advancing': 1,
                      'Supplies': 1,
                      'Theater': 1,
                      "Burton's": 2,
                      'confirmed': 1,
                      'sneaking': 2,
                      'entry': 1,
                      'overhead': 1,
                      'velour': 1,
                      'outside': 19,
                      'more-than-average': 1,
                      'hung': 12,
                      'arc': 1,
                      'faces': 8,
                      'language': 1,
                      'book': 3,
                      'jaws': 1,
                      'stains': 1,
                      'somewhat': 3,
                      'slightly': 5,
                      'screwball': 1,
                      'following': 3,
                      'suggested': 4,
                      'crack': 4,
                      'sompin': 1,
                      'event': 2,
                      'unlacing': 1,
                      'Washington': 1,
                      'cohesion': 1,
                      'instantly': 3,
                      'recommendation': 1,
                      'jacket': 7,
                      'Seeing': 2,
                      'how': 37,
                      'sullen': 2,
                      'heartbeat': 2,
                      'occupants': 3,
                      'ticket': 3,
                      'above': 6,
                      'hyped-up': 1,
                      'equidistant': 1,
                      'volumes': 3,
                      'LaSalle': 1,
                      'give': 23,
                      'clumsy': 2,
                      'clamped': 2,
                      'implications': 1,
                      'Richards': 2,
                      'Mister': 1,
                      'upset': 2,
                      'seeing': 6,
                      'beauty': 2,
                      'odor': 3,
                      'aunts': 1,
                      'minute': 10,
                      'exuded': 1,
                      'Payne': 8,
                      'blared': 1,
                      'blushing': 1,
                      'minister': 12,
                      'glove': 1,
                      'tunnel': 6,
                      'bothering': 2,
                      'tire': 1,
                      'thermometer': 1,
                      'gradually': 1,
                      'waving': 3,
                      'nakedly': 1,
                      'overnighters': 3,
                      'convenience': 3,
                      'allow': 2,
                      'rag': 2,
                      'drinks': 6,
                      'library': 1,
                      'payments': 1,
                      'Aj': 2,
                      'identified': 2,
                      'staggeringly': 1,
                      'gauged': 1,
                      'racking': 1,
                      'wander': 1,
                      'ounce': 1,
                      'actually': 4,
                      'piteous': 1,
                      'advice': 3,
                      'tightly': 2,
                      'Angelo': 5,
                      "sheriff's": 2,
                      'explanation': 1,
                      'Jeep': 9,
                      'juice': 2,
                      'preserve': 2,
                      'after': 36,
                      'heightened': 1,
                      'questioning': 2,
                      'thought': 54,
                      'camera': 1,
                      'ward': 3,
                      'reached': 16,
                      'Very': 2,
                      'anyway': 9,
                      'Fox': 2,
                      'cornmeal': 1,
                      'on': 421,
                      'response': 1,
                      'lawyer': 3,
                      'conviction': 1,
                      'convincing': 1,
                      'cousin': 1,
                      'Point': 3,
                      'Our': 3,
                      'Perhaps': 5,
                      'confused': 2,
                      'morosely': 1,
                      'Shall': 3,
                      'schooner': 1,
                      'quarry': 2,
                      'big': 32,
                      'plant': 1,
                      'earnestly': 2,
                      'fetid': 1,
                      'salvaging': 1,
                      'clearly': 3,
                      'attempted': 1,
                      'completely': 7,
                      'interrupted': 5,
                      'wooden': 3,
                      'cold': 14,
                      'freak': 2,
                      'punk': 1,
                      'financial': 2,
                      'opened': 20,
                      'still': 66,
                      'left': 56,
                      'hats': 2,
                      'normally': 1,
                      'bellows': 1,
                      'chaise': 1,
                      'utterly': 3,
                      'hip': 2,
                      'one-way': 1,
                      'twenty-three': 2,
                      'mere': 1,
                      "Dronk's": 1,
                      'warm': 1,
                      'connecting': 1,
                      'proclaim': 1,
                      'stood': 41,
                      'worst': 2,
                      'electronic': 4,
                      'Ferrell': 3,
                      'once-over': 1,
                      'yuh': 1,
                      'Demanded': 1,
                      'drinking': 6,
                      'impact': 2,
                      'alias': 1,
                      'skull': 1,
                      'glasses': 2,
                      'street': 36,
                      'conscientious': 1,
                      'grip': 2,
                      'orbiting': 1,
                      'long': 39,
                      'hemmed': 1,
                      'utmost': 1,
                      'sweaty': 1,
                      'alimony': 1,
                      'icicle': 1,
                      'slob': 1,
                      'worth': 5,
                      'cops': 13,
                      'destination': 1,
                      'bounded': 1,
                      'easiest': 1,
                      'wary': 1,
                      'Service': 1,
                      'got': 77,
                      'regulation': 1,
                      'town': 8,
                      'now': 83,
                      'wrong': 9,
                      'Drink': 1,
                      'azaleas': 1,
                      'smart': 3,
                      'delicious': 1,
                      'lady': 3,
                      'blowing': 1,
                      'cab': 4,
                      'suddenly': 19,
                      'transoms': 1,
                      'gets': 6,
                      'McIver': 3,
                      'specific': 1,
                      'torn': 2,
                      'bandaged': 2,
                      'lay': 13,
                      'enclosed': 1,
                      'three': 33,
                      'Anyone': 1,
                      "Day's": 1,
                      'figured': 5,
                      'shut': 10,
                      'consulted': 2,
                      'demand': 1,
                      'powder': 3,
                      'stock': 3,
                      'Reckon': 1,
                      'Lindsay': 1,
                      'Kimball': 1,
                      'rooms': 8,
                      'serpents': 1,
                      'San': 5,
                      'Concord': 1,
                      'clearer': 1,
                      'Las': 3,
                      'Scholarship': 1,
                      'extended': 3,
                      'taunt': 1,
                      'emphatically': 1,
                      'Added': 1,
                      'choice': 1,
                      'questioned': 1,
                      'one-thirty': 2,
                      'odd': 3,
                      'gunning': 1,
                      'stretched': 8,
                      'suave': 1,
                      'joy': 2,
                      'detective': 3,
                      'Comfortably': 1,
                      'hole': 1,
                      'tails': 3,
                      'zipper': 1,
                      'sidled': 1,
                      'fluttered': 1,
                      'car': 69,
                      'eerie': 1,
                      'west': 5,
                      'personal': 2,
                      'cap': 1,
                      'seriously': 3,
                      'discovered': 2,
                      'sigue': 1,
                      'afternoons': 1,
                      'salad': 1,
                      'present': 5,
                      'galley': 1,
                      'necessary': 3,
                      'hit': 16,
                      'treasure': 1,
                      'be': 233,
                      'depressing': 1,
                      'must': 30,
                      'racing': 2,
                      'names': 9,
                      'helpful': 1,
                      'ready': 9,
                      'faintest': 2,
                      'corpses': 1,
                      ...}),
             'news': Counter({'Without': 4,
                      'festival': 2,
                      '5-run': 2,
                      'Motorists': 1,
                      'builders': 7,
                      'taxing': 1,
                      'More': 12,
                      'tribunals': 1,
                      '23': 9,
                      'Anthony': 1,
                      'assent': 1,
                      'speak': 3,
                      'located': 6,
                      'economics': 3,
                      'contented': 1,
                      'education': 31,
                      'house': 26,
                      'promises': 3,
                      'skilled': 1,
                      'conspirators': 2,
                      'officiating': 1,
                      'overdeveloped': 1,
                      'Gerald': 3,
                      'tube': 1,
                      'Wilmette': 3,
                      'building': 15,
                      'Adams': 1,
                      'Exploratory': 1,
                      'waters': 4,
                      'Arvey': 1,
                      'ex-National': 1,
                      'Minnie': 2,
                      'Between': 1,
                      'shouldda': 1,
                      'threshold': 3,
                      'junction': 1,
                      'restrain': 1,
                      'operating': 4,
                      'drafts': 2,
                      'LSU': 1,
                      'use': 31,
                      "bridegroom's": 2,
                      'authoritative': 1,
                      'thoroughfare': 1,
                      'Olivia': 1,
                      'experience': 6,
                      'endure': 2,
                      'served': 17,
                      'committee': 38,
                      'Norm': 2,
                      'bars': 1,
                      'sports': 6,
                      'Feis': 1,
                      'population': 13,
                      'Sr.': 1,
                      "Baltimore's": 2,
                      'Mickey': 12,
                      'thoroughly': 1,
                      'nomination': 6,
                      'Omsk': 1,
                      'views': 7,
                      'Much': 2,
                      'plunder': 1,
                      'forum': 1,
                      "Mayor's": 2,
                      'abrupt': 1,
                      'certain': 18,
                      'season': 43,
                      'Gardner': 1,
                      'Coopers': 1,
                      'Melcher': 1,
                      'glutted': 1,
                      'Continental': 7,
                      'shade': 1,
                      'others': 15,
                      'budgets': 2,
                      'Durwood': 1,
                      'vicinity': 1,
                      'manufacturers': 10,
                      'Patrolmen': 1,
                      'Polls': 1,
                      '6.5': 1,
                      'Callan': 1,
                      'Zoe': 1,
                      'McN.': 1,
                      'Milwaukee': 4,
                      'beautiful': 5,
                      'gesture': 2,
                      'intimidation': 1,
                      'sixth': 10,
                      'crypt': 1,
                      'august': 1,
                      'marriages': 4,
                      'head-and-shoulders': 1,
                      'all-important': 1,
                      'self-respect': 1,
                      'Couturier': 1,
                      'denying': 1,
                      'Schrunk': 1,
                      'handy': 1,
                      'Reputedly': 1,
                      'cheer': 2,
                      'retains': 1,
                      "countin'": 1,
                      'imposition': 1,
                      'believes': 8,
                      'Stevens': 4,
                      'constituents': 1,
                      'chic': 4,
                      'chicken': 4,
                      'determine': 3,
                      'drab': 1,
                      'barrage': 1,
                      'Loen': 1,
                      'goaded': 1,
                      'pitchers': 4,
                      'correct': 3,
                      'corporation': 7,
                      'kitchen': 5,
                      'amount': 18,
                      '$7,500,000': 1,
                      'crack': 2,
                      'omitted': 1,
                      'Called': 1,
                      'pressure': 12,
                      'Md.': 2,
                      'Chinese': 2,
                      '2-score-year': 1,
                      'darling': 1,
                      '22-acre': 1,
                      'amended': 1,
                      'adviser': 3,
                      'perturbed': 1,
                      'won-lost': 1,
                      "wouldn't": 2,
                      'scholastic': 2,
                      'hard-hit': 1,
                      'scratching': 1,
                      'through': 54,
                      'classical': 3,
                      'Dallas-based': 1,
                      'halftime': 1,
                      'nationalism': 2,
                      'Just': 7,
                      'specific': 4,
                      'each': 61,
                      'Because': 4,
                      'Plus': 2,
                      'Ah': 2,
                      'Justice': 1,
                      '13-1/2': 1,
                      'tons': 1,
                      'Dennis': 1,
                      'stakes': 1,
                      'menaced': 1,
                      'Walters': 1,
                      'France': 7,
                      'inviolate': 1,
                      'ringsiders': 1,
                      'ceased': 1,
                      "yesterday's": 1,
                      'Democrats': 11,
                      'Pitcher': 1,
                      'cottage': 1,
                      'raised': 8,
                      'Todd': 1,
                      'place-kicker': 1,
                      'stream': 3,
                      'revealed': 1,
                      'Ernst': 1,
                      'enunciate': 1,
                      'grave': 2,
                      'Sunnyvale': 3,
                      'believing': 2,
                      'keyhole': 1,
                      'second-degree': 2,
                      'Hinsdale': 1,
                      'situations': 2,
                      'summoned': 4,
                      'blackout': 1,
                      'Doris': 1,
                      'At': 38,
                      'roundup': 1,
                      "I'm": 18,
                      'deodorant': 2,
                      'focused': 1,
                      'patrolman': 1,
                      '$17': 1,
                      'colleagues': 1,
                      'balconies': 1,
                      'acquittal': 2,
                      'scoop': 1,
                      'Those': 8,
                      'adoption': 2,
                      'rejection': 3,
                      'CBS': 1,
                      'Buddy': 1,
                      'Nelson': 4,
                      'Manville': 1,
                      'career': 10,
                      'baggage': 2,
                      'husband': 13,
                      'friends': 11,
                      'bet': 2,
                      'shouting': 1,
                      'worse': 6,
                      "Hallowell's": 1,
                      'slip': 1,
                      'negative': 2,
                      'Suddenly': 1,
                      'Afterwards': 1,
                      'killed': 5,
                      'small': 27,
                      'Television-Electronics': 1,
                      'bull': 1,
                      'Ruling': 1,
                      'enemy': 3,
                      'continue': 11,
                      'Equator': 1,
                      'understatement': 1,
                      'quo': 1,
                      'Peking': 1,
                      'Hamey': 1,
                      'neckline': 2,
                      'recession': 3,
                      'Worth': 3,
                      'missiles': 3,
                      'recently': 11,
                      'tournament': 13,
                      'enforced': 3,
                      'steamship': 1,
                      'legislation': 14,
                      'next': 40,
                      'Ghormley': 1,
                      'propeller': 1,
                      'sly': 1,
                      'reared': 1,
                      'disagreed': 2,
                      'strike': 12,
                      'four': 73,
                      'Larson': 5,
                      'Marines': 1,
                      'really': 13,
                      'peoples': 1,
                      'singled': 6,
                      'Foliage': 1,
                      'inflicted': 1,
                      'ideology': 1,
                      'comforting': 1,
                      'rushed': 2,
                      'borrowed': 2,
                      'uncorked': 1,
                      'signals': 1,
                      'effect': 17,
                      'senator': 3,
                      'Pete': 2,
                      'exact': 1,
                      'NBC': 2,
                      'tulle': 1,
                      'whites': 2,
                      "Pirates'": 1,
                      "Lenobel's": 1,
                      '187.5': 1,
                      'smoother': 1,
                      'cell': 2,
                      'accounted': 2,
                      'King': 3,
                      'vice-chairman': 1,
                      'artists': 5,
                      'brokers': 2,
                      'lag': 1,
                      '17,000': 1,
                      "Berry's": 1,
                      'swearing-in': 1,
                      'spending': 9,
                      'confronting': 2,
                      'Amateur': 1,
                      'prisoners': 1,
                      'probation': 6,
                      'twelve': 1,
                      'Parmer': 1,
                      'Joseph': 18,
                      'Culbertson': 1,
                      'tip': 1,
                      'Bureau': 2,
                      'daylight': 1,
                      'Carmichael': 1,
                      'colts': 1,
                      'Camilla': 1,
                      'clerk': 4,
                      'Nori': 1,
                      'regretted': 1,
                      '675': 1,
                      'Michaels': 1,
                      '687.87': 1,
                      'assets': 1,
                      'intends': 1,
                      'ask': 7,
                      'found': 30,
                      'hope': 11,
                      'writes': 1,
                      'win': 10,
                      'statutory': 2,
                      'schedule': 9,
                      'dug': 2,
                      'scholarships': 4,
                      'Haynes': 1,
                      'mother': 15,
                      'Galt': 1,
                      'Jorge': 1,
                      'things': 10,
                      'Now': 11,
                      'thank': 1,
                      'Huntingtons': 1,
                      'shortcuts': 1,
                      'alienated': 1,
                      'Ponce': 1,
                      'department': 12,
                      'colorful': 1,
                      'independent': 3,
                      'February': 10,
                      'chow': 1,
                      '3-0': 1,
                      'exported': 2,
                      '4:05': 1,
                      'yardstick': 1,
                      'permission': 2,
                      'divided': 4,
                      'reconstruction': 1,
                      'Ky.': 2,
                      '18': 20,
                      'Grove': 4,
                      'inaugural': 1,
                      'stops': 2,
                      "fund's": 3,
                      'sir': 1,
                      'bat': 6,
                      'boosting': 1,
                      'Fra': 1,
                      'investigations': 1,
                      'advisers': 7,
                      'one-third': 1,
                      'brethren': 1,
                      'pricing': 3,
                      'snack': 1,
                      'fairway': 5,
                      'constituted': 4,
                      'our': 55,
                      'Wright': 1,
                      'German': 8,
                      'Molly': 2,
                      'Nairne': 2,
                      'Bassi': 1,
                      'Welfare': 3,
                      'neighbor': 4,
                      'submarines': 5,
                      'signal': 1,
                      'vehicles': 2,
                      'battalions': 1,
                      '1959': 21,
                      'grabbed': 2,
                      'musicians': 3,
                      'managing': 2,
                      'McEachern': 1,
                      "county's": 1,
                      'taper': 1,
                      'Owls': 1,
                      'Attendance': 1,
                      'deductible': 2,
                      'ordinarily': 1,
                      'Latinovich': 1,
                      'Cox': 4,
                      'outset': 1,
                      'Gomez': 1,
                      'required': 11,
                      'Cadillac': 1,
                      'milestone': 2,
                      'Crumlish': 1,
                      'wages': 11,
                      'surpassed': 1,
                      "players'": 1,
                      'candid': 1,
                      'Lord': 2,
                      'Division': 5,
                      'chicanery': 1,
                      'crews': 1,
                      '1961': 42,
                      'strip': 4,
                      '6th': 1,
                      'hunting': 2,
                      'Joplin': 1,
                      'resource': 1,
                      'run': 35,
                      'discussed': 6,
                      'speedy': 1,
                      'stores': 4,
                      'I.': 7,
                      'Moments': 1,
                      'demanded': 3,
                      'breathes': 1,
                      'effected': 1,
                      'Evans': 1,
                      'grandchildren': 2,
                      'Cocktails': 2,
                      'Opposition': 1,
                      '72-hole': 1,
                      'Shiflett': 2,
                      'themselves': 19,
                      'developing': 4,
                      'academic': 10,
                      'Luette': 1,
                      'Deputy': 2,
                      "Wednesday's": 1,
                      'Supervisor': 1,
                      'Vernava': 3,
                      'eligible': 4,
                      'pecan': 1,
                      'capitalist': 1,
                      'positive': 2,
                      'Santa': 6,
                      'parole': 4,
                      'drunk': 1,
                      'Herridge': 1,
                      'Howard': 13,
                      'newsletter': 1,
                      'Ted': 4,
                      'elegant': 2,
                      'reader': 1,
                      'thorny': 1,
                      'Maris': 36,
                      'Second': 5,
                      'Stamford': 2,
                      'sophomore': 3,
                      'pulled': 3,
                      'blacked': 1,
                      'background': 3,
                      'continuous': 1,
                      'naughtier': 1,
                      'Taxation': 1,
                      'transition': 3,
                      'sentences': 1,
                      'methods': 10,
                      'visited': 4,
                      'loaded': 4,
                      '$5': 2,
                      'visa': 1,
                      'disposed': 1,
                      'Also': 9,
                      'radio': 6,
                      'coat': 2,
                      'Agencies': 1,
                      'delivery': 1,
                      'vindication': 2,
                      'trees': 2,
                      'melee': 2,
                      'Elmer': 1,
                      "they're": 5,
                      'Cotty': 1,
                      'brochures': 1,
                      'Oscar': 2,
                      'kickoff': 2,
                      'exchange': 5,
                      'Frederick': 3,
                      'due': 9,
                      'Fletcher': 1,
                      'congratulated': 2,
                      'Sheldon': 4,
                      'Turnpike': 6,
                      'plea': 3,
                      'warmed': 3,
                      'checked': 2,
                      'problem': 39,
                      'freer': 1,
                      'Morocco': 2,
                      'pulse': 1,
                      'modification': 1,
                      'Summer': 1,
                      'winter': 8,
                      'distance': 2,
                      '5000': 1,
                      'percentage': 4,
                      'receive': 15,
                      'Gauer': 2,
                      'Trustees': 2,
                      'Mortgage': 1,
                      'specialize': 1,
                      'notarized': 1,
                      'informal': 4,
                      'Collins': 4,
                      'looking': 7,
                      '37,679': 1,
                      'catchers': 1,
                      'picker': 1,
                      'British': 24,
                      '255': 1,
                      'Operating': 2,
                      'Time': 6,
                      'humor': 4,
                      "John's": 1,
                      'heyday': 1,
                      'Hat': 1,
                      'Achaeans': 1,
                      'delegation': 3,
                      'Briar': 1,
                      "We're": 3,
                      'collected': 6,
                      'Samoa': 1,
                      'Big': 4,
                      'attract': 2,
                      'Kimpton': 4,
                      'ranked': 2,
                      'Birgit': 1,
                      'pointed': 9,
                      'guiding': 1,
                      'Usually': 1,
                      'determination': 3,
                      'amendment': 5,
                      'evacuation': 1,
                      'slowly': 3,
                      'place': 25,
                      'piece': 3,
                      'Korman': 1,
                      'garrulous': 1,
                      'moldboard': 1,
                      'Kelly': 1,
                      "Princes'": 1,
                      'committees': 4,
                      'payroll': 13,
                      'bombers': 1,
                      'underestimate': 1,
                      'first': 143,
                      'Stetsons': 1,
                      'atmosphere': 5,
                      'union': 21,
                      'urgent': 3,
                      'track': 10,
                      'ex-gambler': 1,
                      'Salvatore': 1,
                      'gulf': 1,
                      'inflamed': 1,
                      '1859': 1,
                      'textile-producing': 2,
                      'announcements': 2,
                      'Natural': 1,
                      '1825': 1,
                      'instrument': 1,
                      'pork': 1,
                      'could': 86,
                      'Lawrence': 12,
                      'teenagers': 3,
                      'Federal': 17,
                      'gruonded': 1,
                      'slippers': 1,
                      'stopped': 6,
                      'Wilson': 6,
                      'aloud': 1,
                      'elaborate': 3,
                      'Revolutionary': 1,
                      'Along': 3,
                      'pride': 3,
                      'Soule': 1,
                      'Franklin': 8,
                      'twisting': 1,
                      'Neiman-Marcus': 4,
                      'Kanin': 1,
                      '13th': 5,
                      '111': 1,
                      'increasingly': 2,
                      'inflexible': 1,
                      'Ind.': 1,
                      'search': 3,
                      'infiltrating': 1,
                      'reflexes': 2,
                      'delinquency': 1,
                      'wagons': 1,
                      'optimism': 2,
                      'meant': 3,
                      'Monroe': 5,
                      'morals': 1,
                      'ivory': 1,
                      'luncheons': 1,
                      'forbids': 1,
                      'Successful': 1,
                      'businesses': 2,
                      'Dumont': 2,
                      'Petipa-Tschaikowsky': 1,
                      'picture': 7,
                      'cuisine': 1,
                      'guarding': 1,
                      'A.M.': 2,
                      'sympathetic': 1,
                      'unemployment': 1,
                      'Vital': 1,
                      'laurels': 1,
                      'Yogi': 2,
                      'convention': 2,
                      'Bourguiba': 1,
                      'buildings': 6,
                      'between': 47,
                      "Gannon's": 1,
                      '2269': 1,
                      'cost': 15,
                      'exempt': 2,
                      '1953': 5,
                      'R-Warren': 1,
                      'nailed': 1,
                      'Inter-American': 1,
                      'Retail': 1,
                      'outmoded': 1,
                      'eighth': 2,
                      'nine-game': 1,
                      'costlier': 1,
                      'masterpiece': 2,
                      'recognize': 1,
                      'improve': 4,
                      'lefthander': 2,
                      'feeds': 1,
                      'marked': 4,
                      'Anderson': 8,
                      '$15': 1,
                      'Pye': 1,
                      'higher': 26,
                      'lovely': 2,
                      'Dutch': 1,
                      'Boonton': 1,
                      'fast-grossing': 1,
                      'Cezannes': 1,
                      'Menderes': 2,
                      'golfing': 1,
                      'Security': 4,
                      'foliage': 1,
                      'Alpha': 1,
                      'Stanley': 2,
                      'residences': 1,
                      'penalized': 1,
                      'firmly': 5,
                      'Until': 5,
                      'marketing': 3,
                      'backed': 1,
                      'handicapped': 1,
                      'Achievement': 7,
                      'bateau': 2,
                      'Bay-front': 1,
                      'Neuberger': 1,
                      'Harveys': 1,
                      'Slate': 5,
                      'Golden': 3,
                      "carpenters'": 1,
                      'endowments': 2,
                      "union's": 2,
                      'hospitable': 1,
                      'dismissed': 1,
                      "writers'": 1,
                      'Campagnoli': 1,
                      'told': 53,
                      'Alice': 3,
                      'distinctly': 1,
                      'wiping': 1,
                      'Transportation': 1,
                      'monopolistic': 2,
                      'The': 806,
                      'promise': 3,
                      'D.': 24,
                      '5155': 1,
                      'Hodges': 7,
                      'sportsman': 1,
                      'Boy': 1,
                      'Ocean': 1,
                      'merited': 1,
                      'of': 2849,
                      'under': 83,
                      'Gateway': 1,
                      'court-appointed': 1,
                      '7.5': 1,
                      'celebrated': 2,
                      '47': 4,
                      '$10,000': 5,
                      'withdrawn': 1,
                      'law': 38,
                      'spiritual': 1,
                      'slugger': 3,
                      'bankers': 8,
                      'Innumerable': 1,
                      '1920s': 1,
                      'untold': 1,
                      'Stalling': 1,
                      'nearing': 4,
                      'forgetting': 1,
                      '553': 1,
                      'police': 31,
                      'Snodgrass': 1,
                      'inboard': 1,
                      "Angeles'": 2,
                      '155-yarder': 1,
                      'rear': 4,
                      'ova': 1,
                      'bustling': 1,
                      'metal': 1,
                      "Brooks's": 1,
                      'refocusing': 1,
                      'industrialist': 1,
                      'lived': 3,
                      'success': 4,
                      'wars': 2,
                      'resisting': 1,
                      'Sanger-Harris': 1,
                      'Eastland': 1,
                      'SMU': 5,
                      'murdered': 1,
                      'Sheets': 4,
                      'Brod': 1,
                      'Rylie': 1,
                      "ladies'": 1,
                      'maid': 3,
                      'low': 7,
                      "Beadles'": 1,
                      'Hudson': 2,
                      'unusually': 2,
                      'retaliating': 1,
                      'assembled': 1,
                      'announcing': 2,
                      'crabapple': 1,
                      'recovery': 9,
                      'peculiar': 2,
                      'display': 4,
                      'Cover': 1,
                      'Dearborn': 1,
                      'Fatima': 1,
                      'well': 43,
                      'Siberia': 2,
                      'orders': 7,
                      'bulwark': 1,
                      'Baltimorean': 1,
                      "Cotten's": 1,
                      'lucky': 1,
                      'factory': 3,
                      'entry': 2,
                      'overhead': 2,
                      'Dame': 5,
                      'outside': 10,
                      'Berger': 5,
                      '1630': 1,
                      'arc': 1,
                      '355': 1,
                      'book': 13,
                      'subscribe': 1,
                      'recommends': 1,
                      '101b': 1,
                      'Eire': 1,
                      'unrest': 1,
                      'planes': 2,
                      'suggested': 8,
                      'Basic': 1,
                      'begging': 1,
                      'Grover': 4,
                      "Grizzlies'": 1,
                      'employees': 11,
                      'mentions': 1,
                      'Notre': 5,
                      'Renaissance': 1,
                      'arrive': 5,
                      'jacket': 2,
                      'nightly': 1,
                      'bridesmaids': 1,
                      'intend': 3,
                      'pars': 3,
                      'presents': 1,
                      'Engaging': 2,
                      'purpose': 9,
                      'broadened': 1,
                      'give': 32,
                      'deferred': 1,
                      'absolute': 1,
                      'Hartman': 4,
                      'membership': 5,
                      'monopolies': 3,
                      'spotlights': 1,
                      'Richards': 1,
                      'Funeral': 5,
                      'dishes': 1,
                      'housing': 14,
                      'beauty': 4,
                      'colonialist': 1,
                      'battery': 1,
                      'Bellows': 5,
                      'Stetson': 1,
                      'Beatrice': 1,
                      'flatly': 1,
                      'glove': 4,
                      'blond': 2,
                      'Richard': 27,
                      'semester': 3,
                      'Branum': 1,
                      '18,792': 1,
                      '1966': 3,
                      'dynamic': 4,
                      'Decries': 1,
                      'applying': 1,
                      'allow': 5,
                      'critical': 6,
                      'Superior': 5,
                      'correctness': 1,
                      'Werner': 5,
                      '3505o': 1,
                      'Aj': 14,
                      'hoodlums': 2,
                      'boost': 6,
                      "Dresbachs'": 1,
                      'missing': 3,
                      'actually': 10,
                      'Several': 5,
                      'economist': 3,
                      'explanation': 3,
                      'replied': 5,
                      'juice': 1,
                      'Billy': 3,
                      'after': 127,
                      'Sees': 1,
                      'homers': 3,
                      'Hewlett-Woodmere': 1,
                      'questioning': 1,
                      'assailant': 1,
                      'ward': 7,
                      'reached': 15,
                      'anyway': 3,
                      'Balkanizing': 1,
                      '469': 1,
                      'response': 5,
                      'Japan': 3,
                      'bargain': 1,
                      'compared': 6,
                      'authority': 8,
                      'Point': 3,
                      'Perhaps': 3,
                      'grace': 2,
                      'jury-tampering': 1,
                      'Audubon': 3,
                      'troopships': 1,
                      'football': 14,
                      'strikingly': 1,
                      'big': 38,
                      'Juvenile': 3,
                      'Sidney': 2,
                      '180-degrees': 1,
                      'services': 18,
                      'nourished': 1,
                      'Cod': 2,
                      'attempted': 5,
                      'completely': 6,
                      'journey': 2,
                      'C': 4,
                      'wooden': 1,
                      'cold': 6,
                      'opened': 9,
                      'resumption': 2,
                      'Elliott': 1,
                      'rookies': 2,
                      'radioactive': 1,
                      'blabbed': 1,
                      'Baker': 6,
                      'fate': 1,
                      'Holy': 2,
                      "Tuesday's": 1,
                      'satin': 4,
                      'stood': 4,
                      'worst': 2,
                      'Boxwood': 1,
                      'lend': 1,
                      'instrumentals': 1,
                      'Milenoff': 1,
                      'Washington-Alexandria': 1,
                      'street': 5,
                      'Horstman': 1,
                      'Gets': 1,
                      '158-pounder': 1,
                      'gas-fired': 1,
                      'Parker': 5,
                      'Belleville': 1,
                      '11,744': 1,
                      'Deloris': 1,
                      'inlaid': 1,
                      'current': 13,
                      'View': 2,
                      'senseless': 1,
                      'Grayson': 1,
                      'fill': 3,
                      'Redevelopment': 3,
                      'Section': 4,
                      'expectations': 1,
                      'Sybert': 1,
                      'Allegretti': 1,
                      'modernized': 1,
                      'accuse': 1,
                      'town': 18,
                      'fulfilled': 3,
                      '$700': 2,
                      'Economically': 1,
                      'settling': 1,
                      'now': 76,
                      'Zinman': 1,
                      'solos': 1,
                      'Bonanza': 1,
                      'full-fledged': 1,
                      'dramatic': 8,
                      'erupts': 1,
                      'bestowal': 1,
                      'Cooperation': 1,
                      'cab': 3,
                      'domination': 1,
                      'De': 11,
                      'Yankee': 6,
                      '4-year-old': 1,
                      "we'll": 4,
                      'lay': 5,
                      'Belanger': 1,
                      'Carson': 1,
                      'newcomers': 1,
                      'loss': 11,
                      'three': 97,
                      "Atlanta's": 4,
                      'automatic': 5,
                      'consulted': 4,
                      'demand': 17,
                      'batch': 1,
                      'stock': 21,
                      'Garza': 1,
                      'Inauguration': 3,
                      'Spring': 1,
                      'By-passing': 1,
                      'alcoholics': 2,
                      'downed': 1,
                      '$28': 4,
                      'San': 22,
                      'furnish': 2,
                      'presidential': 5,
                      'wage': 3,
                      'split-level': 1,
                      'industrial': 5,
                      'Kunkel': 2,
                      'Convenience': 2,
                      'pure': 2,
                      'organize': 2,
                      'extended': 5,
                      'Bernhard': 1,
                      "Louis's": 1,
                      'Screw': 1,
                      'Giacomo': 1,
                      'FBI': 5,
                      'Actor-Crooner': 1,
                      'Slow': 1,
                      "Wouldn't": 1,
                      'ineffectual': 1,
                      'motion': 2,
                      'onetime': 1,
                      'car': 50,
                      'Mansion': 1,
                      'lucrative': 1,
                      'cap': 1,
                      'completions': 3,
                      '$30': 1,
                      'A.': 38,
                      '3-month': 1,
                      'Hyannis': 2,
                      'NLRB': 1,
                      "Hyde's": 1,
                      'present': 30,
                      'describes': 1,
                      'throws': 2,
                      'well-played': 1,
                      '$58,918': 1,
                      'science': 7,
                      'Merit': 1,
                      'officiated': 3,
                      'receives': 1,
                      'be': 526,
                      'wealth': 2,
                      'alternative': 4,
                      'racing': 2,
                      'names': 6,
                      'ready': 12,
                      'Exhibition': 1,
                      'baseballs': 1,
                      'Crawford': 2,
                      'cotton-growing': 1,
                      'bonding': 1,
                      'wed': 2,
                      'adjoining': 2,
                      'shares': 19,
                      'engaged': 3,
                      "Musician's": 1,
                      '220-yard': 1,
                      'Journal-American': 1,
                      'conclusion': 3,
                      'ant': 3,
                      'reasons': 12,
                      'auto': 3,
                      'janitor': 1,
                      'minutes': 25,
                      'Baseball': 4,
                      'Missiles': 1,
                      ...}),
             'religion': Counter({'resented': 1,
                      'injustice': 1,
                      'argument': 4,
                      'Seminary': 2,
                      'read': 7,
                      '23': 4,
                      'entails': 2,
                      'speak': 9,
                      'Wonderland': 1,
                      'dost': 1,
                      'grow': 4,
                      'entities': 4,
                      'vein': 1,
                      'education': 2,
                      'house': 3,
                      'believes': 5,
                      'note': 2,
                      'adultery': 1,
                      "B.B.C.'s": 1,
                      'garb': 1,
                      'building': 1,
                      'Adams': 2,
                      'Boards': 3,
                      'generation': 6,
                      'promptly': 1,
                      'pastoral': 1,
                      'morally': 1,
                      'tracks': 1,
                      'restrain': 1,
                      'operating': 1,
                      'use': 16,
                      'than': 69,
                      'wiped': 1,
                      'killed': 1,
                      'shifts': 1,
                      'recurrence': 1,
                      'experience': 27,
                      'applies': 1,
                      'endure': 1,
                      'served': 4,
                      'committee': 1,
                      'ailments': 3,
                      'Norm': 1,
                      'bars': 1,
                      'population': 10,
                      'rents': 1,
                      'Lucifer': 1,
                      'trip': 4,
                      'typewriter': 1,
                      'anyhow': 1,
                      'has': 111,
                      'letter': 5,
                      'surrendering': 1,
                      'pleading': 3,
                      'pragmatic': 1,
                      'maintenance': 2,
                      'our': 77,
                      'along': 5,
                      'certain': 17,
                      'wages': 3,
                      'zeal': 1,
                      'God': 131,
                      'inward': 1,
                      'demanded': 1,
                      'others': 13,
                      'stronghold': 1,
                      'editorial': 4,
                      'torrent': 1,
                      'strains': 1,
                      'transition': 1,
                      'grew': 2,
                      'Islamic': 2,
                      'contributes': 1,
                      'Diario': 2,
                      'beautiful': 5,
                      'gesture': 2,
                      'Ruth': 1,
                      'sixth': 1,
                      'mailing': 1,
                      'Greenleaf': 1,
                      'marriages': 1,
                      'first': 25,
                      'self-respect': 1,
                      'refuse': 3,
                      'Eternal': 1,
                      'admit': 4,
                      'developed': 5,
                      'ratiocinating': 1,
                      'Shakya': 1,
                      'seed': 1,
                      'promises': 3,
                      'hell-bound': 1,
                      'dominating': 1,
                      'Pythagoreans': 1,
                      'wall': 4,
                      'page': 3,
                      'instructed': 2,
                      'determine': 4,
                      'under': 19,
                      'correct': 3,
                      'reborn': 1,
                      'proportion': 3,
                      'Peace': 1,
                      'metal': 1,
                      'kitchen': 1,
                      "''": 266,
                      'amount': 2,
                      'orders': 2,
                      'ripples': 3,
                      'pressure': 4,
                      'Creator': 3,
                      'rejections': 1,
                      'reciting': 1,
                      'directed': 1,
                      'Berlin': 3,
                      'dynamic': 2,
                      'hear': 5,
                      'man': 64,
                      'redeemed': 1,
                      'evils': 3,
                      'Loyalty': 1,
                      'erased': 1,
                      'awe': 1,
                      'through': 48,
                      'Section': 1,
                      'Following': 1,
                      'never-to-be-forgotten': 1,
                      'anew': 1,
                      '1850': 1,
                      'nationalism': 3,
                      'Just': 2,
                      'specific': 3,
                      'each': 30,
                      'Because': 6,
                      'battle': 1,
                      'Delaware': 2,
                      'proclamation': 1,
                      'Sponsors': 1,
                      'motion': 2,
                      'tons': 2,
                      'arrogance': 1,
                      'France': 1,
                      'it': 264,
                      'burden': 1,
                      'confident': 1,
                      'urns': 1,
                      'balanced': 2,
                      'freely': 2,
                      'raised': 4,
                      'alternation': 1,
                      'love': 13,
                      'meanings': 4,
                      'rise': 2,
                      'ate': 1,
                      'Schleiermacher': 1,
                      'grave': 4,
                      'repeated': 2,
                      'eventually': 3,
                      'astonishingly': 1,
                      'believing': 1,
                      'Center': 4,
                      'eternal': 7,
                      'features': 3,
                      'epitomized': 1,
                      'distrust': 2,
                      'service': 12,
                      'hard': 3,
                      'At': 7,
                      'walked': 2,
                      ')': 127,
                      'Sky-god': 1,
                      'nonchurchgoing': 1,
                      'mastery': 1,
                      'endures': 1,
                      'colleagues': 2,
                      'Take': 1,
                      'loudspeakers': 1,
                      'baby': 1,
                      'Those': 4,
                      'doctrines': 1,
                      'rejection': 2,
                      '1776': 1,
                      'inculcation': 1,
                      'drop': 2,
                      '$2,000': 1,
                      'calculated': 3,
                      'cause': 9,
                      'friends': 5,
                      'integral': 1,
                      'subtly': 2,
                      'shouting': 1,
                      'worse': 3,
                      'Teresa': 1,
                      'anti-slavery': 11,
                      'negative': 3,
                      'truthfully': 1,
                      'thoroughfare': 1,
                      'small': 13,
                      'literal': 1,
                      'create': 2,
                      'enemy': 4,
                      'wept': 1,
                      'recruits': 1,
                      'prophesied': 1,
                      'sect': 1,
                      'Cardinal': 3,
                      'revolutions': 1,
                      'detrimental': 2,
                      'forward': 2,
                      'recently': 4,
                      'training': 3,
                      'distinct': 1,
                      'thunder': 1,
                      'Heenan': 1,
                      'peas': 17,
                      'legislation': 2,
                      'next': 9,
                      'accelerating': 1,
                      'four': 10,
                      'badly': 1,
                      'searching': 1,
                      'Africans': 1,
                      'Len': 2,
                      'bronze': 1,
                      'broad': 1,
                      'stigma': 1,
                      'ball': 1,
                      'passed': 5,
                      'external': 2,
                      'castigation': 1,
                      'please': 1,
                      'ideology': 1,
                      'frivolity': 1,
                      'mountains': 3,
                      'rushed': 1,
                      'female': 2,
                      'efforts': 12,
                      'effect': 13,
                      'lived': 5,
                      'exact': 1,
                      'clothed': 1,
                      'thin': 1,
                      'sciences': 1,
                      'returning': 2,
                      'King': 1,
                      'Interfaith': 2,
                      'sputniks': 1,
                      'brokers': 2,
                      'recommending': 1,
                      'sanctimonious': 1,
                      'family': 20,
                      'ever-changing': 1,
                      'spending': 1,
                      'modernists': 1,
                      'realities': 3,
                      'twelve': 1,
                      'demoted': 1,
                      'paralleling': 1,
                      'contradiction': 1,
                      'fear': 24,
                      'hymn': 3,
                      'merits': 1,
                      'Communists': 1,
                      'dowry': 1,
                      'rice': 1,
                      'P': 3,
                      'theses': 1,
                      'pulpits': 1,
                      'destructive': 1,
                      'needed': 1,
                      'late': 4,
                      '``': 270,
                      'Communism': 2,
                      'systematically': 1,
                      'conversely': 1,
                      'personalities': 1,
                      'marveled': 1,
                      'ask': 9,
                      'urging': 1,
                      'found': 13,
                      'Realtor': 1,
                      'triumph': 1,
                      'unction': 1,
                      'included': 1,
                      'Nevertheless': 4,
                      "one's": 4,
                      'warmly': 1,
                      'mistakes': 1,
                      'wonders': 2,
                      'ethos': 1,
                      'mother': 3,
                      'things': 23,
                      'Now': 12,
                      'Nonconformists': 1,
                      'alienated': 1,
                      'can': 82,
                      'participants': 1,
                      'independent': 2,
                      'dismal': 1,
                      'sees': 3,
                      'debunking': 1,
                      'Mountain': 1,
                      'permission': 1,
                      'suppose': 1,
                      'carnal': 1,
                      'nation-states': 1,
                      'bits': 1,
                      'are': 193,
                      '18': 3,
                      'imperial': 1,
                      'treatment': 1,
                      'drawn': 3,
                      'ecclesiastical': 2,
                      'reconcile': 1,
                      'advisers': 1,
                      'instruction': 2,
                      'one-third': 2,
                      'brethren': 4,
                      'hatred': 1,
                      'reflects': 1,
                      'feelings': 3,
                      'constituted': 1,
                      'Buddhist': 1,
                      'giveth': 2,
                      'activities': 4,
                      'Early': 2,
                      'voting': 1,
                      'leave': 8,
                      'York': 5,
                      'exquisitely': 1,
                      'importance': 3,
                      'assert': 1,
                      'power': 49,
                      'yielded': 1,
                      'till': 1,
                      'shoot': 1,
                      'shadow': 2,
                      'classes': 4,
                      'any': 60,
                      'Buddhists': 2,
                      'bore': 1,
                      'Hence': 2,
                      'dying': 3,
                      'observe': 1,
                      'automatically': 1,
                      'insurgence': 1,
                      'talent': 3,
                      'front': 2,
                      'deceived': 1,
                      'absolutes': 1,
                      'Ibn': 1,
                      'dispute': 1,
                      'gladness': 1,
                      'Odors': 1,
                      'severely': 1,
                      'institutionalized': 1,
                      '1961': 5,
                      'Beautiful': 1,
                      'Pragmatism': 1,
                      'drink': 1,
                      'Beach': 1,
                      'run': 8,
                      'discussed': 3,
                      'persons': 10,
                      'equity': 1,
                      'Continental': 1,
                      'addresses': 1,
                      'tribulation': 1,
                      'effected': 3,
                      'indelible': 1,
                      'unconcerned': 1,
                      'visitor': 1,
                      'intellectus': 1,
                      'thousand': 6,
                      '70,000,000': 1,
                      'themselves': 15,
                      'academic': 1,
                      'music': 6,
                      'visit': 3,
                      'experiences': 4,
                      'recounted': 1,
                      'fallen': 1,
                      'eighteenth': 1,
                      'Church': 43,
                      'himself': 22,
                      'metaphysic': 3,
                      'dropped': 1,
                      'parties': 4,
                      'provoke': 1,
                      'divinely': 1,
                      'speaks': 1,
                      'anyone': 1,
                      'reader': 1,
                      'unnecessary': 1,
                      'her': 8,
                      'correspond': 1,
                      'bottom': 1,
                      'spring': 1,
                      'eradicate': 1,
                      'continuous': 1,
                      'role': 5,
                      'longer': 14,
                      'sing': 1,
                      'strange': 2,
                      'Missions': 1,
                      'mercy': 4,
                      'Manchester': 2,
                      'methods': 1,
                      'band': 1,
                      'trends': 3,
                      'visited': 2,
                      'liberation': 1,
                      'hyperbole': 2,
                      'confessional': 2,
                      'disposed': 2,
                      'Also': 1,
                      'radio': 3,
                      'societies': 1,
                      'trees': 1,
                      'Some': 19,
                      'squarely': 1,
                      'pilgrim': 1,
                      'possessing': 2,
                      'Japanese': 1,
                      'exchange': 2,
                      'due': 4,
                      'considering': 3,
                      'attainment': 4,
                      'selection': 1,
                      'seeming': 1,
                      'Scotian': 1,
                      'Arhat': 1,
                      'appearing': 1,
                      'compressed': 1,
                      'sixty': 1,
                      'Virgin': 4,
                      'proviso': 1,
                      'serve': 3,
                      'overcomes': 2,
                      'problem': 12,
                      'walking': 1,
                      'percentage': 4,
                      'stage': 1,
                      'Catherine': 1,
                      'Outline': 1,
                      'enlightened': 2,
                      'reflect': 1,
                      'looking': 2,
                      'commission': 1,
                      'Bushnell': 1,
                      'nuns': 1,
                      'shorter': 1,
                      'unavoidably': 2,
                      'striking': 1,
                      'British': 1,
                      'monk': 7,
                      'Time': 1,
                      'prudential': 1,
                      'humor': 1,
                      'periods': 1,
                      'resent': 2,
                      "John's": 2,
                      'astounding': 1,
                      'exclusive': 2,
                      'descending': 1,
                      'paredon': 2,
                      'anti-Christian': 1,
                      'teacher': 4,
                      'pointed': 2,
                      'realized': 1,
                      'determination': 2,
                      'same': 28,
                      'possess': 3,
                      'place': 12,
                      'piece': 3,
                      'apprehend': 1,
                      'hardly': 2,
                      'ethicist': 1,
                      'amusement': 2,
                      '1861': 2,
                      'reform': 4,
                      'atmosphere': 5,
                      'apprehension': 2,
                      'union': 3,
                      'melting': 1,
                      "Batista's": 1,
                      'physician': 1,
                      'dictator': 2,
                      'greater': 11,
                      'gulf': 1,
                      'selflessness': 1,
                      'therefore': 9,
                      'remain': 3,
                      'equal': 7,
                      'factual': 1,
                      'think': 17,
                      'could': 59,
                      'Especially': 1,
                      'Smith': 1,
                      'Federal': 1,
                      'organizes': 1,
                      'fondness': 1,
                      'stopped': 1,
                      'alleviation': 2,
                      'unmixed': 1,
                      'related': 2,
                      'structurally': 1,
                      'circumcision': 1,
                      'frank': 2,
                      'Along': 1,
                      'pride': 1,
                      'shore': 1,
                      'remarkably': 1,
                      '13th': 1,
                      'wise': 1,
                      'customs': 1,
                      'all-inclusive': 1,
                      'Buddhism': 6,
                      'unity': 21,
                      'proximate': 1,
                      'Dr.': 7,
                      'tagged': 1,
                      'defend': 1,
                      'appetites': 1,
                      'meant': 5,
                      'shield': 1,
                      'old': 5,
                      'attested': 1,
                      'morals': 2,
                      'tiny': 1,
                      'forbids': 1,
                      'proportionate': 3,
                      'picture': 15,
                      'contain': 2,
                      'includes': 1,
                      'mystics': 1,
                      'for': 283,
                      'slavery': 11,
                      'equally': 1,
                      'buildings': 3,
                      'between': 31,
                      'excellent': 2,
                      'cost': 5,
                      'exempt': 1,
                      'host': 3,
                      'Nationalism': 1,
                      '6': 6,
                      'oceans': 1,
                      'report': 3,
                      'converts': 2,
                      'braces': 1,
                      'appear': 2,
                      'diagrams': 1,
                      'Roman': 13,
                      'irritations': 1,
                      'recognize': 5,
                      'improve': 1,
                      'World': 4,
                      'marked': 4,
                      'higher': 3,
                      'diplomatic': 2,
                      'eyes': 4,
                      'filled': 5,
                      'simple': 4,
                      'racially': 1,
                      'firmly': 1,
                      'Until': 1,
                      'hand': 10,
                      'dear': 2,
                      'fruit': 2,
                      'Taoists': 2,
                      'covenant': 1,
                      'remind': 3,
                      'Varner': 1,
                      'contingencies': 2,
                      'favorite': 1,
                      'universality': 1,
                      'brightens': 1,
                      'told': 9,
                      'they': 115,
                      'Alice': 1,
                      'done': 13,
                      'Communist': 3,
                      'discovery': 1,
                      'evangelism': 4,
                      '37': 1,
                      'may': 78,
                      'The': 185,
                      'heat': 1,
                      'benefit': 2,
                      'uselessly': 1,
                      'objectively': 1,
                      'Boston': 7,
                      'reports': 3,
                      'undermined': 1,
                      'portion': 1,
                      'emphasize': 2,
                      'rained': 1,
                      'celebrated': 1,
                      'law': 10,
                      'boot': 1,
                      'Lloyd': 1,
                      'Slough': 1,
                      'clues': 1,
                      'Buri': 3,
                      'defense': 3,
                      'quarter': 2,
                      'police': 1,
                      'world': 90,
                      'acting': 1,
                      'observed': 2,
                      'state': 14,
                      're-evaluation': 1,
                      'trend': 4,
                      'felt': 6,
                      'private': 3,
                      'conception': 3,
                      'contagion': 1,
                      'variant': 1,
                      'confuse': 1,
                      'precisely': 4,
                      'Pledge': 1,
                      'nun': 1,
                      'fighting': 1,
                      'protest': 1,
                      '1845': 1,
                      'self-confidence': 1,
                      'relaxation': 1,
                      'Confucian': 3,
                      'brutally': 1,
                      'spell': 4,
                      'relativity': 1,
                      'hate': 3,
                      'facts': 2,
                      'recreation': 1,
                      'enjoyed': 2,
                      'doctrinal': 1,
                      'assembled': 2,
                      'disastrous': 1,
                      'taking': 4,
                      'L.': 1,
                      'fascination': 1,
                      'dependable': 1,
                      'too': 28,
                      'agreed-upon': 1,
                      'exciting': 1,
                      'aspect': 2,
                      'confirmed': 2,
                      'ruler': 1,
                      'outside': 5,
                      'hung': 2,
                      'relation': 1,
                      '355': 1,
                      'book': 4,
                      'aids': 1,
                      'sets': 2,
                      'following': 11,
                      'opiates': 1,
                      'Basic': 1,
                      'event': 8,
                      'diametrically': 1,
                      'Devonshire': 1,
                      'employees': 1,
                      'specialization': 2,
                      'particularistic': 1,
                      'bound': 2,
                      'Musical': 1,
                      'intend': 1,
                      'presents': 3,
                      "Paul's": 1,
                      'above': 2,
                      'endlessly': 1,
                      'celebrating': 1,
                      'purpose': 4,
                      'volumes': 1,
                      'unpublished': 1,
                      'give': 17,
                      'catechism': 1,
                      'foundations': 2,
                      'Thus': 11,
                      'membership': 33,
                      'assemble': 1,
                      'successfully': 1,
                      'medieval': 2,
                      'implications': 3,
                      'upset': 2,
                      'scientists': 3,
                      'treason': 1,
                      'housing': 13,
                      'infatuation': 2,
                      'odor': 1,
                      'convenient': 1,
                      'infraction': 1,
                      'rapidly': 1,
                      'waving': 1,
                      'mapping': 1,
                      'mutual': 3,
                      'diverse': 1,
                      'statistics': 3,
                      'strenuous': 3,
                      'redactions': 1,
                      'allow': 3,
                      'critical': 2,
                      'circulate': 1,
                      'correctness': 1,
                      'anticipations': 1,
                      'identified': 2,
                      'grasped': 1,
                      'cancer': 2,
                      'Pope': 3,
                      'inadequacy': 1,
                      'hermeneutics': 1,
                      'actually': 8,
                      'limit': 1,
                      'advice': 1,
                      'Several': 4,
                      'explanation': 2,
                      'replied': 1,
                      'preserve': 4,
                      'Machado': 1,
                      'sea': 5,
                      'Behold': 1,
                      'thought': 12,
                      'employed': 4,
                      'earthly': 4,
                      'on': 160,
                      'response': 3,
                      'conviction': 5,
                      'compared': 2,
                      'authority': 2,
                      'Queen': 1,
                      'Our': 7,
                      'Perhaps': 5,
                      'maturation': 1,
                      'confused': 1,
                      'football': 3,
                      'big': 5,
                      'earnestly': 1,
                      'Tsou': 1,
                      'dislikes': 1,
                      'comply': 1,
                      'Sidney': 1,
                      'clearly': 9,
                      'nourished': 1,
                      'spiral': 6,
                      'attempted': 1,
                      'completely': 10,
                      'abides': 1,
                      'C': 1,
                      'wei': 2,
                      'civilization': 1,
                      'cold': 2,
                      'Reproach': 1,
                      'financial': 2,
                      'coverage': 2,
                      'apostolic': 1,
                      'excited': 1,
                      'regeneration': 1,
                      'Tai': 1,
                      'idle': 1,
                      'utterly': 4,
                      'Opportunities': 2,
                      'mere': 4,
                      'mend': 1,
                      'fate': 2,
                      'insults': 1,
                      'Heavenly': 1,
                      'clergyman': 4,
                      'intellectually': 2,
                      'worst': 2,
                      'reminding': 1,
                      'contracted': 1,
                      'impact': 1,
                      'Psalm': 3,
                      'glasses': 1,
                      'street': 1,
                      'fortiori': 1,
                      'rid': 1,
                      'self-deprecation': 1,
                      'orbiting': 1,
                      'Parker': 28,
                      'Bible-emancipated': 1,
                      'sins': 5,
                      'utmost': 1,
                      'Braille': 1,
                      'current': 2,
                      'worth': 1,
                      'thousands': 1,
                      'tradition': 8,
                      'priest': 4,
                      'expectations': 1,
                      'down-and-outers': 1,
                      '22': 1,
                      'Alfred': 1,
                      'intended': 6,
                      'town': 2,
                      'opinions': 2,
                      'capitalists': 1,
                      'now': 33,
                      'lies': 3,
                      'reign': 1,
                      'dramatic': 4,
                      'radiant': 1,
                      'expects': 1,
                      'faint': 1,
                      'demolition': 1,
                      'suddenly': 3,
                      'De': 2,
                      '260': 1,
                      'gets': 3,
                      'respond': 1,
                      'advocate': 1,
                      'individuality': 1,
                      'division': 3,
                      'loss': 2,
                      'Will': 1,
                      'bigotry': 1,
                      'three': 20,
                      'shut': 1,
                      'proclaims': 2,
                      'demand': 11,
                      'presumes': 1,
                      'stock': 5,
                      'restrictive': 3,
                      'centenary': 1,
                      'sympathy': 2,
                      'San': 1,
                      'presidential': 1,
                      'Unitarians': 2,
                      'bulletin': 1,
                      'clearer': 1,
                      'industrial': 1,
                      'virtue': 5,
                      'pure': 3,
                      'organize': 1,
                      'extended': 1,
                      'disciples': 1,
                      'alumni': 1,
                      'emphatically': 1,
                      'choice': 12,
                      'desires': 1,
                      'joy': 3,
                      '1789-1839': 1,
                      "community's": 1,
                      'hole': 1,
                      'provided': 4,
                      'Taoist': 3,
                      'forceful': 1,
                      'nation': 12,
                      'heroic': 1,
                      'narrow-minded': 1,
                      'personal': 14,
                      'difficulty': 3,
                      'Arhats': 1,
                      'monastic': 1,
                      'secularists': 1,
                      'West': 8,
                      'present': 25,
                      'describes': 1,
                      'dealing': 2,
                      'science': 8,
                      'necessary': 8,
                      'unleashing': 1,
                      'hit': 1,
                      'receives': 3,
                      'be': 243,
                      'must': 54,
                      'racing': 1,
                      'Scholars': 1,
                      'helpful': 3,
                      'ready': 1,
                      'reconsidered': 3,
                      'Mary': 4,
                      'an': 119,
                      'Troeltsch': 1,
                      'suspicion': 3,
                      'presented': 2,
                      'culture': 4,
                      'witches': 3,
                      'abundance': 1,
                      'shares': 1,
                      'sincerity': 3,
                      'Rauschenbusch': 1,
                      'engaged': 2,
                      'Movement': 1,
                      'findings': 2,
                      'designate': 1,
                      'conclusion': 3,
                      'reasons': 5,
                      'traditionally': 1,
                      'Ti': 1,
                      'prudentially': 1,
                      "T'ien": 1,
                      'remote': 1,
                      'oxygen': 1,
                      'Head': 1,
                      'joined': 2,
                      'criticize': 1,
                      'transience': 1,
                      'friendly': 2,
                      'Garrison': 3,
                      'war-dirty': 1,
                      'refusal': 2,
                      'Word': 7,
                      'regular': 1,
                      'shivering': 1,
                      'covering': 1,
                      'numerous': 1,
                      'contended': 1,
                      'status': 2,
                      'Ferris': 1,
                      'central': 3,
                      'carefully': 4,
                      'colonial': 1,
                      'twenty-eighth': 1,
                      'folly': 2,
                      'fail': 1,
                      'commands': 1,
                      'burial': 1,
                      'residence': 1,
                      'Glenn': 1,
                      'dignity': 1,
                      'how': 23,
                      'force': 9,
                      '1803-1895': 1,
                      'employ': 1,
                      'establishment': 4,
                      'weeping': 2,
                      'preparation': 8,
                      'prevented': 1,
                      'popularly': 1,
                      'traditional': 13,
                      'Bay': 3,
                      'throw': 2,
                      'key': 1,
                      'embraces': 1,
                      'obligations': 1,
                      'repressions': 1,
                      'Parris': 2,
                      'paid': 5,
                      'trail': 1,
                      'Had': 1,
                      'strongest': 2,
                      'Certainly': 1,
                      '1924': 1,
                      'revealed': 1,
                      'alone': 8,
                      'rightly': 2,
                      'humans': 1,
                      'statutes': 1,
                      'deemed': 1,
                      'cherubim': 1,
                      'Bultmann': 4,
                      '1815': 3,
                      'counseling': 1,
                      'into': 82,
                      'precepts': 1,
                      'strife': 1,
                      'question': 10,
                      'else': 3,
                      'cult': 3,
                      'local': 10,
                      'justify': 3,
                      'let': 11,
                      'promote': 1,
                      'religions': 6,
                      'libertarians': 1,
                      'devout': 2,
                      'gave': 11,
                      'becoming': 2,
                      'May': 1,
                      'Vineyard': 1,
                      'facing': 1,
                      'necessity': 3,
                      'seeks': 1,
                      'grown': 2,
                      'dare': 3,
                      'separating': 1,
                      '12': 4,
                      'render': 1,
                      'Jewish': 1,
                      'majesty': 1,
                      'octillion': 3,
                      'relationship': 3,
                      'anchorite': 1,
                      'existed': 4,
                      'because': 39,
                      'doubting': 1,
                      'chaplains': 1,
                      '2': 20,
                      'decides': 1,
                      'persisted': 2,
                      'hours': 7,
                      'hunger': 2,
                      'preconditioned': 2,
                      'variety': 2,
                      'Puritan': 2,
                      '220': 1,
                      'Fortunately': 2,
                      'technological': 1,
                      ...}),
             'reviews': Counter({'Without': 1,
                      'festival': 5,
                      'contented': 2,
                      'Wallenstein': 2,
                      'deft': 1,
                      'More': 4,
                      '23': 11,
                      'fiesta': 1,
                      'Anthony': 3,
                      'located': 1,
                      'plush': 1,
                      'infectious': 1,
                      'future-day': 1,
                      "Hendricks'": 1,
                      'education': 1,
                      'house': 6,
                      'Hewett': 1,
                      'barrage': 1,
                      "choir's": 1,
                      'outmaneuvered': 1,
                      'waters': 3,
                      'bespeak': 1,
                      'Between': 1,
                      'restrain': 1,
                      'operating': 1,
                      'use': 8,
                      'Lower': 2,
                      'killed': 4,
                      'vitally': 2,
                      'recruits': 1,
                      'endure': 1,
                      'sly': 1,
                      'bars': 1,
                      'sports': 2,
                      "Charley's": 1,
                      'Viscount': 2,
                      'thoroughly': 2,
                      'February': 1,
                      'Imaginary': 1,
                      'sextet': 2,
                      'well-received': 1,
                      'abrupt': 1,
                      'certain': 5,
                      'season': 24,
                      'Nazi': 2,
                      'K': 1,
                      'shade': 2,
                      'others': 14,
                      'Nikolai': 2,
                      'transition': 1,
                      'Laboratory': 1,
                      'catch': 2,
                      'beautiful': 11,
                      'averaged': 1,
                      'Lewisohn': 4,
                      'denying': 1,
                      'Toobin': 2,
                      'cadenza': 2,
                      'Kornevey': 1,
                      'company': 15,
                      'Stevens': 1,
                      'cliches': 2,
                      'discovery': 3,
                      'Vicenza': 2,
                      'materially': 1,
                      'Shrewd': 1,
                      'Pontchartrain': 1,
                      'correct': 1,
                      "Hampton's": 1,
                      'lumbering': 1,
                      'kitchen': 1,
                      'Gilels': 2,
                      'amount': 5,
                      'Mazowsze': 1,
                      'few': 15,
                      'omitted': 1,
                      'Renaissance': 1,
                      'pressure': 1,
                      'scientists': 4,
                      'gorgeous': 1,
                      "wouldn't": 1,
                      'Grigory': 1,
                      'through': 26,
                      'Gospel-singer': 1,
                      'polonaise': 1,
                      'Just': 2,
                      'specific': 4,
                      'each': 22,
                      'Because': 2,
                      'Boulez': 1,
                      '29-Oct.': 1,
                      'Xydis': 5,
                      'Lanza': 2,
                      'tons': 1,
                      'timid': 1,
                      'stakes': 1,
                      'decaying': 1,
                      'forget': 2,
                      'doltish': 1,
                      'ceased': 1,
                      "yesterday's": 1,
                      'Democrats': 2,
                      'cottage': 1,
                      'attractively': 1,
                      'shy': 1,
                      'raised': 1,
                      'meanings': 1,
                      'ex-prize': 1,
                      'revealed': 3,
                      'Ernst': 1,
                      'grave': 1,
                      'distaste': 1,
                      '19,000,000': 1,
                      'sweet-sounding': 1,
                      'repressive': 1,
                      'situations': 2,
                      "Shakespeare's": 2,
                      'At': 12,
                      ')': 101,
                      'mastery': 1,
                      'colleagues': 1,
                      'novelized': 1,
                      'Those': 2,
                      'Raine': 1,
                      'Nelson': 1,
                      'career': 1,
                      'Splendid': 1,
                      'friends': 2,
                      'twists': 2,
                      'Tannhaeuser': 2,
                      'subtly': 1,
                      'shouting': 1,
                      'small': 10,
                      'bull': 1,
                      'needle-sharp': 1,
                      'jam': 1,
                      'Skolovsky': 4,
                      'enemy': 3,
                      'competition': 1,
                      'experience': 6,
                      'revolutions': 1,
                      'also': 31,
                      'recently': 10,
                      'earthy': 2,
                      'tournament': 2,
                      'distinct': 2,
                      'diehards': 1,
                      'next': 10,
                      'accelerating': 1,
                      'armchair': 1,
                      'Poindexter': 1,
                      'four': 6,
                      'Serafin': 1,
                      'e.g.': 1,
                      'really': 19,
                      'peoples': 1,
                      'broad': 6,
                      'Foliage': 2,
                      'understanding': 2,
                      'drinker': 2,
                      'messy': 1,
                      'striving': 1,
                      'rushed': 1,
                      'borrowed': 1,
                      'Getz': 1,
                      'signals': 1,
                      'effect': 6,
                      'bookers': 1,
                      'Girl': 3,
                      'NBC': 3,
                      'Cologne': 1,
                      'Kingwood': 1,
                      "brain's": 1,
                      'encores': 1,
                      'on-stage': 1,
                      'population': 6,
                      'King': 3,
                      'artists': 4,
                      'physiological': 2,
                      'confronting': 1,
                      'evening': 18,
                      'Replied': 1,
                      'Joseph': 4,
                      'keen': 1,
                      'eyewitness': 1,
                      'tragi-comic': 1,
                      'retrieved': 1,
                      'regretted': 1,
                      'tempted': 1,
                      'formidably': 1,
                      'ask': 5,
                      'pleasures': 1,
                      'found': 5,
                      'Bastianini': 1,
                      'Yurochka': 1,
                      'guitarist': 1,
                      "one's": 5,
                      'domes': 1,
                      'pretty': 6,
                      "afternoon's": 1,
                      'barges': 1,
                      'mother': 3,
                      'Garry': 1,
                      'things': 10,
                      'Now': 7,
                      'thank': 1,
                      'department': 2,
                      'squealed': 1,
                      'divided': 1,
                      'suppose': 2,
                      'Hall': 14,
                      'transported': 1,
                      '18': 2,
                      'drawn': 2,
                      'spur': 1,
                      'stops': 1,
                      'monologist': 1,
                      'bat': 1,
                      'stags': 1,
                      'German': 4,
                      'obliterate': 1,
                      "Leningrad's": 1,
                      'compulsive': 1,
                      'chanted': 1,
                      'vehicles': 2,
                      'caloric': 3,
                      '1959': 2,
                      'musicians': 5,
                      "Cole's": 1,
                      'shoot': 3,
                      'prints': 2,
                      'marched': 1,
                      'Hutton': 1,
                      'flights': 1,
                      'Overture': 4,
                      'Resourceful': 1,
                      'outset': 1,
                      'Gomez': 1,
                      'cycles': 1,
                      'required': 3,
                      'Arrowhead': 1,
                      'candid': 1,
                      'Gaieties': 1,
                      'Taruffi': 1,
                      'throng': 1,
                      '1961': 1,
                      'strip': 1,
                      'Beach': 1,
                      'run': 1,
                      'discussed': 2,
                      'stores': 2,
                      'Projects': 2,
                      'semi-abstractions': 1,
                      'fronting': 1,
                      'Sonates': 1,
                      'dimension': 2,
                      'uncomfortable': 2,
                      'themselves': 9,
                      'roughness': 1,
                      'Yardumian': 1,
                      'academic': 1,
                      'experiences': 1,
                      'Pinkie': 1,
                      'Dec.': 7,
                      'speaks': 3,
                      'anyone': 8,
                      'reader': 9,
                      'pulled': 2,
                      'continuous': 1,
                      'cosmology': 1,
                      'Roxy': 1,
                      'Tenda': 1,
                      'mercy': 1,
                      'sentences': 1,
                      'domestic': 1,
                      'visited': 1,
                      "L'orchestre": 1,
                      'meticulously': 2,
                      'executing': 1,
                      'teens': 1,
                      'myriad': 1,
                      'encroaching': 1,
                      'arty': 1,
                      'delivery': 1,
                      'seacoast': 1,
                      'trees': 2,
                      "mother's": 1,
                      "Crosby's": 1,
                      'Oscar': 1,
                      'Japanese': 4,
                      'exchange': 1,
                      'Frederick': 1,
                      'due': 3,
                      'Sheldon': 1,
                      '1876': 1,
                      'devastating': 1,
                      'unabashed': 1,
                      'plea': 1,
                      'aural': 1,
                      'overcomes': 1,
                      'problem': 7,
                      'Summer': 1,
                      'winter': 1,
                      'distance': 1,
                      'goodies': 1,
                      'Nervousness': 1,
                      'specialize': 1,
                      'informal': 2,
                      'reflect': 2,
                      'looking': 3,
                      'Journal': 3,
                      'Belgium': 1,
                      'tasting': 1,
                      'British': 15,
                      'Bradley': 1,
                      'moderately': 1,
                      'clever': 5,
                      'humor': 9,
                      'periods': 1,
                      "John's": 1,
                      'Smiling': 1,
                      'sentimental': 4,
                      'Big': 2,
                      'descending': 1,
                      "Cibula's": 1,
                      'determination': 1,
                      'slowly': 4,
                      'Korneyev': 1,
                      'place': 19,
                      'piece': 7,
                      'Parks': 2,
                      'atmosphere': 5,
                      'blessed': 2,
                      'melting': 1,
                      'urgent': 1,
                      'inspiring': 2,
                      'race-driver': 3,
                      'folk-dance': 1,
                      'Eh': 1,
                      '110': 1,
                      'instrument': 1,
                      'think': 10,
                      'could': 40,
                      'dissimilar': 1,
                      'Lawrence': 1,
                      'conform': 1,
                      'teenagers': 1,
                      'audience': 32,
                      'rich': 6,
                      'Carnegie': 5,
                      'Wilson': 2,
                      'spaciousness': 1,
                      'elaborate': 2,
                      'planting': 1,
                      'Along': 1,
                      'pride': 3,
                      'Chorale': 1,
                      'Franklin': 1,
                      'Brooding': 1,
                      'compulsion': 1,
                      'Tallchief': 1,
                      'ran': 1,
                      'tiny': 1,
                      'Sonoma': 1,
                      'picture': 13,
                      'prevents': 1,
                      'sympathetic': 2,
                      'hypothalamus': 1,
                      'gaunt': 1,
                      'fan': 4,
                      'between': 20,
                      '1643': 2,
                      'host': 1,
                      'Hirsch': 2,
                      'intimate': 4,
                      'nailed': 1,
                      'gilded': 1,
                      'recognize': 1,
                      'emerge': 1,
                      'marked': 1,
                      'Toch': 1,
                      'unstilted': 1,
                      'higher': 3,
                      'Dutch': 1,
                      'Stephen': 1,
                      'foliage': 1,
                      "fan's": 1,
                      'firmly': 2,
                      'Tenderly': 1,
                      'oppression': 1,
                      'told': 7,
                      'composers': 5,
                      'velvet': 1,
                      'distinctly': 1,
                      'chicken': 1,
                      'thenceforth': 1,
                      'The': 322,
                      'collaborator': 1,
                      'benefit': 1,
                      'coincidence': 1,
                      'Hodges': 1,
                      'of': 1299,
                      'shores': 1,
                      'celebrated': 2,
                      '1920s': 1,
                      'Toast': 1,
                      'police': 1,
                      'visiting': 1,
                      "Angeles'": 1,
                      'narrative': 2,
                      'perky': 1,
                      'yearned': 1,
                      'corner': 1,
                      'bold': 3,
                      'Skating': 1,
                      'lived': 4,
                      'Sokolov': 2,
                      'wars': 3,
                      'Information': 1,
                      'Adultery': 1,
                      'Gibson': 3,
                      'Alfredo': 1,
                      'facts': 6,
                      'blockages': 1,
                      'assembled': 4,
                      'plump': 1,
                      'disastrous': 1,
                      'announcing': 2,
                      'taking': 5,
                      'recovery': 2,
                      'display': 5,
                      'preacher-singer': 1,
                      'eccentrics': 1,
                      'scale': 1,
                      'funny': 6,
                      "America's": 2,
                      'outside': 6,
                      'relation': 1,
                      'book': 26,
                      'sprue': 1,
                      'Eire': 1,
                      'unrest': 1,
                      'criticism': 3,
                      'Notre': 1,
                      'glances': 1,
                      'jacket': 2,
                      'blame': 3,
                      'conventions': 1,
                      'old-time': 1,
                      'presents': 7,
                      'purpose': 4,
                      'give': 14,
                      "Puttin'": 1,
                      'misfortune': 2,
                      'nos.': 1,
                      'horrors': 1,
                      'Terrible': 1,
                      'tape': 2,
                      'beauty': 5,
                      'flatly': 1,
                      'formalities': 1,
                      'slavish': 1,
                      'blond': 1,
                      'Richard': 6,
                      'Milstein': 4,
                      'diverse': 1,
                      'allow': 2,
                      'critical': 3,
                      'library': 1,
                      'cancer': 3,
                      'missing': 1,
                      'actually': 2,
                      'Several': 4,
                      'explanation': 2,
                      'connected': 2,
                      'juice': 1,
                      'Billy': 2,
                      'Holmes': 1,
                      'Roll': 1,
                      'bang-sashes': 1,
                      'reached': 4,
                      'anyway': 2,
                      'programed': 1,
                      'response': 2,
                      'authority': 2,
                      'Point': 1,
                      'Perhaps': 5,
                      'grace': 1,
                      'missile': 1,
                      'gallant': 1,
                      'big': 23,
                      'Zooey': 1,
                      'quantities': 1,
                      'arresting': 2,
                      'E': 1,
                      'Yea': 1,
                      'nourished': 1,
                      'attempted': 3,
                      'Mario': 1,
                      'C': 3,
                      'alike': 1,
                      'cold': 2,
                      'opened': 11,
                      'Elliott': 1,
                      'feared': 1,
                      'nurses': 1,
                      'Saturated': 1,
                      'fate': 2,
                      'clip': 1,
                      'storyline': 2,
                      'unfortunately': 3,
                      'deck': 2,
                      'lend': 3,
                      'mealie-meal': 1,
                      'street': 1,
                      'bow': 1,
                      'pessimistic': 1,
                      'Vadim': 1,
                      'current': 5,
                      'Rhythms': 1,
                      'brand-new': 1,
                      'bonus': 1,
                      'ludicrous': 1,
                      'cares': 2,
                      'glumly': 1,
                      'trumpeter': 1,
                      'metaphorical': 1,
                      'town': 1,
                      'fulfilled': 1,
                      'situated': 2,
                      'now': 20,
                      'solos': 1,
                      "Larkin's": 1,
                      'dramatic': 9,
                      'Elaine': 1,
                      'suddenly': 3,
                      'De': 12,
                      'gets': 6,
                      'Kerr': 1,
                      'lay': 4,
                      'Carson': 1,
                      'declamatory': 1,
                      'three': 13,
                      'demand': 2,
                      'stock': 1,
                      'words': 8,
                      'planks': 1,
                      'San': 6,
                      'romancing': 1,
                      'virtue': 2,
                      'pure': 1,
                      'Anna': 2,
                      'shed': 1,
                      'nervous': 1,
                      'College': 7,
                      'intrigues': 1,
                      'detective': 1,
                      'continuity': 2,
                      'forceful': 1,
                      'car': 2,
                      'Treasure': 1,
                      'rebut': 1,
                      'Quintet': 1,
                      'A.': 4,
                      'finely': 1,
                      'Triptych': 1,
                      'musicality': 1,
                      'present': 12,
                      'describes': 1,
                      'throws': 1,
                      'science': 3,
                      'influential': 1,
                      'interdependence': 1,
                      'mid-October': 2,
                      'be': 153,
                      'racing': 7,
                      'names': 4,
                      'frolic': 1,
                      'vegetable': 4,
                      'Bix': 1,
                      'Ten': 1,
                      'abundance': 1,
                      'France': 5,
                      'sincerity': 4,
                      'posing': 1,
                      'heap': 1,
                      'resolute': 1,
                      'conclusion': 2,
                      'reasons': 5,
                      'auto': 1,
                      'invent': 1,
                      'minutes': 5,
                      'Agriculture': 1,
                      'remote': 1,
                      'touching': 1,
                      'sweepings': 1,
                      'joined': 1,
                      "He's": 1,
                      'gallantry': 2,
                      'Hearts': 2,
                      'travelled': 1,
                      'docile': 1,
                      'Cesare': 1,
                      'afternoon': 6,
                      'Alvise': 2,
                      'Dream': 1,
                      'Forgotten': 1,
                      'status': 2,
                      'entirety': 1,
                      'commercial': 1,
                      'tarantara': 1,
                      'central': 3,
                      'abbreviated': 1,
                      'soloist': 6,
                      'dignity': 3,
                      'suitably': 1,
                      'force': 2,
                      'replaced': 1,
                      'sneaky': 1,
                      'traditional': 1,
                      'ballerina': 1,
                      'eastern': 2,
                      'RCA': 2,
                      'wildly': 1,
                      'Aspencades': 2,
                      'vistas': 2,
                      'encumbered': 1,
                      'lighting': 2,
                      "script's": 1,
                      'alone': 7,
                      'bracing': 1,
                      'cent': 8,
                      'breathe': 1,
                      'filagree': 1,
                      'into': 42,
                      'principally': 1,
                      'long-sought': 1,
                      'Psyche': 1,
                      'Mijbil': 3,
                      'Tintoretto': 1,
                      'affiliated': 1,
                      'quicksilver': 1,
                      'else': 10,
                      'let': 9,
                      'boatsmen': 1,
                      'Fifty-fifth': 2,
                      'genuinely': 2,
                      'becoming': 7,
                      'May': 2,
                      'promising': 1,
                      'necessity': 1,
                      'managed': 3,
                      'autobiography': 1,
                      'narrator': 1,
                      'Jewish': 1,
                      'method': 1,
                      'Mimieux': 1,
                      'murder': 1,
                      'shakes': 2,
                      'grown-up': 1,
                      'election': 2,
                      'sweeping': 1,
                      'landscapes': 2,
                      '2': 9,
                      'violin': 4,
                      'insiders': 1,
                      'ramps': 1,
                      'self-deceptions': 1,
                      'directors': 1,
                      'matchmaking': 1,
                      'revolved': 1,
                      "Russia's": 2,
                      'assimilation': 1,
                      'temperature': 1,
                      'embodiment': 1,
                      'curative': 1,
                      'entourage': 2,
                      'Iraq': 1,
                      'electric': 1,
                      'Instrumental': 1,
                      'hypothesis': 1,
                      'Bud': 1,
                      'Danish': 1,
                      'Garrick': 1,
                      'corners': 1,
                      'nightclub': 1,
                      'Countrymen': 1,
                      'Brass': 1,
                      'textures': 1,
                      'Otherwise': 1,
                      'S.': 3,
                      'Cervantes': 1,
                      'Adagio': 1,
                      'parades': 1,
                      'remarks': 3,
                      'Pavlovsky': 1,
                      'overly': 2,
                      'preoccupied': 1,
                      'commands': 2,
                      '1915': 1,
                      'standards': 1,
                      'clothes': 4,
                      'lukewarm': 1,
                      'qualities': 6,
                      'dispose': 1,
                      'converse': 1,
                      'Alger': 1,
                      'dignified': 1,
                      'monumental': 1,
                      'Plebian': 1,
                      'released': 1,
                      'ends': 1,
                      'ninth': 1,
                      'chronology': 1,
                      'letter': 4,
                      'treacherous': 1,
                      'Drifts': 1,
                      'plan': 2,
                      'infected': 1,
                      'parked': 2,
                      'startlingly': 1,
                      'substance': 4,
                      "rock'n'roll": 1,
                      'from': 138,
                      '500-mile': 1,
                      'Hot': 1,
                      'shredding': 1,
                      'appearance': 2,
                      'ball': 1,
                      'serfs': 1,
                      'reality': 1,
                      'hooting': 1,
                      'quintet': 1,
                      'studied': 3,
                      'twenty-five': 1,
                      'deeds': 1,
                      'years': 33,
                      'Nutcracker': 3,
                      'East': 5,
                      'probably': 8,
                      'frequently': 8,
                      'eagerness': 1,
                      'During': 3,
                      'corniest': 1,
                      'Cold': 1,
                      'Chorus': 1,
                      'wry': 1,
                      'Obviously': 1,
                      'trio': 1,
                      'what': 44,
                      'or': 97,
                      'states': 4,
                      'fifteen': 2,
                      'destruction': 2,
                      'Wind': 1,
                      '300th': 1,
                      'city': 7,
                      'attributes': 1,
                      'sex': 1,
                      "writers'": 1,
                      'related': 3,
                      'active': 1,
                      'Shelley': 1,
                      'easier': 1,
                      'Phil': 1,
                      'unbidden': 1,
                      'allowing': 2,
                      'general': 9,
                      'puerile': 1,
                      'presentations': 2,
                      'invite': 1,
                      'diplomat': 1,
                      'Jouvet': 4,
                      'primary': 3,
                      'national': 3,
                      'tomorrow': 2,
                      'formulae': 1,
                      'top-drawer': 1,
                      'Allied': 10,
                      'expanding': 2,
                      'notes': 5,
                      'skillful': 1,
                      'Naomi': 2,
                      'Bulba': 2,
                      'collections': 1,
                      'called': 20,
                      'opinions': 2,
                      'urges': 1,
                      'blues': 3,
                      'fictional': 1,
                      'Futhermore': 1,
                      'year': 17,
                      'reverse': 2,
                      'Rhenish': 1,
                      'Straits': 1,
                      'Coletta': 1,
                      'courage': 3,
                      'painted': 2,
                      'city-bred': 1,
                      'Radio': 2,
                      'G.': 2,
                      'stager': 1,
                      'deals': 1,
                      'scraps': 1,
                      'conspicuously': 1,
                      'ransack': 1,
                      'organized': 2,
                      'wonder': 6,
                      'partial': 1,
                      'objective': 1,
                      'coarsened': 1,
                      'three-week': 1,
                      'born': 2,
                      'sharply': 2,
                      'non-sentimental': 1,
                      'chance': 7,
                      'tarpon': 1,
                      "Shearing's": 1,
                      'spirits': 1,
                      'high-minded': 1,
                      'introspection': 1,
                      "composer's": 1,
                      'salty': 1,
                      'gaps': 1,
                      'compositions': 2,
                      'indomitable': 1,
                      'not': 143,
                      'Kennett': 1,
                      'sons': 1,
                      'unsympathetic': 1,
                      'headquarters': 2,
                      'chuckles': 1,
                      'epigrams': 1,
                      'imponderable': 1,
                      '19': 1,
                      'savory': 1,
                      'growth': 2,
                      'college': 2,
                      'legato': 1,
                      'Shearing': 3,
                      'placed': 3,
                      'enforced': 1,
                      'noble': 2,
                      'illustrated': 1,
                      'set': 16,
                      'Melodious': 1,
                      'hormones': 1,
                      'Married': 1,
                      'rationalist': 1,
                      'arrangement': 4,
                      'manages': 1,
                      'laced': 1,
                      'essential': 1,
                      'five': 6,
                      'breathtaking': 1,
                      'common': 7,
                      'less': 16,
                      'Mondays': 1,
                      'ballets': 1,
                      'thrusting': 1,
                      'trailers': 1,
                      'participate': 1,
                      'jocular': 1,
                      "Creston's": 1,
                      'application': 1,
                      'temperament': 1,
                      'chronologically': 1,
                      'calisthenics': 1,
                      'ceremonies': 1,
                      'Ancel': 1,
                      'dark': 2,
                      'Mi': 1,
                      'shall': 1,
                      '1,488': 1,
                      'word': 16,
                      'innocence': 1,
                      'magnificent': 5,
                      'showcase': 2,
                      'thinks': 3,
                      'benefactor': 1,
                      'shells': 1,
                      '211': 1,
                      'Sharing': 1,
                      'Ferguson': 1,
                      'Galina': 1,
                      'gardens': 1,
                      'sloppy': 1,
                      'Katherine': 1,
                      'Walter': 2,
                      'dam': 1,
                      'Anton': 2,
                      'bronze': 1,
                      'greenhouses': 1,
                      'Chicago-style': 1,
                      'business': 7,
                      'strength': 2,
                      'Toccata': 1,
                      'claims': 1,
                      'golfer': 1,
                      'Van': 2,
                      'conductor': 6,
                      'airdrops': 1,
                      'propaganda': 1,
                      '5-mile': 1,
                      'conclusions': 3,
                      'operatic': 3,
                      'gems': 2,
                      'publication': 3,
                      'multimillionaire': 1,
                      'troubles': 1,
                      'largely': 3,
                      'unusual': 1,
                      'Young': 1,
                      'vigilant': 1,
                      'Brother': 1,
                      'flow': 5,
                      'white': 8,
                      '1941': 1,
                      'consistency': 1,
                      'Gesangverein': 1,
                      'Brahms': 5,
                      'stares': 1,
                      'task': 3,
                      'Swedish': 1,
                      'stumbled': 1,
                      'pro-Communist': 1,
                      'warmed-over': 1,
                      'Dvorak': 1,
                      'sort': 6,
                      'temptation': 1,
                      'loudest': 1,
                      'couples': 1,
                      'LeClair': 1,
                      "Jenni's": 1,
                      'partially': 1,
                      'Preludes': 1,
                      'touring': 1,
                      'categorize': 1,
                      'artificial': 2,
                      'comedians': 1,
                      'enacting': 1,
                      'depending': 1,
                      'depression': 1,
                      'kind': 13,
                      'prolonged': 1,
                      'questionable': 1,
                      'distant': 2,
                      'bear': 3,
                      'wicked': 1,
                      'animals': 1,
                      'super-charged': 1,
                      'Picon': 1,
                      'exact': 1,
                      'experimenters': 1,
                      'recalled': 1,
                      'speedboat': 1,
                      'casually': 2,
                      'pleasantly': 1,
                      'reward': 1,
                      'visitors': 1,
                      'sneering': 1,
                      'rapists': 1,
                      'accurately': 1,
                      'forty-five': 1,
                      'marinas': 2,
                      'Malcolm': 1,
                      'mettlesome': 1,
                      'Dunn': 1,
                      'countryside': 1,
                      'chronicle': 3,
                      'ear': 3,
                      'tea': 2,
                      'dramatically': 1,
                      'best': 20,
                      'Rider': 1,
                      'James': 11,
                      'Hart': 9,
                      'lower': 1,
                      'ingredient': 1,
                      "Leonato's": 1,
                      'though': 18,
                      'buckwheat': 1,
                      'successful': 5,
                      'shift': 1,
                      'facetiously': 1,
                      'defeat': 1,
                      'motif': 1,
                      'nurse': 1,
                      'princess': 1,
                      "aren't": 1,
                      'spot': 3,
                      'crimson': 1,
                      'cells': 2,
                      'Department': 4,
                      'twilight': 1,
                      'extremes': 1,
                      'birches': 1,
                      '2454': 1,
                      'gobbledygook': 1,
                      'dogmatism': 1,
                      ...}),
             'romance': Counter({'Without': 2,
                      'vacuum': 2,
                      'festival': 1,
                      'contented': 2,
                      'laden': 1,
                      'More': 4,
                      'oystchers': 1,
                      'anyhow': 4,
                      'speak': 11,
                      'plain-out': 1,
                      'house': 36,
                      'promises': 1,
                      'skilled': 1,
                      'muscles': 1,
                      'shipwrecked': 1,
                      'overdeveloped': 1,
                      'darling': 4,
                      'building': 8,
                      'Adams': 1,
                      'waters': 1,
                      'dig': 1,
                      'threshold': 1,
                      'use': 14,
                      'than': 65,
                      'killed': 4,
                      'experience': 4,
                      'served': 1,
                      'jubilantly': 1,
                      'sports': 3,
                      'tempted': 2,
                      'thoroughly': 2,
                      'February': 1,
                      'Spike-haired': 1,
                      'pleading': 1,
                      'forum': 2,
                      'coursing': 1,
                      'certain': 13,
                      'season': 1,
                      'Jenny': 7,
                      'inward': 2,
                      'shade': 2,
                      'others': 9,
                      'sunshine': 2,
                      'Wait': 2,
                      'mercy': 1,
                      'bucking': 1,
                      'taut': 3,
                      'catch': 3,
                      'beautiful': 25,
                      'shorter': 1,
                      'marrying': 2,
                      'relay': 1,
                      'Karamazov': 1,
                      'Jeroboam': 1,
                      'denying': 1,
                      "John'll": 1,
                      'brushing': 2,
                      'sorted': 1,
                      'imposition': 1,
                      'nuzzled': 1,
                      'hardships': 1,
                      'inflection': 1,
                      'chic': 2,
                      'greatness': 2,
                      'drab': 1,
                      'necktie': 1,
                      "cousins'": 1,
                      'less-dramatic': 1,
                      'correct': 2,
                      'corporation': 2,
                      'cabins': 1,
                      'glance': 6,
                      'sage': 1,
                      'sommelier': 1,
                      'lumbering': 1,
                      'kitchen': 12,
                      'amount': 1,
                      "She'd": 3,
                      'rabbits': 1,
                      'kibbutzim': 1,
                      'fire-colored': 1,
                      'crack': 2,
                      'omitted': 1,
                      'glances': 2,
                      'Seeing': 2,
                      'way': 83,
                      'intriguingly': 1,
                      'unbelievable': 2,
                      "wouldn't": 33,
                      'clucked': 1,
                      'hurry': 3,
                      'interdependent': 1,
                      'through': 56,
                      'Just': 28,
                      "we'll": 4,
                      'each': 24,
                      'bucking-up': 1,
                      'Ah': 5,
                      'Rossi': 1,
                      'expanse': 1,
                      'tons': 1,
                      'timid': 2,
                      'vegetable': 1,
                      'forget': 5,
                      "Maggie's": 3,
                      'behave': 1,
                      'postscript': 1,
                      'log-house': 1,
                      'cottage': 6,
                      'shy': 1,
                      'raised': 6,
                      'magnificence': 1,
                      'stray': 1,
                      'catchee': 1,
                      'stream': 3,
                      'ate': 4,
                      'grave': 3,
                      'distaste': 2,
                      'charlotte': 1,
                      'sprung': 1,
                      'buggy': 1,
                      'eternal': 3,
                      'blackout': 1,
                      'descend': 1,
                      'fences': 1,
                      'At': 29,
                      'nervously': 3,
                      "I'm": 77,
                      'focused': 1,
                      'vessels': 1,
                      'les': 1,
                      'tipsy': 1,
                      'Those': 5,
                      'surprisingly': 2,
                      'bordering': 1,
                      'drop': 5,
                      'prophets': 1,
                      'career': 2,
                      'idly': 1,
                      'husband': 10,
                      'friends': 13,
                      'Toonker': 1,
                      'bet': 3,
                      'dully': 1,
                      'Deegan': 22,
                      'shouting': 5,
                      'wiped': 1,
                      'sands': 1,
                      'slip': 1,
                      'negative': 1,
                      "patient's": 1,
                      'Afterwards': 1,
                      'small': 31,
                      'Myra': 24,
                      'competition': 1,
                      'Heads': 1,
                      'intrepid': 1,
                      'doubtfully': 1,
                      'snag': 1,
                      'knife': 9,
                      'hostess': 1,
                      'missiles': 1,
                      'recently': 2,
                      'earthy': 1,
                      'vanity': 1,
                      'babes': 1,
                      'snapped': 5,
                      'next': 43,
                      'sly': 1,
                      'strike': 1,
                      'shall': 3,
                      'lapping': 2,
                      'searching': 3,
                      'really': 38,
                      'broad': 5,
                      'tireless': 2,
                      'disdain': 1,
                      'Please': 5,
                      'comforting': 1,
                      'rushed': 5,
                      'lath': 2,
                      'signals': 1,
                      'effect': 4,
                      'Pete': 11,
                      'Anything': 2,
                      'giggled': 1,
                      'King': 2,
                      'artists': 2,
                      'grasses': 1,
                      'masculine': 1,
                      'Remembering': 1,
                      'evening': 14,
                      'prisoners': 5,
                      'dripped': 1,
                      'twelve': 5,
                      'telegraph': 1,
                      'keen': 1,
                      'relax': 1,
                      'Lovejoy': 3,
                      'terribly': 4,
                      'Tolley': 6,
                      'regretted': 2,
                      'defiance': 1,
                      'assets': 2,
                      'shooing': 1,
                      'ask': 10,
                      'invested': 1,
                      'found': 29,
                      'floorshow': 1,
                      "one's": 4,
                      'pretty': 19,
                      'Springfield': 1,
                      'scholarships': 1,
                      'mother': 44,
                      'Galt': 1,
                      'things': 30,
                      'Now': 29,
                      'job': 27,
                      'thank': 3,
                      'parkish': 1,
                      'department': 1,
                      'independent': 1,
                      'lewd': 1,
                      'permission': 2,
                      'divided': 3,
                      'suppose': 7,
                      'Hall': 2,
                      'Patrol': 2,
                      'sir': 6,
                      'bat': 4,
                      'harshened': 1,
                      'hatred': 4,
                      'German': 1,
                      'neighbor': 1,
                      'scream': 2,
                      'signal': 4,
                      'vehicles': 1,
                      'exquisitely': 1,
                      'grabbed': 1,
                      'medium-sized': 1,
                      'gushed': 1,
                      'yielded': 1,
                      'commencement': 1,
                      'shadow': 3,
                      'teats': 2,
                      'fabric': 1,
                      'chuck-a-luck': 1,
                      "wasn't": 50,
                      'ordinarily': 1,
                      'bore': 3,
                      'dying': 5,
                      'Lord': 6,
                      'strip': 2,
                      'curiously': 2,
                      'hunting': 3,
                      'Coopers': 1,
                      'run': 16,
                      'discussed': 2,
                      'hundreds': 1,
                      'unconcerned': 1,
                      'grandchildren': 2,
                      'uncomfortable': 1,
                      'cacophony': 1,
                      'themselves': 10,
                      'guest': 6,
                      'Collector': 1,
                      'licking': 1,
                      'forepaws': 1,
                      'wicker': 1,
                      'positive': 4,
                      'Santa': 1,
                      "Gladdy's": 2,
                      'Stern-faced': 1,
                      'elegant': 2,
                      'defeatism': 1,
                      'aviator': 1,
                      'pulled': 8,
                      'sprinkling': 2,
                      'framed': 1,
                      'novelty': 1,
                      'visited': 3,
                      'loaded': 1,
                      'Also': 1,
                      'radio': 1,
                      'coat': 6,
                      'delivery': 1,
                      'trees': 7,
                      "mother's": 4,
                      'Oscar': 1,
                      'Japanese': 14,
                      'flatter': 1,
                      'due': 4,
                      'seeming': 1,
                      'plea': 1,
                      'warmed': 2,
                      'furniture': 2,
                      'sixty': 2,
                      'marry': 7,
                      'problem': 2,
                      'pulse': 2,
                      'winter': 5,
                      'distance': 7,
                      'mutineer': 1,
                      'walking': 12,
                      'percentage': 1,
                      'receive': 1,
                      'quok': 1,
                      'insurmountable': 1,
                      'midweek': 1,
                      'reflect': 1,
                      'looking': 36,
                      'azalea': 2,
                      'crook': 1,
                      'Watch': 1,
                      'Time': 1,
                      'humor': 5,
                      'periods': 1,
                      'baggy': 2,
                      'Pendleton': 3,
                      "We're": 4,
                      'sentimental': 2,
                      'Big': 3,
                      'bottle': 8,
                      'quench': 1,
                      'pointed': 7,
                      'whiskered': 1,
                      'determination': 1,
                      'slowly': 22,
                      'unlocked': 2,
                      'place': 45,
                      'piece': 2,
                      'rode': 4,
                      'laments': 2,
                      'amusement': 1,
                      'tasted': 1,
                      'underestimate': 1,
                      'Straightened': 1,
                      'undying': 1,
                      'atmosphere': 2,
                      'blessed': 1,
                      'melting': 1,
                      'urgent': 1,
                      'Ugh': 1,
                      'Eh': 2,
                      'instrument': 1,
                      'could': 193,
                      'Especially': 1,
                      'enjoying': 2,
                      'winking': 2,
                      'conform': 1,
                      'audience': 1,
                      'rich': 6,
                      'stopped': 25,
                      'Wilson': 3,
                      'dilapidated': 1,
                      'elaborate': 2,
                      'creating': 1,
                      'pride': 2,
                      'sequence': 1,
                      'okay': 1,
                      'tapping': 1,
                      'darkened': 2,
                      'increasingly': 1,
                      'brute': 1,
                      'snippy': 1,
                      'ran': 13,
                      'wagons': 1,
                      'optimism': 1,
                      'Cars': 1,
                      'meant': 11,
                      'tiny': 12,
                      'picture': 8,
                      'guarding': 1,
                      'A.M.': 1,
                      'sympathetic': 2,
                      'unemployment': 1,
                      'stairway': 1,
                      'buildings': 4,
                      'between': 22,
                      'cost': 3,
                      'glorying': 1,
                      'intimate': 3,
                      'noncommittally': 1,
                      'eighth': 1,
                      'gilded': 1,
                      'emerge': 1,
                      'daydreaming': 1,
                      'higher': 3,
                      'Ready': 1,
                      'Ronald': 3,
                      'forever-Cathy': 1,
                      "Pete's": 1,
                      'sewed': 1,
                      'nod': 3,
                      'firmly': 2,
                      'backed': 1,
                      'fruit': 1,
                      'yanking': 1,
                      'dismissed': 1,
                      'nowhere': 2,
                      'told': 47,
                      'paot': 3,
                      'discovery': 1,
                      'chants': 1,
                      'The': 230,
                      'heat': 4,
                      'downstairs': 4,
                      'dilatation': 1,
                      'messing': 1,
                      'Hodges': 2,
                      'good-living': 1,
                      'Boy': 1,
                      'of': 1186,
                      'bicarbonate': 1,
                      'law': 1,
                      'transcending': 1,
                      'stained': 1,
                      'squirt': 1,
                      'dainty': 1,
                      'virility': 1,
                      'rear': 1,
                      'Homes': 1,
                      'unnnt': 1,
                      'yearned': 1,
                      'corner': 9,
                      'bold': 1,
                      'cups': 2,
                      'industrialist': 1,
                      'lived': 19,
                      'wars': 1,
                      'Larkspur': 2,
                      'spoiled': 2,
                      'mingled': 1,
                      'watching': 20,
                      'maid': 4,
                      'numb': 1,
                      'facts': 2,
                      'Hudson': 1,
                      'unusually': 1,
                      'Rhine-Main': 1,
                      "Riverside's": 1,
                      'assembled': 1,
                      'announcing': 1,
                      'peculiar': 2,
                      'display': 2,
                      'sherbet-colored': 1,
                      'wallpaper': 2,
                      'funny': 7,
                      'factory': 1,
                      'entry': 1,
                      'overhead': 3,
                      'outside': 11,
                      'hung': 3,
                      'arc': 1,
                      'book': 10,
                      'suggested': 6,
                      'begging': 2,
                      'Renaissance': 1,
                      'jacket': 9,
                      'stairways': 1,
                      'Marvelous': 1,
                      'pressure': 2,
                      'bridesmaids': 1,
                      'sullen': 3,
                      'knowingly': 1,
                      'intend': 1,
                      'endlessly': 1,
                      'snuck': 1,
                      'purpose': 1,
                      'give': 28,
                      'loathing': 1,
                      'clamped': 2,
                      'absolute': 1,
                      'misfortune': 1,
                      'dishes': 1,
                      'tape': 2,
                      'beauty': 6,
                      'odor': 1,
                      'flatly': 1,
                      'glove': 3,
                      'tunnel': 1,
                      'blond': 2,
                      'Richard': 17,
                      'waving': 3,
                      'limpid': 1,
                      'sanction': 1,
                      'strenuous': 1,
                      'applying': 1,
                      'allow': 1,
                      'shuttered': 1,
                      'blushed': 1,
                      'library': 3,
                      'cancer': 2,
                      'missing': 6,
                      'actually': 5,
                      'Several': 1,
                      'rocked': 2,
                      'explanation': 1,
                      'replied': 2,
                      'juice': 4,
                      'Malta': 1,
                      'purtiest': 1,
                      'after': 58,
                      'questioning': 2,
                      'ward': 6,
                      'reached': 6,
                      "Brush-off's": 1,
                      'anyway': 10,
                      'response': 2,
                      'bargain': 1,
                      'authority': 1,
                      'Perhaps': 9,
                      'grace': 3,
                      'confused': 2,
                      'cherries': 1,
                      'Shall': 1,
                      'strikingly': 1,
                      'big': 36,
                      'overhand': 1,
                      'quantities': 1,
                      'inspect': 1,
                      'femme': 1,
                      'nine-to-five': 1,
                      'completely': 1,
                      'wooden': 3,
                      'lore': 1,
                      'freak': 2,
                      'opened': 13,
                      'Elliott': 1,
                      'soldierly': 1,
                      'Papanicolaou': 1,
                      'twenty-three': 1,
                      'Holy': 2,
                      'Heavenly': 1,
                      'satin': 1,
                      'stood': 37,
                      'worst': 3,
                      'Dolan': 2,
                      'deck': 4,
                      'lipstick': 2,
                      'drinking': 4,
                      'rages': 1,
                      'street': 14,
                      'glass': 7,
                      'bow': 1,
                      'cops': 2,
                      'priest': 1,
                      'Longue': 3,
                      'Leaning': 1,
                      'wears': 1,
                      'mist-like': 1,
                      'accuse': 1,
                      'town': 15,
                      'now': 83,
                      'solos': 1,
                      'azaleas': 2,
                      'radiant': 2,
                      'upsets': 1,
                      'suddenly': 22,
                      'colder': 1,
                      'gets': 4,
                      'Quint': 11,
                      'weatherproof': 1,
                      'lay': 11,
                      'loss': 3,
                      "Deegan's": 2,
                      'Jobs': 1,
                      'three': 17,
                      'shut': 9,
                      'blazing': 1,
                      'Because': 3,
                      'stock': 7,
                      'Spring': 4,
                      'downed': 1,
                      'San': 5,
                      'saddles': 1,
                      'virtue': 1,
                      'pat': 1,
                      'pure': 4,
                      'organize': 1,
                      'shed': 1,
                      'patiently': 2,
                      'lips': 11,
                      'blossom': 1,
                      'mama': 1,
                      'joy': 4,
                      'beast': 1,
                      "Wouldn't": 1,
                      'dark-gray': 1,
                      'car': 20,
                      'godless': 1,
                      'shine': 1,
                      'discovered': 5,
                      'crickets': 1,
                      'salad': 1,
                      'present': 9,
                      'sonofabitch': 2,
                      'science': 6,
                      'unnaturally': 1,
                      'be': 289,
                      'alternative': 1,
                      'racing': 3,
                      'names': 1,
                      'ready': 11,
                      'Ten': 1,
                      'witches': 1,
                      'abundance': 1,
                      'France': 1,
                      'engaged': 1,
                      'dark-blue': 1,
                      'helluva': 1,
                      'conclusion': 1,
                      'ceased': 1,
                      'janitor': 1,
                      'minutes': 9,
                      'unbent': 1,
                      'remote': 3,
                      'touching': 5,
                      'joined': 4,
                      "He's": 14,
                      'nose': 7,
                      'sharpened': 1,
                      'covering': 1,
                      'afternoon': 12,
                      'prisoner': 2,
                      'teasing': 1,
                      'drained': 2,
                      'thousand-legged': 1,
                      'frivolous': 2,
                      'commercial': 1,
                      'sun-warmed': 1,
                      'harbor': 2,
                      'shivered': 2,
                      'satin-covered': 1,
                      'brandishing': 2,
                      'fail': 2,
                      'boarding': 1,
                      'burial': 2,
                      'mergers': 1,
                      "lovers'": 1,
                      'dignity': 4,
                      'force': 4,
                      'replaced': 2,
                      'Puny': 1,
                      'sneaky': 1,
                      'dawns': 1,
                      'prevented': 1,
                      'popularly': 1,
                      'rackety': 1,
                      'eastern': 1,
                      'dinners': 2,
                      'wildly': 5,
                      'Eating': 1,
                      'Donald': 7,
                      "child's": 3,
                      'revealed': 2,
                      'alone': 24,
                      'pardon': 1,
                      'fetes': 1,
                      'cent': 3,
                      'Half': 2,
                      'beneath': 6,
                      'bullet': 1,
                      'cramp': 1,
                      "You'd": 2,
                      'into': 136,
                      'encouragingly': 1,
                      'snoring': 1,
                      'address': 3,
                      'else': 29,
                      'skipped': 3,
                      'justify': 1,
                      'let': 39,
                      'fonder': 1,
                      'becoming': 3,
                      'promising': 2,
                      'outraged': 1,
                      'artfully': 1,
                      'necessity': 2,
                      'brother': 5,
                      'Somebody': 4,
                      'managed': 4,
                      'gleaming': 1,
                      'Dillinger': 1,
                      'fifty-fifty': 1,
                      'method': 1,
                      'murder': 4,
                      'Swan': 1,
                      'scarf': 2,
                      'grown-up': 1,
                      'Kare': 1,
                      'sweeping': 1,
                      'forgiven': 1,
                      'conceptions': 1,
                      "Alexander's": 2,
                      'gumming': 1,
                      'ironed': 1,
                      'hunger': 2,
                      'husband-stealer': 1,
                      'insiders': 1,
                      "Partlow's": 1,
                      'directors': 1,
                      'instigating': 1,
                      'pupils': 3,
                      'straining': 2,
                      'switch': 1,
                      'desiring': 1,
                      'Subdued': 1,
                      'Gordon': 2,
                      'singsonged': 1,
                      'electric': 1,
                      'jammed': 2,
                      'muttered': 3,
                      'secretaries': 1,
                      'absurd': 3,
                      ')': 21,
                      'forgive': 1,
                      'corners': 1,
                      'around': 68,
                      'understands': 1,
                      'edges': 2,
                      "I'll": 53,
                      'mosaic': 1,
                      'parades': 1,
                      'maw': 1,
                      'monotonous': 1,
                      'overly': 1,
                      'tawny': 1,
                      'naked': 7,
                      'clothes': 26,
                      'merest': 1,
                      'lukewarm': 1,
                      'advancement': 1,
                      'dignified': 1,
                      'thankful': 3,
                      'released': 1,
                      'neatness': 1,
                      'rapping': 2,
                      'attracted': 1,
                      'letter': 19,
                      'hide': 1,
                      "Horne's": 1,
                      'frenziedly': 1,
                      'infected': 1,
                      'parked': 1,
                      'biggest': 1,
                      'from': 202,
                      'honestly': 2,
                      "four-o'clock": 1,
                      'Swallow': 1,
                      'ages': 2,
                      'warning': 2,
                      'appearance': 4,
                      'atheists': 1,
                      'Pollock': 1,
                      'ball': 18,
                      'paunch': 1,
                      'Mother': 5,
                      'half-murmured': 1,
                      'Fudo': 7,
                      'reality': 1,
                      "Concetta's": 2,
                      'stretch': 1,
                      'gambit': 1,
                      'highball': 2,
                      'studied': 7,
                      'twenty-five': 1,
                      'appreciate': 1,
                      'years': 34,
                      'sticks': 3,
                      'East': 4,
                      'probably': 13,
                      'brunettes': 1,
                      'frequently': 1,
                      'During': 2,
                      'Cold': 1,
                      'pilgrim': 1,
                      'tore': 3,
                      'Obviously': 1,
                      'cashmere': 1,
                      'what': 121,
                      'or': 150,
                      'alloy': 1,
                      "Don't": 24,
                      'states': 2,
                      "Jim's": 3,
                      'shaky': 2,
                      'put-upon': 1,
                      'city': 10,
                      'sex': 4,
                      'ginkgo': 1,
                      'stomach': 8,
                      'sensing': 1,
                      'skins': 1,
                      'backing': 1,
                      'easier': 3,
                      'consciousness': 1,
                      'thoughtfully': 1,
                      'accordance': 1,
                      'Cursed': 1,
                      'stickman': 1,
                      'general': 3,
                      'sprawled': 2,
                      'Sam': 18,
                      'invite': 1,
                      'heart': 17,
                      'bleak': 1,
                      'national': 5,
                      'tomorrow': 10,
                      'umbrella': 2,
                      'expanding': 2,
                      'grasp': 1,
                      'contracted': 1,
                      "lady's": 2,
                      'called': 31,
                      'namesake': 1,
                      'year': 14,
                      'victory': 2,
                      'Dunne': 4,
                      'Clouds': 4,
                      'Too': 2,
                      'courage': 4,
                      'subdued': 2,
                      'painted': 4,
                      'agricultural': 1,
                      'susceptible': 1,
                      'bring': 9,
                      'shopping': 1,
                      'uncomfortably': 2,
                      'showering': 1,
                      'hurled': 1,
                      'ripening': 1,
                      'wardroom': 1,
                      'protected': 2,
                      'wonder': 10,
                      'solution': 3,
                      'born': 1,
                      'sharply': 3,
                      'makeshift': 1,
                      'chance': 16,
                      'tenant': 1,
                      'spirits': 8,
                      'Methodist': 1,
                      'blackjack': 2,
                      'ailment': 1,
                      'fellow': 5,
                      'halfway': 1,
                      'swears': 1,
                      'graham': 2,
                      'Shades': 2,
                      'not': 250,
                      'Adrien': 2,
                      'sons': 2,
                      'taught': 2,
                      'fuss': 1,
                      'basket': 2,
                      'isolating': 1,
                      'suffer': 1,
                      'complained': 1,
                      'Cross': 1,
                      'varicolored': 1,
                      'college': 12,
                      'envisioned': 1,
                      'placed': 4,
                      'Handsomest': 1,
                      'boat': 2,
                      'crumbling': 1,
                      'noble': 1,
                      'illustrated': 1,
                      'Nor': 3,
                      'Unlike': 1,
                      'sliding': 1,
                      'glanced': 3,
                      'worried': 7,
                      'hated': 6,
                      'soda': 1,
                      'arrangement': 1,
                      'limping': 1,
                      'laced': 1,
                      'Anniston': 15,
                      'five': 6,
                      'Rosa': 4,
                      'bankruptcy': 1,
                      'common': 3,
                      'Arcilla': 1,
                      'mash': 1,
                      'less': 12,
                      'movement': 1,
                      'gleeful': 1,
                      'scraping': 1,
                      'thrusting': 2,
                      "mustn't": 2,
                      'participate': 1,
                      'jocular': 1,
                      'capital': 1,
                      'cords': 1,
                      'application': 2,
                      'sulked': 1,
                      'temperament': 1,
                      'infection': 1,
                      'smaller': 2,
                      'cheerful': 3,
                      'AA': 1,
                      'dragooned': 1,
                      'sting': 2,
                      'word': 17,
                      'jobless': 1,
                      'magnificent': 2,
                      'Mattie': 1,
                      'restriction': 1,
                      'thinks': 4,
                      'ground': 8,
                      "We'd": 3,
                      'primitive': 1,
                      'bouts': 1,
                      'shrubs': 1,
                      'sloppy': 1,
                      'Walter': 2,
                      'stir': 1,
                      'triangular': 1,
                      'plodding': 2,
                      'blood-soaked': 1,
                      'disquiet': 1,
                      'business': 21,
                      'strength': 8,
                      'leads': 1,
                      'wreck': 2,
                      'adults': 1,
                      'Cooper': 8,
                      'Freed': 1,
                      'conclusions': 1,
                      'swim': 1,
                      'unusual': 5,
                      'Young': 2,
                      'clasped': 2,
                      'talons': 1,
                      'white': 30,
                      'dreamed': 4,
                      'Named': 1,
                      'Road': 1,
                      'Pink': 1,
                      'familial': 1,
                      'hopped': 1,
                      'stumbled': 5,
                      'spit': 1,
                      'carved': 1,
                      'stones': 2,
                      'sort': 10,
                      'bookish': 1,
                      'temptation': 2,
                      'cats': 2,
                      'mastiff': 1,
                      'stoop': 1,
                      'partially': 1,
                      'Petty': 1,
                      'trudged': 1,
                      'caves': 1,
                      'artificial': 1,
                      'safely': 2,
                      'depending': 1,
                      'depression': 3,
                      'assume': 1,
                      'Eugenia': 15,
                      'bear': 4,
                      'wicked': 2,
                      'chapel-like': 1,
                      'co-operate': 1,
                      'potential': 2,
                      'schools': 2,
                      'darted': 1,
                      'recalled': 2,
                      'Byron': 1,
                      'casually': 1,
                      'pleasantly': 1,
                      'handspikes': 3,
                      'russe': 1,
                      'visitors': 1,
                      'Hotel': 1,
                      'risky': 1,
                      'bless': 3,
                      'swift': 1,
                      'Chinaman': 1,
                      'remoteness': 1,
                      'hangers': 1,
                      'cremate': 1,
                      'censure': 1,
                      'crates': 1,
                      'obligated': 1,
                      'traffic': 3,
                      'ear': 3,
                      'tea': 3,
                      'wire-haired': 1,
                      'shackles': 2,
                      'best': 12,
                      'sweetly': 2,
                      'loyal': 1,
                      'trestles': 1,
                      'unquenched': 1,
                      'lingerie': 1,
                      'white-columned': 1,
                      ...}),
             'science_fiction': Counter({'premise': 1,
                      'Without': 1,
                      'vacuum': 1,
                      'critically': 1,
                      'W.': 1,
                      'read': 6,
                      'limited': 2,
                      'logical': 2,
                      'triumphantly': 1,
                      'dig': 1,
                      "They'll": 1,
                      'beyond': 1,
                      'aboard': 2,
                      'anyhow': 1,
                      'speak': 4,
                      'unaware': 1,
                      'encouraged': 1,
                      'Many': 1,
                      'grow': 5,
                      'hegemony': 1,
                      'company': 3,
                      'feel': 4,
                      'muscles': 1,
                      'fast': 4,
                      'long-distance': 1,
                      'darling': 1,
                      'building': 1,
                      'personnel': 6,
                      'Between': 1,
                      'smell': 1,
                      'eaten': 1,
                      'walked': 3,
                      'use': 4,
                      'considered': 1,
                      'worse': 1,
                      'reconstruct': 1,
                      'grok': 5,
                      'wept': 2,
                      'experience': 4,
                      'whichever': 1,
                      'rest': 2,
                      'endure': 1,
                      'whisper': 1,
                      'served': 1,
                      'committee': 2,
                      'dead': 7,
                      'tear': 1,
                      'sheered': 1,
                      'Ozagenians': 1,
                      'Australia': 2,
                      'trip': 3,
                      'hygiene': 1,
                      'survived': 2,
                      'has': 9,
                      'controlled': 1,
                      "That'll": 1,
                      'knot': 1,
                      'are': 23,
                      'maintenance': 1,
                      "instant's": 1,
                      'our': 6,
                      'along': 2,
                      'promoted': 2,
                      'side-effects': 1,
                      'certain': 2,
                      'season': 1,
                      'heaved': 1,
                      'God': 5,
                      'demanded': 2,
                      'My': 2,
                      'others': 4,
                      'philology': 1,
                      'covered': 2,
                      'happiness': 5,
                      'Muslim': 1,
                      'indicating': 1,
                      "skiff's": 1,
                      'anyone': 2,
                      'sure': 5,
                      'entertain': 1,
                      'colossal': 1,
                      'transition': 1,
                      'accident': 3,
                      'calendars': 1,
                      'contrite': 1,
                      'catch': 2,
                      'equally': 1,
                      'contributes': 1,
                      'beautiful': 6,
                      'elaborately': 1,
                      'Ringing': 1,
                      'nymphs': 1,
                      'retorted': 1,
                      'articulate': 1,
                      'first': 15,
                      'is': 47,
                      'pity': 1,
                      'implant': 1,
                      'extensions': 1,
                      'instruments': 1,
                      'alive': 4,
                      'threes-fulfilled': 1,
                      'cradled': 1,
                      'approximations': 1,
                      'shelled': 1,
                      'Malay': 1,
                      'spaces': 1,
                      'Radiation': 1,
                      'page': 1,
                      'discovery': 1,
                      'determine': 1,
                      'under': 5,
                      'infinitive': 1,
                      'correct': 2,
                      'charming': 1,
                      'vibrancy': 1,
                      'none': 4,
                      'glance': 2,
                      'plan': 2,
                      'streaks': 1,
                      'amount': 2,
                      'sonic': 1,
                      'orders': 1,
                      'died': 3,
                      'ship': 20,
                      'ripples': 1,
                      'mineral': 1,
                      'pressure': 3,
                      'unlikely': 1,
                      'Short': 3,
                      'dominant': 1,
                      'ABC': 1,
                      'scientists': 1,
                      "you'll": 1,
                      'directed': 1,
                      'normal': 5,
                      'Jack': 18,
                      'hear': 2,
                      'amended': 1,
                      'start': 3,
                      'man': 17,
                      'parlor': 1,
                      'interrupted': 1,
                      'been': 40,
                      'augmented': 1,
                      'organized': 1,
                      'through': 15,
                      'proper': 4,
                      'shockwave': 1,
                      'corporate': 1,
                      'Neither': 1,
                      'ring': 1,
                      "we'll": 1,
                      'each': 12,
                      'nondefeatist': 1,
                      'Plus': 1,
                      'slid': 1,
                      'free': 3,
                      'dwindle': 1,
                      'hug': 1,
                      'shape': 1,
                      'terrestrial': 1,
                      'grabbing': 1,
                      'decaying': 1,
                      'plain': 3,
                      'changes': 1,
                      'forget': 2,
                      'apparently': 2,
                      'Icelandic': 1,
                      'interstellar': 1,
                      'raised': 1,
                      'love': 3,
                      'Todd': 1,
                      'rise': 1,
                      'clubbed': 1,
                      'repeated': 2,
                      'eventually': 4,
                      'earlier': 1,
                      "'": 11,
                      'unfathomable': 1,
                      'swath': 1,
                      'comes': 1,
                      'service': 1,
                      'day': 6,
                      'hard': 1,
                      'At': 4,
                      'half-transparent': 1,
                      "I'm": 12,
                      'hand-covered': 1,
                      'mastery': 1,
                      'die': 6,
                      'BCD': 1,
                      'Daily': 1,
                      'commingled': 1,
                      'rebuild': 1,
                      'Icelandic-speaking': 1,
                      'green': 1,
                      'precedent': 1,
                      'Romano': 1,
                      'ago': 6,
                      'ladies': 2,
                      'Think': 1,
                      'cartilage': 1,
                      'husband': 2,
                      'friends': 1,
                      'gritty-eyed': 1,
                      'bet': 1,
                      'home': 3,
                      'sands': 1,
                      'insisted': 1,
                      'want': 6,
                      'manage': 1,
                      'negative': 2,
                      'Afterwards': 1,
                      'killed': 4,
                      'small': 6,
                      'Miss': 1,
                      'Times': 1,
                      'solid': 2,
                      'Fuming': 1,
                      'red': 1,
                      'did': 35,
                      'grokked': 4,
                      'copied': 1,
                      'pool': 1,
                      'appearance': 1,
                      'forward': 1,
                      'barriers': 1,
                      'disarmament': 1,
                      'Hills': 1,
                      'lost': 8,
                      'enforced': 1,
                      'babes': 1,
                      'grounds': 1,
                      'Sicily': 1,
                      'content': 2,
                      'message': 1,
                      'next': 3,
                      'Nations': 1,
                      'Fosterite': 1,
                      'infection': 1,
                      'screens': 1,
                      'shall': 3,
                      'waist': 1,
                      'examined': 1,
                      'really': 2,
                      'peoples': 2,
                      'ball': 1,
                      'might': 12,
                      'Mercer': 26,
                      'Uh-huh': 1,
                      'please': 1,
                      'Please': 1,
                      'comforting': 1,
                      'mates': 1,
                      'barefoot': 1,
                      'female': 3,
                      'ritual': 1,
                      'effect': 1,
                      'selected': 2,
                      "don't": 11,
                      'comment': 1,
                      'thin': 3,
                      'attractive': 1,
                      'thrashed': 1,
                      'truth': 2,
                      'courses': 1,
                      'Out': 1,
                      'party': 4,
                      'card': 1,
                      'physiological': 1,
                      'masculine': 2,
                      'weakness': 1,
                      'Hawaiian-Americans': 1,
                      'together': 1,
                      'bunk': 6,
                      'me': 20,
                      'dwarfs': 1,
                      'fear': 1,
                      'keen': 1,
                      'average': 1,
                      'South': 2,
                      'races': 3,
                      'remained': 2,
                      'needed': 5,
                      'People': 3,
                      'late': 1,
                      'Others': 1,
                      'dance': 1,
                      'choking': 1,
                      'Maybe': 4,
                      'ask': 6,
                      'Greenland': 1,
                      'veldt': 1,
                      'found': 9,
                      'sad': 3,
                      'touch': 3,
                      'war': 2,
                      'included': 1,
                      'doubted': 1,
                      'nevertheless': 1,
                      'Nevertheless': 1,
                      'exploring': 1,
                      ':': 10,
                      'titanium': 1,
                      'domes': 1,
                      'What': 14,
                      'pretty': 2,
                      'sense': 5,
                      'supported': 1,
                      'Caribbean': 1,
                      'things': 2,
                      'Now': 8,
                      'Earthmen': 5,
                      'shipboard': 1,
                      'its': 13,
                      'can': 16,
                      'gentleman': 1,
                      'master': 1,
                      'instructing': 1,
                      'independent': 1,
                      'phrase': 2,
                      'dismal': 1,
                      ';': 88,
                      'poured': 1,
                      'rabbit': 1,
                      'forced': 5,
                      'neuter': 1,
                      'Mary': 5,
                      'Swahili': 1,
                      'Suppose': 3,
                      'lights': 1,
                      'expression': 1,
                      'mind': 15,
                      'sir': 2,
                      'field': 5,
                      'warts': 1,
                      'Breath': 1,
                      'snack': 1,
                      "What's": 3,
                      '``': 235,
                      'Urielites': 1,
                      'leave': 2,
                      'Hebrew': 1,
                      'evidence': 1,
                      'importance': 1,
                      'power': 7,
                      'tears': 2,
                      'eliminating': 1,
                      'entire': 1,
                      'permanent': 1,
                      'shadow': 1,
                      'any': 17,
                      'alto': 1,
                      'voyage': 1,
                      'In': 12,
                      'belt': 2,
                      'observe': 1,
                      'cycles': 1,
                      'automatically': 2,
                      'required': 3,
                      'associations': 1,
                      'absolutes': 1,
                      'concealed': 1,
                      'riot': 1,
                      'Lord': 1,
                      'I.Q.': 1,
                      'unique': 2,
                      'beasts': 1,
                      'burning': 1,
                      'practicality': 1,
                      'run': 4,
                      'poisoned': 1,
                      'experimental': 1,
                      'victims': 1,
                      'connections': 1,
                      'exactly': 1,
                      'sparse': 1,
                      'done': 3,
                      'web': 3,
                      'tone': 2,
                      'week': 1,
                      'uniform': 1,
                      'uncomfortable': 2,
                      'thousand': 3,
                      'govern': 1,
                      'verb': 1,
                      'themselves': 3,
                      'Ariadne': 2,
                      'send': 2,
                      'occur': 1,
                      'dome': 2,
                      'music': 2,
                      'visit': 1,
                      'stepping': 1,
                      'Positive': 1,
                      'moral': 1,
                      'simultaneously': 2,
                      "world's": 1,
                      'fine': 2,
                      'himself': 17,
                      'agitation': 1,
                      'dropped': 1,
                      're-examine': 1,
                      'mental': 2,
                      'atomic': 2,
                      'Angels': 8,
                      'my': 30,
                      'agreed': 5,
                      'unnecessary': 1,
                      'Besides': 2,
                      'species': 1,
                      'invade': 1,
                      'considerably': 1,
                      'citizens': 2,
                      'activating': 1,
                      'spring': 1,
                      'centuries': 1,
                      'longer': 8,
                      'sing': 4,
                      'strange': 4,
                      'Though': 3,
                      'Manchester': 1,
                      'methods': 1,
                      'scowled': 1,
                      'loaded': 1,
                      'grokking': 4,
                      'seismological': 1,
                      'wanting': 1,
                      'doorway': 1,
                      'growing': 6,
                      'matches': 1,
                      'water': 7,
                      'Some': 4,
                      'tiptoeing': 1,
                      'miracle': 1,
                      'agreements': 2,
                      'Antares': 1,
                      'unfurled': 1,
                      'considering': 1,
                      'guys': 2,
                      'everybody': 2,
                      'cut': 3,
                      'men': 5,
                      'match': 2,
                      'lotus': 1,
                      'photographs': 1,
                      'marry': 1,
                      'serve': 2,
                      'while': 8,
                      'blaze': 1,
                      'cook': 1,
                      'distance': 2,
                      'feeling': 2,
                      'walking': 1,
                      'sit': 2,
                      'completed': 1,
                      'approach': 1,
                      'final': 3,
                      'face': 8,
                      'That': 5,
                      'looking': 1,
                      "Digby's": 1,
                      'Shadow': 1,
                      'hostile': 1,
                      'CB': 1,
                      'Time': 4,
                      'humor': 2,
                      'periods': 1,
                      'ponderous': 1,
                      'hindered': 1,
                      'Bazaar': 1,
                      'March': 1,
                      'resembled': 1,
                      'lie': 2,
                      'attract': 1,
                      'Anyhow': 1,
                      'reaction': 1,
                      'towards': 4,
                      'Suspicion': 1,
                      'argued': 1,
                      'predict': 1,
                      'Science': 1,
                      'pointed': 2,
                      'Lady': 7,
                      'realized': 1,
                      'If': 13,
                      'same': 6,
                      'dragged': 1,
                      'gay': 1,
                      'midnight': 3,
                      'states': 2,
                      'illness': 1,
                      'place': 8,
                      'swiftly': 1,
                      'elders': 3,
                      'patsy': 1,
                      'committees': 1,
                      'sigh': 2,
                      'amusement': 1,
                      'tasted': 1,
                      'Mike': 20,
                      'Even': 2,
                      'preferences': 2,
                      "B'dikkat's": 1,
                      'jump': 4,
                      'atmosphere': 4,
                      'the': 652,
                      'Thank': 1,
                      'photon-counting': 1,
                      'sometimes': 1,
                      'greater': 1,
                      'therefore': 2,
                      'remain': 1,
                      'Over': 1,
                      'taps': 1,
                      'help': 8,
                      'Philippine': 1,
                      'could': 49,
                      'chipper': 1,
                      'dissimilar': 1,
                      'failed': 1,
                      'sixteenth': 1,
                      'audience': 1,
                      'trusting': 1,
                      'triggered': 1,
                      'opening': 1,
                      'stopped': 3,
                      'decorticated': 1,
                      'aloud': 1,
                      'elaborate': 1,
                      'pride': 1,
                      'planetoids': 1,
                      'rich': 1,
                      'if': 16,
                      'sorry': 1,
                      'darkened': 1,
                      'sealed': 1,
                      'spoke': 9,
                      'Soviet': 1,
                      'something': 8,
                      'astronomical': 1,
                      'here': 8,
                      'ran': 2,
                      'matsyendra': 1,
                      'optimism': 1,
                      'defend': 1,
                      'meant': 6,
                      'actions': 1,
                      'shield': 1,
                      'old': 3,
                      'screams': 1,
                      'tiny': 3,
                      'paced': 1,
                      'knotty': 1,
                      'guru': 1,
                      'regret': 1,
                      'hospital': 1,
                      'politely': 2,
                      'picture': 2,
                      'mature': 1,
                      'morrow': 1,
                      'making': 3,
                      'full': 2,
                      'performance': 2,
                      'count': 2,
                      'Stinky': 1,
                      'correlating': 1,
                      'glittering': 1,
                      'between': 10,
                      'base': 1,
                      'commissioned': 1,
                      'corrosive': 1,
                      'recognized': 2,
                      'reactors': 1,
                      'air': 1,
                      'report': 1,
                      'solar': 1,
                      'scraped': 1,
                      'answers': 1,
                      'metal': 2,
                      'sound': 5,
                      'concerti': 1,
                      'film': 1,
                      'lovelies': 1,
                      'World': 1,
                      'sleepily': 2,
                      'assigned': 1,
                      'insoluble': 1,
                      'diplomatic': 2,
                      'eyes': 9,
                      'lightyears': 1,
                      'Holies': 1,
                      'hearth': 1,
                      'deformities': 1,
                      'Ones': 2,
                      'deposits': 1,
                      'brilliantly': 1,
                      'Coal': 2,
                      'Tabernacle': 1,
                      'felt': 13,
                      'discorporate': 2,
                      'Not': 1,
                      'modern': 1,
                      "Hal's": 1,
                      'spat': 1,
                      "Angel's": 2,
                      'chills': 1,
                      'baby': 1,
                      'hideous': 1,
                      'dismissed': 1,
                      'girls': 2,
                      'told': 3,
                      'they': 53,
                      'deceleration': 2,
                      'chicken': 1,
                      'find': 4,
                      'center': 5,
                      'The': 71,
                      'casual': 1,
                      'heat': 1,
                      'downstairs': 1,
                      'gases': 2,
                      'boot': 1,
                      'of': 321,
                      'parts': 2,
                      'crooned': 2,
                      'celebrated': 1,
                      'gripes': 1,
                      'published': 2,
                      'law': 2,
                      'spiritual': 1,
                      'Where': 5,
                      'scouts': 1,
                      'blastdown': 1,
                      'fact': 5,
                      'where': 10,
                      'Larry': 1,
                      'advanced': 1,
                      'whole': 1,
                      'fleeting': 1,
                      'lavishly': 1,
                      'limits': 1,
                      'private': 1,
                      'memorize': 1,
                      'presumed': 1,
                      'He': 52,
                      'lived': 3,
                      'scanners': 1,
                      'threshed': 1,
                      'protest': 1,
                      'Candide': 1,
                      'rewarding': 1,
                      'annals': 1,
                      'trouble': 3,
                      'fell': 2,
                      'totality': 1,
                      'reference': 1,
                      'compute': 1,
                      'pains': 2,
                      'low': 4,
                      'facts': 2,
                      'cremated': 1,
                      'Hudson': 1,
                      'moss': 1,
                      'unusually': 1,
                      'enjoyed': 3,
                      'virtually': 1,
                      'disastrous': 1,
                      'weep': 1,
                      'peculiar': 1,
                      'faster': 2,
                      'too': 14,
                      'Siberia': 2,
                      'linguists': 1,
                      'bulwark': 1,
                      'peculiarities': 1,
                      'aspect': 1,
                      'confirmed': 1,
                      '25': 1,
                      'outside': 8,
                      'later': 6,
                      'partnered': 1,
                      'resumed': 1,
                      'section': 1,
                      'language': 3,
                      'practicing': 1,
                      'book': 1,
                      'sprawled': 1,
                      'every': 4,
                      'Shayol': 4,
                      'following': 2,
                      'eggs': 1,
                      'deliberately': 1,
                      'suggested': 3,
                      'Naturally': 2,
                      'nature': 2,
                      'event': 1,
                      'married': 3,
                      'codified': 1,
                      'instantly': 1,
                      'do-good': 1,
                      'shrugged': 1,
                      'doing': 4,
                      '5': 1,
                      'ticket': 1,
                      '138': 1,
                      'purpose': 1,
                      'star': 4,
                      'whereby': 1,
                      'smiling': 1,
                      'Thus': 2,
                      'visual': 2,
                      'number': 3,
                      'enormous': 2,
                      'eat': 3,
                      'practice': 1,
                      'tape': 1,
                      'beauty': 1,
                      "it'll": 1,
                      'minute': 1,
                      'harvest': 1,
                      "he'll": 1,
                      'floor': 1,
                      'properly': 1,
                      'slave': 1,
                      'Tibetan': 1,
                      'angelic': 2,
                      'far': 2,
                      'mapping': 1,
                      'mutual': 1,
                      'easily': 1,
                      'planet': 10,
                      'flash': 1,
                      'shortly': 1,
                      'blushed': 1,
                      'library': 2,
                      'blanket': 1,
                      'Dawn': 1,
                      'walk': 1,
                      'disgrace': 1,
                      'missing': 1,
                      'actually': 1,
                      'shouldered': 1,
                      'dressed': 1,
                      'limitations': 1,
                      'nor': 1,
                      'after': 11,
                      'rock-steady': 1,
                      'thought': 11,
                      'ward': 2,
                      'lungs': 1,
                      'anyway': 1,
                      'programed': 1,
                      'on': 85,
                      'conviction': 1,
                      'bargain': 1,
                      'directions': 1,
                      'Perhaps': 3,
                      'Hell': 1,
                      'showing': 1,
                      'big': 6,
                      'ambulatory': 1,
                      'surviving': 1,
                      'recommend': 1,
                      'put': 2,
                      'bodies': 10,
                      'Sandalphon': 3,
                      'Mars': 9,
                      'telescope': 1,
                      'Arizona': 1,
                      'pleasure': 4,
                      'civilization': 1,
                      'cold': 2,
                      'punk': 1,
                      'northern': 3,
                      'opened': 2,
                      'fun': 1,
                      'readings': 1,
                      'period': 4,
                      'excited': 1,
                      'radioactive': 1,
                      'misuse': 1,
                      'lamechian': 1,
                      "R.'s": 1,
                      'powers': 2,
                      'seemed': 4,
                      'decided': 2,
                      'fate': 2,
                      'Holy': 1,
                      'Heavenly': 1,
                      'stood': 5,
                      'Exploration': 1,
                      'electronic': 1,
                      'everlasting': 1,
                      'pronounced': 1,
                      'stolen': 1,
                      'these': 12,
                      'pick': 1,
                      'clear': 3,
                      'patterns': 2,
                      'girl': 2,
                      'pranha': 1,
                      'pidgin': 1,
                      'happened': 4,
                      'insomnia': 2,
                      'worth': 3,
                      'destination': 2,
                      'combine': 1,
                      'adjectives': 1,
                      'expectations': 1,
                      'squat': 1,
                      'Service': 2,
                      'cares': 1,
                      'accuse': 1,
                      'Remember': 2,
                      'hands': 7,
                      'laxness': 1,
                      'prognosis': 1,
                      'manipulation': 1,
                      'now': 19,
                      'suspensor': 2,
                      'curve': 1,
                      'A': 14,
                      'funeral': 1,
                      'planning': 1,
                      'massive': 1,
                      'radiated': 1,
                      'nicest': 1,
                      'lady': 4,
                      'chase': 1,
                      'already': 6,
                      'surrounding': 1,
                      'colder': 1,
                      'last': 9,
                      'turtle': 1,
                      'Another': 2,
                      'asteroid': 1,
                      'exercise': 1,
                      'Will': 1,
                      'three': 6,
                      'Second': 1,
                      'automatic': 2,
                      'purposes': 2,
                      'itself': 4,
                      'nations': 1,
                      'Monitor': 1,
                      'thicken': 1,
                      'Fine': 2,
                      'Schools': 1,
                      'play': 3,
                      'beginnings': 1,
                      'grinned': 1,
                      'clearer': 1,
                      'industrial': 1,
                      'grammatical': 1,
                      'take': 11,
                      'Tristan': 1,
                      'goodness': 3,
                      'notion': 1,
                      'stretched': 1,
                      'beast': 1,
                      'horrified': 2,
                      'carry': 3,
                      'apartment': 1,
                      'cloud': 1,
                      "dabhumaksanigalu'ahai": 1,
                      'personal': 1,
                      'difficulty': 1,
                      'manipulated': 1,
                      'bomb': 1,
                      'people': 20,
                      'may': 4,
                      'resources': 1,
                      'keening': 1,
                      "Let's": 1,
                      'present': 2,
                      'radar': 2,
                      'revive': 1,
                      'necessary': 1,
                      'Dark': 1,
                      'as': 64,
                      'hit': 2,
                      'helium': 1,
                      'be': 80,
                      'ecological': 1,
                      'must': 8,
                      'names': 1,
                      'helpful': 1,
                      'ready': 4,
                      'faintest': 1,
                      'corpses': 1,
                      'metallic': 1,
                      'vegetable': 1,
                      'an': 33,
                      'name': 2,
                      'winds': 2,
                      'marriage': 2,
                      'Farrell': 1,
                      'Well': 4,
                      'advantage': 1,
                      "isn't": 6,
                      'salted': 1,
                      'trying': 2,
                      'being': 11,
                      'Same': 1,
                      'miss': 1,
                      'Listen': 2,
                      'minutes': 3,
                      'mate': 1,
                      'marks': 1,
                      'oxygen': 3,
                      'nose': 1,
                      'we': 30,
                      'friendly': 1,
                      'load': 1,
                      'wrangled': 1,
                      'Venusians': 1,
                      "Earthmen's": 1,
                      'necklaces': 1,
                      'story': 1,
                      'hospitable': 1,
                      'creepy': 1,
                      'letting': 1,
                      'proficiency': 1,
                      'central': 1,
                      "Jubal's": 3,
                      'Part': 1,
                      'pieces': 1,
                      'Tibet': 1,
                      'perceive': 1,
                      'supposed': 1,
                      'Visiting': 1,
                      'force': 1,
                      'grass': 2,
                      'prevented': 1,
                      'understanding': 1,
                      'Bay': 1,
                      'bother': 2,
                      'nudged': 2,
                      'nirvana': 1,
                      'violations': 1,
                      'how': 12,
                      'possibility': 2,
                      'eastern': 1,
                      'should': 3,
                      'ten': 1,
                      'Had': 2,
                      'wildly': 1,
                      'some': 16,
                      "child's": 1,
                      'alone': 3,
                      'dusty': 2,
                      'pardon': 1,
                      'half': 10,
                      'Above': 1,
                      'food': 1,
                      'Dissect': 1,
                      'whether': 3,
                      'serviettes': 1,
                      'recordings': 2,
                      'increased': 1,
                      'living': 3,
                      'breathe': 2,
                      'craters': 1,
                      'declaration': 1,
                      'into': 22,
                      'precepts': 1,
                      'exceedingly': 1,
                      'Get': 1,
                      'question': 4,
                      'Insects': 1,
                      'address': 1,
                      'else': 2,
                      'random': 1,
                      'local': 1,
                      'classical': 1,
                      'let': 4,
                      'above': 4,
                      'qualified': 1,
                      'gave': 3,
                      'maintained': 1,
                      'We': 15,
                      'necessity': 2,
                      'grown': 3,
                      'millennia': 1,
                      'follow': 2,
                      'smelling': 1,
                      'National': 1,
                      "Gabriel's": 1,
                      'method': 1,
                      'looks': 1,
                      'shell-psychology': 1,
                      'existed': 1,
                      'trance': 2,
                      'because': 10,
                      'given': 5,
                      'give': 7,
                      "Grandmothers'": 1,
                      'situations': 1,
                      ...})})

simpler example...

  • Look at two genres: news and romance
  • For each genre, loop over every word in the genre to produce (genre, word) pairs

In [31]:
genre_word = [(genre, word)
              for genre in ['news', 'romance']
              for word in brown.words(categories=genre)]
len(genre_word)


Out[31]:
170576

In [32]:
genre_word[:4]


Out[32]:
[('news', 'The'), ('news', 'Fulton'), ('news', 'County'), ('news', 'Grand')]

In [33]:
genre_word[-4:]


Out[33]:
[('romance', 'afraid'),
 ('romance', 'not'),
 ('romance', "''"),
 ('romance', '.')]

In [34]:
cfd = nltk.ConditionalFreqDist(genre_word)
cfd


Out[34]:
defaultdict(nltk.probability.FreqDist,
            {'news': Counter({'Without': 4,
                      'festival': 2,
                      '5-run': 2,
                      'Motorists': 1,
                      'builders': 7,
                      'taxing': 1,
                      'More': 12,
                      'tribunals': 1,
                      '23': 9,
                      'Anthony': 1,
                      'assent': 1,
                      'speak': 3,
                      'located': 6,
                      'economics': 3,
                      'contented': 1,
                      'education': 31,
                      'house': 26,
                      'promises': 3,
                      'skilled': 1,
                      'conspirators': 2,
                      'officiating': 1,
                      'overdeveloped': 1,
                      'Gerald': 3,
                      'tube': 1,
                      'Wilmette': 3,
                      'building': 15,
                      'Adams': 1,
                      'Exploratory': 1,
                      'waters': 4,
                      'Arvey': 1,
                      'ex-National': 1,
                      'Minnie': 2,
                      'Between': 1,
                      'shouldda': 1,
                      'threshold': 3,
                      'junction': 1,
                      'restrain': 1,
                      'operating': 4,
                      'drafts': 2,
                      'LSU': 1,
                      'use': 31,
                      "bridegroom's": 2,
                      'authoritative': 1,
                      'thoroughfare': 1,
                      'Olivia': 1,
                      'experience': 6,
                      'endure': 2,
                      'served': 17,
                      'committee': 38,
                      'Norm': 2,
                      'bars': 1,
                      'sports': 6,
                      'Feis': 1,
                      'population': 13,
                      'Sr.': 1,
                      "Baltimore's": 2,
                      'Mickey': 12,
                      'thoroughly': 1,
                      'nomination': 6,
                      'Omsk': 1,
                      'views': 7,
                      'Much': 2,
                      'plunder': 1,
                      'forum': 1,
                      "Mayor's": 2,
                      'abrupt': 1,
                      'certain': 18,
                      'season': 43,
                      'Gardner': 1,
                      'Coopers': 1,
                      'Melcher': 1,
                      'glutted': 1,
                      'Continental': 7,
                      'shade': 1,
                      'others': 15,
                      'budgets': 2,
                      'Durwood': 1,
                      'vicinity': 1,
                      'manufacturers': 10,
                      'Patrolmen': 1,
                      'Polls': 1,
                      '6.5': 1,
                      'Callan': 1,
                      'Zoe': 1,
                      'McN.': 1,
                      'Milwaukee': 4,
                      'beautiful': 5,
                      'gesture': 2,
                      'intimidation': 1,
                      'sixth': 10,
                      'crypt': 1,
                      'august': 1,
                      'marriages': 4,
                      'head-and-shoulders': 1,
                      'all-important': 1,
                      'self-respect': 1,
                      'Couturier': 1,
                      'denying': 1,
                      'Schrunk': 1,
                      'handy': 1,
                      'Reputedly': 1,
                      'cheer': 2,
                      'retains': 1,
                      "countin'": 1,
                      'imposition': 1,
                      'believes': 8,
                      'Stevens': 4,
                      'constituents': 1,
                      'chic': 4,
                      'chicken': 4,
                      'determine': 3,
                      'drab': 1,
                      'barrage': 1,
                      'Loen': 1,
                      'goaded': 1,
                      'pitchers': 4,
                      'correct': 3,
                      'corporation': 7,
                      'kitchen': 5,
                      'amount': 18,
                      '$7,500,000': 1,
                      'crack': 2,
                      'omitted': 1,
                      'Called': 1,
                      'pressure': 12,
                      'Md.': 2,
                      'Chinese': 2,
                      '2-score-year': 1,
                      'darling': 1,
                      '22-acre': 1,
                      'amended': 1,
                      'adviser': 3,
                      'perturbed': 1,
                      'won-lost': 1,
                      "wouldn't": 2,
                      'scholastic': 2,
                      'hard-hit': 1,
                      'scratching': 1,
                      'through': 54,
                      'classical': 3,
                      'Dallas-based': 1,
                      'halftime': 1,
                      'nationalism': 2,
                      'Just': 7,
                      'specific': 4,
                      'each': 61,
                      'Because': 4,
                      'Plus': 2,
                      'Ah': 2,
                      'Justice': 1,
                      '13-1/2': 1,
                      'tons': 1,
                      'Dennis': 1,
                      'stakes': 1,
                      'menaced': 1,
                      'Walters': 1,
                      'France': 7,
                      'inviolate': 1,
                      'ringsiders': 1,
                      'ceased': 1,
                      "yesterday's": 1,
                      'Democrats': 11,
                      'Pitcher': 1,
                      'cottage': 1,
                      'raised': 8,
                      'Todd': 1,
                      'place-kicker': 1,
                      'stream': 3,
                      'revealed': 1,
                      'Ernst': 1,
                      'enunciate': 1,
                      'grave': 2,
                      'Sunnyvale': 3,
                      'believing': 2,
                      'keyhole': 1,
                      'second-degree': 2,
                      'Hinsdale': 1,
                      'situations': 2,
                      'summoned': 4,
                      'blackout': 1,
                      'Doris': 1,
                      'At': 38,
                      'roundup': 1,
                      "I'm": 18,
                      'deodorant': 2,
                      'focused': 1,
                      'patrolman': 1,
                      '$17': 1,
                      'colleagues': 1,
                      'balconies': 1,
                      'acquittal': 2,
                      'scoop': 1,
                      'Those': 8,
                      'adoption': 2,
                      'rejection': 3,
                      'CBS': 1,
                      'Buddy': 1,
                      'Nelson': 4,
                      'Manville': 1,
                      'career': 10,
                      'baggage': 2,
                      'husband': 13,
                      'friends': 11,
                      'bet': 2,
                      'shouting': 1,
                      'worse': 6,
                      "Hallowell's": 1,
                      'slip': 1,
                      'negative': 2,
                      'Suddenly': 1,
                      'Afterwards': 1,
                      'killed': 5,
                      'small': 27,
                      'Television-Electronics': 1,
                      'bull': 1,
                      'Ruling': 1,
                      'enemy': 3,
                      'continue': 11,
                      'Equator': 1,
                      'understatement': 1,
                      'quo': 1,
                      'Peking': 1,
                      'Hamey': 1,
                      'neckline': 2,
                      'recession': 3,
                      'Worth': 3,
                      'missiles': 3,
                      'recently': 11,
                      'tournament': 13,
                      'enforced': 3,
                      'steamship': 1,
                      'legislation': 14,
                      'next': 40,
                      'Ghormley': 1,
                      'propeller': 1,
                      'sly': 1,
                      'reared': 1,
                      'disagreed': 2,
                      'strike': 12,
                      'four': 73,
                      'Larson': 5,
                      'Marines': 1,
                      'really': 13,
                      'peoples': 1,
                      'singled': 6,
                      'Foliage': 1,
                      'inflicted': 1,
                      'ideology': 1,
                      'comforting': 1,
                      'rushed': 2,
                      'borrowed': 2,
                      'uncorked': 1,
                      'signals': 1,
                      'effect': 17,
                      'senator': 3,
                      'Pete': 2,
                      'exact': 1,
                      'NBC': 2,
                      'tulle': 1,
                      'whites': 2,
                      "Pirates'": 1,
                      "Lenobel's": 1,
                      '187.5': 1,
                      'smoother': 1,
                      'cell': 2,
                      'accounted': 2,
                      'King': 3,
                      'vice-chairman': 1,
                      'artists': 5,
                      'brokers': 2,
                      'lag': 1,
                      '17,000': 1,
                      "Berry's": 1,
                      'swearing-in': 1,
                      'spending': 9,
                      'confronting': 2,
                      'Amateur': 1,
                      'prisoners': 1,
                      'probation': 6,
                      'twelve': 1,
                      'Parmer': 1,
                      'Joseph': 18,
                      'Culbertson': 1,
                      'tip': 1,
                      'Bureau': 2,
                      'daylight': 1,
                      'Carmichael': 1,
                      'colts': 1,
                      'Camilla': 1,
                      'clerk': 4,
                      'Nori': 1,
                      'regretted': 1,
                      '675': 1,
                      'Michaels': 1,
                      '687.87': 1,
                      'assets': 1,
                      'intends': 1,
                      'ask': 7,
                      'found': 30,
                      'hope': 11,
                      'writes': 1,
                      'win': 10,
                      'statutory': 2,
                      'schedule': 9,
                      'dug': 2,
                      'scholarships': 4,
                      'Haynes': 1,
                      'mother': 15,
                      'Galt': 1,
                      'Jorge': 1,
                      'things': 10,
                      'Now': 11,
                      'thank': 1,
                      'Huntingtons': 1,
                      'shortcuts': 1,
                      'alienated': 1,
                      'Ponce': 1,
                      'department': 12,
                      'colorful': 1,
                      'independent': 3,
                      'February': 10,
                      'chow': 1,
                      '3-0': 1,
                      'exported': 2,
                      '4:05': 1,
                      'yardstick': 1,
                      'permission': 2,
                      'divided': 4,
                      'reconstruction': 1,
                      'Ky.': 2,
                      '18': 20,
                      'Grove': 4,
                      'inaugural': 1,
                      'stops': 2,
                      "fund's": 3,
                      'sir': 1,
                      'bat': 6,
                      'boosting': 1,
                      'Fra': 1,
                      'investigations': 1,
                      'advisers': 7,
                      'one-third': 1,
                      'brethren': 1,
                      'pricing': 3,
                      'snack': 1,
                      'fairway': 5,
                      'constituted': 4,
                      'our': 55,
                      'Wright': 1,
                      'German': 8,
                      'Molly': 2,
                      'Nairne': 2,
                      'Bassi': 1,
                      'Welfare': 3,
                      'neighbor': 4,
                      'submarines': 5,
                      'signal': 1,
                      'vehicles': 2,
                      'battalions': 1,
                      '1959': 21,
                      'grabbed': 2,
                      'musicians': 3,
                      'managing': 2,
                      'McEachern': 1,
                      "county's": 1,
                      'taper': 1,
                      'Owls': 1,
                      'Attendance': 1,
                      'deductible': 2,
                      'ordinarily': 1,
                      'Latinovich': 1,
                      'Cox': 4,
                      'outset': 1,
                      'Gomez': 1,
                      'required': 11,
                      'Cadillac': 1,
                      'milestone': 2,
                      'Crumlish': 1,
                      'wages': 11,
                      'surpassed': 1,
                      "players'": 1,
                      'candid': 1,
                      'Lord': 2,
                      'Division': 5,
                      'chicanery': 1,
                      'crews': 1,
                      '1961': 42,
                      'strip': 4,
                      '6th': 1,
                      'hunting': 2,
                      'Joplin': 1,
                      'resource': 1,
                      'run': 35,
                      'discussed': 6,
                      'speedy': 1,
                      'stores': 4,
                      'I.': 7,
                      'Moments': 1,
                      'demanded': 3,
                      'breathes': 1,
                      'effected': 1,
                      'Evans': 1,
                      'grandchildren': 2,
                      'Cocktails': 2,
                      'Opposition': 1,
                      '72-hole': 1,
                      'Shiflett': 2,
                      'themselves': 19,
                      'developing': 4,
                      'academic': 10,
                      'Luette': 1,
                      'Deputy': 2,
                      "Wednesday's": 1,
                      'Supervisor': 1,
                      'Vernava': 3,
                      'eligible': 4,
                      'pecan': 1,
                      'capitalist': 1,
                      'positive': 2,
                      'Santa': 6,
                      'parole': 4,
                      'drunk': 1,
                      'Herridge': 1,
                      'Howard': 13,
                      'newsletter': 1,
                      'Ted': 4,
                      'elegant': 2,
                      'reader': 1,
                      'thorny': 1,
                      'Maris': 36,
                      'Second': 5,
                      'Stamford': 2,
                      'sophomore': 3,
                      'pulled': 3,
                      'blacked': 1,
                      'background': 3,
                      'continuous': 1,
                      'naughtier': 1,
                      'Taxation': 1,
                      'transition': 3,
                      'sentences': 1,
                      'methods': 10,
                      'visited': 4,
                      'loaded': 4,
                      '$5': 2,
                      'visa': 1,
                      'disposed': 1,
                      'Also': 9,
                      'radio': 6,
                      'coat': 2,
                      'Agencies': 1,
                      'delivery': 1,
                      'vindication': 2,
                      'trees': 2,
                      'melee': 2,
                      'Elmer': 1,
                      "they're": 5,
                      'Cotty': 1,
                      'brochures': 1,
                      'Oscar': 2,
                      'kickoff': 2,
                      'exchange': 5,
                      'Frederick': 3,
                      'due': 9,
                      'Fletcher': 1,
                      'congratulated': 2,
                      'Sheldon': 4,
                      'Turnpike': 6,
                      'plea': 3,
                      'warmed': 3,
                      'checked': 2,
                      'problem': 39,
                      'freer': 1,
                      'Morocco': 2,
                      'pulse': 1,
                      'modification': 1,
                      'Summer': 1,
                      'winter': 8,
                      'distance': 2,
                      '5000': 1,
                      'percentage': 4,
                      'receive': 15,
                      'Gauer': 2,
                      'Trustees': 2,
                      'Mortgage': 1,
                      'specialize': 1,
                      'notarized': 1,
                      'informal': 4,
                      'Collins': 4,
                      'looking': 7,
                      '37,679': 1,
                      'catchers': 1,
                      'picker': 1,
                      'British': 24,
                      '255': 1,
                      'Operating': 2,
                      'Time': 6,
                      'humor': 4,
                      "John's": 1,
                      'heyday': 1,
                      'Hat': 1,
                      'Achaeans': 1,
                      'delegation': 3,
                      'Briar': 1,
                      "We're": 3,
                      'collected': 6,
                      'Samoa': 1,
                      'Big': 4,
                      'attract': 2,
                      'Kimpton': 4,
                      'ranked': 2,
                      'Birgit': 1,
                      'pointed': 9,
                      'guiding': 1,
                      'Usually': 1,
                      'determination': 3,
                      'amendment': 5,
                      'evacuation': 1,
                      'slowly': 3,
                      'place': 25,
                      'piece': 3,
                      'Korman': 1,
                      'garrulous': 1,
                      'moldboard': 1,
                      'Kelly': 1,
                      "Princes'": 1,
                      'committees': 4,
                      'payroll': 13,
                      'bombers': 1,
                      'underestimate': 1,
                      'first': 143,
                      'Stetsons': 1,
                      'atmosphere': 5,
                      'union': 21,
                      'urgent': 3,
                      'track': 10,
                      'ex-gambler': 1,
                      'Salvatore': 1,
                      'gulf': 1,
                      'inflamed': 1,
                      '1859': 1,
                      'textile-producing': 2,
                      'announcements': 2,
                      'Natural': 1,
                      '1825': 1,
                      'instrument': 1,
                      'pork': 1,
                      'could': 86,
                      'Lawrence': 12,
                      'teenagers': 3,
                      'Federal': 17,
                      'gruonded': 1,
                      'slippers': 1,
                      'stopped': 6,
                      'Wilson': 6,
                      'aloud': 1,
                      'elaborate': 3,
                      'Revolutionary': 1,
                      'Along': 3,
                      'pride': 3,
                      'Soule': 1,
                      'Franklin': 8,
                      'twisting': 1,
                      'Neiman-Marcus': 4,
                      'Kanin': 1,
                      '13th': 5,
                      '111': 1,
                      'increasingly': 2,
                      'inflexible': 1,
                      'Ind.': 1,
                      'search': 3,
                      'infiltrating': 1,
                      'reflexes': 2,
                      'delinquency': 1,
                      'wagons': 1,
                      'optimism': 2,
                      'meant': 3,
                      'Monroe': 5,
                      'morals': 1,
                      'ivory': 1,
                      'luncheons': 1,
                      'forbids': 1,
                      'Successful': 1,
                      'businesses': 2,
                      'Dumont': 2,
                      'Petipa-Tschaikowsky': 1,
                      'picture': 7,
                      'cuisine': 1,
                      'guarding': 1,
                      'A.M.': 2,
                      'sympathetic': 1,
                      'unemployment': 1,
                      'Vital': 1,
                      'laurels': 1,
                      'Yogi': 2,
                      'convention': 2,
                      'Bourguiba': 1,
                      'buildings': 6,
                      'between': 47,
                      "Gannon's": 1,
                      '2269': 1,
                      'cost': 15,
                      'exempt': 2,
                      '1953': 5,
                      'R-Warren': 1,
                      'nailed': 1,
                      'Inter-American': 1,
                      'Retail': 1,
                      'outmoded': 1,
                      'eighth': 2,
                      'nine-game': 1,
                      'costlier': 1,
                      'masterpiece': 2,
                      'recognize': 1,
                      'improve': 4,
                      'lefthander': 2,
                      'feeds': 1,
                      'marked': 4,
                      'Anderson': 8,
                      '$15': 1,
                      'Pye': 1,
                      'higher': 26,
                      'lovely': 2,
                      'Dutch': 1,
                      'Boonton': 1,
                      'fast-grossing': 1,
                      'Cezannes': 1,
                      'Menderes': 2,
                      'golfing': 1,
                      'Security': 4,
                      'foliage': 1,
                      'Alpha': 1,
                      'Stanley': 2,
                      'residences': 1,
                      'penalized': 1,
                      'firmly': 5,
                      'Until': 5,
                      'marketing': 3,
                      'backed': 1,
                      'handicapped': 1,
                      'Achievement': 7,
                      'bateau': 2,
                      'Bay-front': 1,
                      'Neuberger': 1,
                      'Harveys': 1,
                      'Slate': 5,
                      'Golden': 3,
                      "carpenters'": 1,
                      'endowments': 2,
                      "union's": 2,
                      'hospitable': 1,
                      'dismissed': 1,
                      "writers'": 1,
                      'Campagnoli': 1,
                      'told': 53,
                      'Alice': 3,
                      'distinctly': 1,
                      'wiping': 1,
                      'Transportation': 1,
                      'monopolistic': 2,
                      'The': 806,
                      'promise': 3,
                      'D.': 24,
                      '5155': 1,
                      'Hodges': 7,
                      'sportsman': 1,
                      'Boy': 1,
                      'Ocean': 1,
                      'merited': 1,
                      'of': 2849,
                      'under': 83,
                      'Gateway': 1,
                      'court-appointed': 1,
                      '7.5': 1,
                      'celebrated': 2,
                      '47': 4,
                      '$10,000': 5,
                      'withdrawn': 1,
                      'law': 38,
                      'spiritual': 1,
                      'slugger': 3,
                      'bankers': 8,
                      'Innumerable': 1,
                      '1920s': 1,
                      'untold': 1,
                      'Stalling': 1,
                      'nearing': 4,
                      'forgetting': 1,
                      '553': 1,
                      'police': 31,
                      'Snodgrass': 1,
                      'inboard': 1,
                      "Angeles'": 2,
                      '155-yarder': 1,
                      'rear': 4,
                      'ova': 1,
                      'bustling': 1,
                      'metal': 1,
                      "Brooks's": 1,
                      'refocusing': 1,
                      'industrialist': 1,
                      'lived': 3,
                      'success': 4,
                      'wars': 2,
                      'resisting': 1,
                      'Sanger-Harris': 1,
                      'Eastland': 1,
                      'SMU': 5,
                      'murdered': 1,
                      'Sheets': 4,
                      'Brod': 1,
                      'Rylie': 1,
                      "ladies'": 1,
                      'maid': 3,
                      'low': 7,
                      "Beadles'": 1,
                      'Hudson': 2,
                      'unusually': 2,
                      'retaliating': 1,
                      'assembled': 1,
                      'announcing': 2,
                      'crabapple': 1,
                      'recovery': 9,
                      'peculiar': 2,
                      'display': 4,
                      'Cover': 1,
                      'Dearborn': 1,
                      'Fatima': 1,
                      'well': 43,
                      'Siberia': 2,
                      'orders': 7,
                      'bulwark': 1,
                      'Baltimorean': 1,
                      "Cotten's": 1,
                      'lucky': 1,
                      'factory': 3,
                      'entry': 2,
                      'overhead': 2,
                      'Dame': 5,
                      'outside': 10,
                      'Berger': 5,
                      '1630': 1,
                      'arc': 1,
                      '355': 1,
                      'book': 13,
                      'subscribe': 1,
                      'recommends': 1,
                      '101b': 1,
                      'Eire': 1,
                      'unrest': 1,
                      'planes': 2,
                      'suggested': 8,
                      'Basic': 1,
                      'begging': 1,
                      'Grover': 4,
                      "Grizzlies'": 1,
                      'employees': 11,
                      'mentions': 1,
                      'Notre': 5,
                      'Renaissance': 1,
                      'arrive': 5,
                      'jacket': 2,
                      'nightly': 1,
                      'bridesmaids': 1,
                      'intend': 3,
                      'pars': 3,
                      'presents': 1,
                      'Engaging': 2,
                      'purpose': 9,
                      'broadened': 1,
                      'give': 32,
                      'deferred': 1,
                      'absolute': 1,
                      'Hartman': 4,
                      'membership': 5,
                      'monopolies': 3,
                      'spotlights': 1,
                      'Richards': 1,
                      'Funeral': 5,
                      'dishes': 1,
                      'housing': 14,
                      'beauty': 4,
                      'colonialist': 1,
                      'battery': 1,
                      'Bellows': 5,
                      'Stetson': 1,
                      'Beatrice': 1,
                      'flatly': 1,
                      'glove': 4,
                      'blond': 2,
                      'Richard': 27,
                      'semester': 3,
                      'Branum': 1,
                      '18,792': 1,
                      '1966': 3,
                      'dynamic': 4,
                      'Decries': 1,
                      'applying': 1,
                      'allow': 5,
                      'critical': 6,
                      'Superior': 5,
                      'correctness': 1,
                      'Werner': 5,
                      '3505o': 1,
                      'Aj': 14,
                      'hoodlums': 2,
                      'boost': 6,
                      "Dresbachs'": 1,
                      'missing': 3,
                      'actually': 10,
                      'Several': 5,
                      'economist': 3,
                      'explanation': 3,
                      'replied': 5,
                      'juice': 1,
                      'Billy': 3,
                      'after': 127,
                      'Sees': 1,
                      'homers': 3,
                      'Hewlett-Woodmere': 1,
                      'questioning': 1,
                      'assailant': 1,
                      'ward': 7,
                      'reached': 15,
                      'anyway': 3,
                      'Balkanizing': 1,
                      '469': 1,
                      'response': 5,
                      'Japan': 3,
                      'bargain': 1,
                      'compared': 6,
                      'authority': 8,
                      'Point': 3,
                      'Perhaps': 3,
                      'grace': 2,
                      'jury-tampering': 1,
                      'Audubon': 3,
                      'troopships': 1,
                      'football': 14,
                      'strikingly': 1,
                      'big': 38,
                      'Juvenile': 3,
                      'Sidney': 2,
                      '180-degrees': 1,
                      'services': 18,
                      'nourished': 1,
                      'Cod': 2,
                      'attempted': 5,
                      'completely': 6,
                      'journey': 2,
                      'C': 4,
                      'wooden': 1,
                      'cold': 6,
                      'opened': 9,
                      'resumption': 2,
                      'Elliott': 1,
                      'rookies': 2,
                      'radioactive': 1,
                      'blabbed': 1,
                      'Baker': 6,
                      'fate': 1,
                      'Holy': 2,
                      "Tuesday's": 1,
                      'satin': 4,
                      'stood': 4,
                      'worst': 2,
                      'Boxwood': 1,
                      'lend': 1,
                      'instrumentals': 1,
                      'Milenoff': 1,
                      'Washington-Alexandria': 1,
                      'street': 5,
                      'Horstman': 1,
                      'Gets': 1,
                      '158-pounder': 1,
                      'gas-fired': 1,
                      'Parker': 5,
                      'Belleville': 1,
                      '11,744': 1,
                      'Deloris': 1,
                      'inlaid': 1,
                      'current': 13,
                      'View': 2,
                      'senseless': 1,
                      'Grayson': 1,
                      'fill': 3,
                      'Redevelopment': 3,
                      'Section': 4,
                      'expectations': 1,
                      'Sybert': 1,
                      'Allegretti': 1,
                      'modernized': 1,
                      'accuse': 1,
                      'town': 18,
                      'fulfilled': 3,
                      '$700': 2,
                      'Economically': 1,
                      'settling': 1,
                      'now': 76,
                      'Zinman': 1,
                      'solos': 1,
                      'Bonanza': 1,
                      'full-fledged': 1,
                      'dramatic': 8,
                      'erupts': 1,
                      'bestowal': 1,
                      'Cooperation': 1,
                      'cab': 3,
                      'domination': 1,
                      'De': 11,
                      'Yankee': 6,
                      '4-year-old': 1,
                      "we'll": 4,
                      'lay': 5,
                      'Belanger': 1,
                      'Carson': 1,
                      'newcomers': 1,
                      'loss': 11,
                      'three': 97,
                      "Atlanta's": 4,
                      'automatic': 5,
                      'consulted': 4,
                      'demand': 17,
                      'batch': 1,
                      'stock': 21,
                      'Garza': 1,
                      'Inauguration': 3,
                      'Spring': 1,
                      'By-passing': 1,
                      'alcoholics': 2,
                      'downed': 1,
                      '$28': 4,
                      'San': 22,
                      'furnish': 2,
                      'presidential': 5,
                      'wage': 3,
                      'split-level': 1,
                      'industrial': 5,
                      'Kunkel': 2,
                      'Convenience': 2,
                      'pure': 2,
                      'organize': 2,
                      'extended': 5,
                      'Bernhard': 1,
                      "Louis's": 1,
                      'Screw': 1,
                      'Giacomo': 1,
                      'FBI': 5,
                      'Actor-Crooner': 1,
                      'Slow': 1,
                      "Wouldn't": 1,
                      'ineffectual': 1,
                      'motion': 2,
                      'onetime': 1,
                      'car': 50,
                      'Mansion': 1,
                      'lucrative': 1,
                      'cap': 1,
                      'completions': 3,
                      '$30': 1,
                      'A.': 38,
                      '3-month': 1,
                      'Hyannis': 2,
                      'NLRB': 1,
                      "Hyde's": 1,
                      'present': 30,
                      'describes': 1,
                      'throws': 2,
                      'well-played': 1,
                      '$58,918': 1,
                      'science': 7,
                      'Merit': 1,
                      'officiated': 3,
                      'receives': 1,
                      'be': 526,
                      'wealth': 2,
                      'alternative': 4,
                      'racing': 2,
                      'names': 6,
                      'ready': 12,
                      'Exhibition': 1,
                      'baseballs': 1,
                      'Crawford': 2,
                      'cotton-growing': 1,
                      'bonding': 1,
                      'wed': 2,
                      'adjoining': 2,
                      'shares': 19,
                      'engaged': 3,
                      "Musician's": 1,
                      '220-yard': 1,
                      'Journal-American': 1,
                      'conclusion': 3,
                      'ant': 3,
                      'reasons': 12,
                      'auto': 3,
                      'janitor': 1,
                      'minutes': 25,
                      'Baseball': 4,
                      'Missiles': 1,
                      ...}),
             'romance': Counter({'Without': 2,
                      'vacuum': 2,
                      'festival': 1,
                      'contented': 2,
                      'laden': 1,
                      'More': 4,
                      'oystchers': 1,
                      'anyhow': 4,
                      'speak': 11,
                      'plain-out': 1,
                      'house': 36,
                      'promises': 1,
                      'skilled': 1,
                      'muscles': 1,
                      'shipwrecked': 1,
                      'overdeveloped': 1,
                      'darling': 4,
                      'building': 8,
                      'Adams': 1,
                      'waters': 1,
                      'dig': 1,
                      'threshold': 1,
                      'use': 14,
                      'than': 65,
                      'killed': 4,
                      'experience': 4,
                      'served': 1,
                      'jubilantly': 1,
                      'sports': 3,
                      'tempted': 2,
                      'thoroughly': 2,
                      'February': 1,
                      'Spike-haired': 1,
                      'pleading': 1,
                      'forum': 2,
                      'coursing': 1,
                      'certain': 13,
                      'season': 1,
                      'Jenny': 7,
                      'inward': 2,
                      'shade': 2,
                      'others': 9,
                      'sunshine': 2,
                      'Wait': 2,
                      'mercy': 1,
                      'bucking': 1,
                      'taut': 3,
                      'catch': 3,
                      'beautiful': 25,
                      'shorter': 1,
                      'marrying': 2,
                      'relay': 1,
                      'Karamazov': 1,
                      'Jeroboam': 1,
                      'denying': 1,
                      "John'll": 1,
                      'brushing': 2,
                      'sorted': 1,
                      'imposition': 1,
                      'nuzzled': 1,
                      'hardships': 1,
                      'inflection': 1,
                      'chic': 2,
                      'greatness': 2,
                      'drab': 1,
                      'necktie': 1,
                      "cousins'": 1,
                      'less-dramatic': 1,
                      'correct': 2,
                      'corporation': 2,
                      'cabins': 1,
                      'glance': 6,
                      'sage': 1,
                      'sommelier': 1,
                      'lumbering': 1,
                      'kitchen': 12,
                      'amount': 1,
                      "She'd": 3,
                      'rabbits': 1,
                      'kibbutzim': 1,
                      'fire-colored': 1,
                      'crack': 2,
                      'omitted': 1,
                      'glances': 2,
                      'Seeing': 2,
                      'way': 83,
                      'intriguingly': 1,
                      'unbelievable': 2,
                      "wouldn't": 33,
                      'clucked': 1,
                      'hurry': 3,
                      'interdependent': 1,
                      'through': 56,
                      'Just': 28,
                      "we'll": 4,
                      'each': 24,
                      'bucking-up': 1,
                      'Ah': 5,
                      'Rossi': 1,
                      'expanse': 1,
                      'tons': 1,
                      'timid': 2,
                      'vegetable': 1,
                      'forget': 5,
                      "Maggie's": 3,
                      'behave': 1,
                      'postscript': 1,
                      'log-house': 1,
                      'cottage': 6,
                      'shy': 1,
                      'raised': 6,
                      'magnificence': 1,
                      'stray': 1,
                      'catchee': 1,
                      'stream': 3,
                      'ate': 4,
                      'grave': 3,
                      'distaste': 2,
                      'charlotte': 1,
                      'sprung': 1,
                      'buggy': 1,
                      'eternal': 3,
                      'blackout': 1,
                      'descend': 1,
                      'fences': 1,
                      'At': 29,
                      'nervously': 3,
                      "I'm": 77,
                      'focused': 1,
                      'vessels': 1,
                      'les': 1,
                      'tipsy': 1,
                      'Those': 5,
                      'surprisingly': 2,
                      'bordering': 1,
                      'drop': 5,
                      'prophets': 1,
                      'career': 2,
                      'idly': 1,
                      'husband': 10,
                      'friends': 13,
                      'Toonker': 1,
                      'bet': 3,
                      'dully': 1,
                      'Deegan': 22,
                      'shouting': 5,
                      'wiped': 1,
                      'sands': 1,
                      'slip': 1,
                      'negative': 1,
                      "patient's": 1,
                      'Afterwards': 1,
                      'small': 31,
                      'Myra': 24,
                      'competition': 1,
                      'Heads': 1,
                      'intrepid': 1,
                      'doubtfully': 1,
                      'snag': 1,
                      'knife': 9,
                      'hostess': 1,
                      'missiles': 1,
                      'recently': 2,
                      'earthy': 1,
                      'vanity': 1,
                      'babes': 1,
                      'snapped': 5,
                      'next': 43,
                      'sly': 1,
                      'strike': 1,
                      'shall': 3,
                      'lapping': 2,
                      'searching': 3,
                      'really': 38,
                      'broad': 5,
                      'tireless': 2,
                      'disdain': 1,
                      'Please': 5,
                      'comforting': 1,
                      'rushed': 5,
                      'lath': 2,
                      'signals': 1,
                      'effect': 4,
                      'Pete': 11,
                      'Anything': 2,
                      'giggled': 1,
                      'King': 2,
                      'artists': 2,
                      'grasses': 1,
                      'masculine': 1,
                      'Remembering': 1,
                      'evening': 14,
                      'prisoners': 5,
                      'dripped': 1,
                      'twelve': 5,
                      'telegraph': 1,
                      'keen': 1,
                      'relax': 1,
                      'Lovejoy': 3,
                      'terribly': 4,
                      'Tolley': 6,
                      'regretted': 2,
                      'defiance': 1,
                      'assets': 2,
                      'shooing': 1,
                      'ask': 10,
                      'invested': 1,
                      'found': 29,
                      'floorshow': 1,
                      "one's": 4,
                      'pretty': 19,
                      'Springfield': 1,
                      'scholarships': 1,
                      'mother': 44,
                      'Galt': 1,
                      'things': 30,
                      'Now': 29,
                      'job': 27,
                      'thank': 3,
                      'parkish': 1,
                      'department': 1,
                      'independent': 1,
                      'lewd': 1,
                      'permission': 2,
                      'divided': 3,
                      'suppose': 7,
                      'Hall': 2,
                      'Patrol': 2,
                      'sir': 6,
                      'bat': 4,
                      'harshened': 1,
                      'hatred': 4,
                      'German': 1,
                      'neighbor': 1,
                      'scream': 2,
                      'signal': 4,
                      'vehicles': 1,
                      'exquisitely': 1,
                      'grabbed': 1,
                      'medium-sized': 1,
                      'gushed': 1,
                      'yielded': 1,
                      'commencement': 1,
                      'shadow': 3,
                      'teats': 2,
                      'fabric': 1,
                      'chuck-a-luck': 1,
                      "wasn't": 50,
                      'ordinarily': 1,
                      'bore': 3,
                      'dying': 5,
                      'Lord': 6,
                      'strip': 2,
                      'curiously': 2,
                      'hunting': 3,
                      'Coopers': 1,
                      'run': 16,
                      'discussed': 2,
                      'hundreds': 1,
                      'unconcerned': 1,
                      'grandchildren': 2,
                      'uncomfortable': 1,
                      'cacophony': 1,
                      'themselves': 10,
                      'guest': 6,
                      'Collector': 1,
                      'licking': 1,
                      'forepaws': 1,
                      'wicker': 1,
                      'positive': 4,
                      'Santa': 1,
                      "Gladdy's": 2,
                      'Stern-faced': 1,
                      'elegant': 2,
                      'defeatism': 1,
                      'aviator': 1,
                      'pulled': 8,
                      'sprinkling': 2,
                      'framed': 1,
                      'novelty': 1,
                      'visited': 3,
                      'loaded': 1,
                      'Also': 1,
                      'radio': 1,
                      'coat': 6,
                      'delivery': 1,
                      'trees': 7,
                      "mother's": 4,
                      'Oscar': 1,
                      'Japanese': 14,
                      'flatter': 1,
                      'due': 4,
                      'seeming': 1,
                      'plea': 1,
                      'warmed': 2,
                      'furniture': 2,
                      'sixty': 2,
                      'marry': 7,
                      'problem': 2,
                      'pulse': 2,
                      'winter': 5,
                      'distance': 7,
                      'mutineer': 1,
                      'walking': 12,
                      'percentage': 1,
                      'receive': 1,
                      'quok': 1,
                      'insurmountable': 1,
                      'midweek': 1,
                      'reflect': 1,
                      'looking': 36,
                      'azalea': 2,
                      'crook': 1,
                      'Watch': 1,
                      'Time': 1,
                      'humor': 5,
                      'periods': 1,
                      'baggy': 2,
                      'Pendleton': 3,
                      "We're": 4,
                      'sentimental': 2,
                      'Big': 3,
                      'bottle': 8,
                      'quench': 1,
                      'pointed': 7,
                      'whiskered': 1,
                      'determination': 1,
                      'slowly': 22,
                      'unlocked': 2,
                      'place': 45,
                      'piece': 2,
                      'rode': 4,
                      'laments': 2,
                      'amusement': 1,
                      'tasted': 1,
                      'underestimate': 1,
                      'Straightened': 1,
                      'undying': 1,
                      'atmosphere': 2,
                      'blessed': 1,
                      'melting': 1,
                      'urgent': 1,
                      'Ugh': 1,
                      'Eh': 2,
                      'instrument': 1,
                      'could': 193,
                      'Especially': 1,
                      'enjoying': 2,
                      'winking': 2,
                      'conform': 1,
                      'audience': 1,
                      'rich': 6,
                      'stopped': 25,
                      'Wilson': 3,
                      'dilapidated': 1,
                      'elaborate': 2,
                      'creating': 1,
                      'pride': 2,
                      'sequence': 1,
                      'okay': 1,
                      'tapping': 1,
                      'darkened': 2,
                      'increasingly': 1,
                      'brute': 1,
                      'snippy': 1,
                      'ran': 13,
                      'wagons': 1,
                      'optimism': 1,
                      'Cars': 1,
                      'meant': 11,
                      'tiny': 12,
                      'picture': 8,
                      'guarding': 1,
                      'A.M.': 1,
                      'sympathetic': 2,
                      'unemployment': 1,
                      'stairway': 1,
                      'buildings': 4,
                      'between': 22,
                      'cost': 3,
                      'glorying': 1,
                      'intimate': 3,
                      'noncommittally': 1,
                      'eighth': 1,
                      'gilded': 1,
                      'emerge': 1,
                      'daydreaming': 1,
                      'higher': 3,
                      'Ready': 1,
                      'Ronald': 3,
                      'forever-Cathy': 1,
                      "Pete's": 1,
                      'sewed': 1,
                      'nod': 3,
                      'firmly': 2,
                      'backed': 1,
                      'fruit': 1,
                      'yanking': 1,
                      'dismissed': 1,
                      'nowhere': 2,
                      'told': 47,
                      'paot': 3,
                      'discovery': 1,
                      'chants': 1,
                      'The': 230,
                      'heat': 4,
                      'downstairs': 4,
                      'dilatation': 1,
                      'messing': 1,
                      'Hodges': 2,
                      'good-living': 1,
                      'Boy': 1,
                      'of': 1186,
                      'bicarbonate': 1,
                      'law': 1,
                      'transcending': 1,
                      'stained': 1,
                      'squirt': 1,
                      'dainty': 1,
                      'virility': 1,
                      'rear': 1,
                      'Homes': 1,
                      'unnnt': 1,
                      'yearned': 1,
                      'corner': 9,
                      'bold': 1,
                      'cups': 2,
                      'industrialist': 1,
                      'lived': 19,
                      'wars': 1,
                      'Larkspur': 2,
                      'spoiled': 2,
                      'mingled': 1,
                      'watching': 20,
                      'maid': 4,
                      'numb': 1,
                      'facts': 2,
                      'Hudson': 1,
                      'unusually': 1,
                      'Rhine-Main': 1,
                      "Riverside's": 1,
                      'assembled': 1,
                      'announcing': 1,
                      'peculiar': 2,
                      'display': 2,
                      'sherbet-colored': 1,
                      'wallpaper': 2,
                      'funny': 7,
                      'factory': 1,
                      'entry': 1,
                      'overhead': 3,
                      'outside': 11,
                      'hung': 3,
                      'arc': 1,
                      'book': 10,
                      'suggested': 6,
                      'begging': 2,
                      'Renaissance': 1,
                      'jacket': 9,
                      'stairways': 1,
                      'Marvelous': 1,
                      'pressure': 2,
                      'bridesmaids': 1,
                      'sullen': 3,
                      'knowingly': 1,
                      'intend': 1,
                      'endlessly': 1,
                      'snuck': 1,
                      'purpose': 1,
                      'give': 28,
                      'loathing': 1,
                      'clamped': 2,
                      'absolute': 1,
                      'misfortune': 1,
                      'dishes': 1,
                      'tape': 2,
                      'beauty': 6,
                      'odor': 1,
                      'flatly': 1,
                      'glove': 3,
                      'tunnel': 1,
                      'blond': 2,
                      'Richard': 17,
                      'waving': 3,
                      'limpid': 1,
                      'sanction': 1,
                      'strenuous': 1,
                      'applying': 1,
                      'allow': 1,
                      'shuttered': 1,
                      'blushed': 1,
                      'library': 3,
                      'cancer': 2,
                      'missing': 6,
                      'actually': 5,
                      'Several': 1,
                      'rocked': 2,
                      'explanation': 1,
                      'replied': 2,
                      'juice': 4,
                      'Malta': 1,
                      'purtiest': 1,
                      'after': 58,
                      'questioning': 2,
                      'ward': 6,
                      'reached': 6,
                      "Brush-off's": 1,
                      'anyway': 10,
                      'response': 2,
                      'bargain': 1,
                      'authority': 1,
                      'Perhaps': 9,
                      'grace': 3,
                      'confused': 2,
                      'cherries': 1,
                      'Shall': 1,
                      'strikingly': 1,
                      'big': 36,
                      'overhand': 1,
                      'quantities': 1,
                      'inspect': 1,
                      'femme': 1,
                      'nine-to-five': 1,
                      'completely': 1,
                      'wooden': 3,
                      'lore': 1,
                      'freak': 2,
                      'opened': 13,
                      'Elliott': 1,
                      'soldierly': 1,
                      'Papanicolaou': 1,
                      'twenty-three': 1,
                      'Holy': 2,
                      'Heavenly': 1,
                      'satin': 1,
                      'stood': 37,
                      'worst': 3,
                      'Dolan': 2,
                      'deck': 4,
                      'lipstick': 2,
                      'drinking': 4,
                      'rages': 1,
                      'street': 14,
                      'glass': 7,
                      'bow': 1,
                      'cops': 2,
                      'priest': 1,
                      'Longue': 3,
                      'Leaning': 1,
                      'wears': 1,
                      'mist-like': 1,
                      'accuse': 1,
                      'town': 15,
                      'now': 83,
                      'solos': 1,
                      'azaleas': 2,
                      'radiant': 2,
                      'upsets': 1,
                      'suddenly': 22,
                      'colder': 1,
                      'gets': 4,
                      'Quint': 11,
                      'weatherproof': 1,
                      'lay': 11,
                      'loss': 3,
                      "Deegan's": 2,
                      'Jobs': 1,
                      'three': 17,
                      'shut': 9,
                      'blazing': 1,
                      'Because': 3,
                      'stock': 7,
                      'Spring': 4,
                      'downed': 1,
                      'San': 5,
                      'saddles': 1,
                      'virtue': 1,
                      'pat': 1,
                      'pure': 4,
                      'organize': 1,
                      'shed': 1,
                      'patiently': 2,
                      'lips': 11,
                      'blossom': 1,
                      'mama': 1,
                      'joy': 4,
                      'beast': 1,
                      "Wouldn't": 1,
                      'dark-gray': 1,
                      'car': 20,
                      'godless': 1,
                      'shine': 1,
                      'discovered': 5,
                      'crickets': 1,
                      'salad': 1,
                      'present': 9,
                      'sonofabitch': 2,
                      'science': 6,
                      'unnaturally': 1,
                      'be': 289,
                      'alternative': 1,
                      'racing': 3,
                      'names': 1,
                      'ready': 11,
                      'Ten': 1,
                      'witches': 1,
                      'abundance': 1,
                      'France': 1,
                      'engaged': 1,
                      'dark-blue': 1,
                      'helluva': 1,
                      'conclusion': 1,
                      'ceased': 1,
                      'janitor': 1,
                      'minutes': 9,
                      'unbent': 1,
                      'remote': 3,
                      'touching': 5,
                      'joined': 4,
                      "He's": 14,
                      'nose': 7,
                      'sharpened': 1,
                      'covering': 1,
                      'afternoon': 12,
                      'prisoner': 2,
                      'teasing': 1,
                      'drained': 2,
                      'thousand-legged': 1,
                      'frivolous': 2,
                      'commercial': 1,
                      'sun-warmed': 1,
                      'harbor': 2,
                      'shivered': 2,
                      'satin-covered': 1,
                      'brandishing': 2,
                      'fail': 2,
                      'boarding': 1,
                      'burial': 2,
                      'mergers': 1,
                      "lovers'": 1,
                      'dignity': 4,
                      'force': 4,
                      'replaced': 2,
                      'Puny': 1,
                      'sneaky': 1,
                      'dawns': 1,
                      'prevented': 1,
                      'popularly': 1,
                      'rackety': 1,
                      'eastern': 1,
                      'dinners': 2,
                      'wildly': 5,
                      'Eating': 1,
                      'Donald': 7,
                      "child's": 3,
                      'revealed': 2,
                      'alone': 24,
                      'pardon': 1,
                      'fetes': 1,
                      'cent': 3,
                      'Half': 2,
                      'beneath': 6,
                      'bullet': 1,
                      'cramp': 1,
                      "You'd": 2,
                      'into': 136,
                      'encouragingly': 1,
                      'snoring': 1,
                      'address': 3,
                      'else': 29,
                      'skipped': 3,
                      'justify': 1,
                      'let': 39,
                      'fonder': 1,
                      'becoming': 3,
                      'promising': 2,
                      'outraged': 1,
                      'artfully': 1,
                      'necessity': 2,
                      'brother': 5,
                      'Somebody': 4,
                      'managed': 4,
                      'gleaming': 1,
                      'Dillinger': 1,
                      'fifty-fifty': 1,
                      'method': 1,
                      'murder': 4,
                      'Swan': 1,
                      'scarf': 2,
                      'grown-up': 1,
                      'Kare': 1,
                      'sweeping': 1,
                      'forgiven': 1,
                      'conceptions': 1,
                      "Alexander's": 2,
                      'gumming': 1,
                      'ironed': 1,
                      'hunger': 2,
                      'husband-stealer': 1,
                      'insiders': 1,
                      "Partlow's": 1,
                      'directors': 1,
                      'instigating': 1,
                      'pupils': 3,
                      'straining': 2,
                      'switch': 1,
                      'desiring': 1,
                      'Subdued': 1,
                      'Gordon': 2,
                      'singsonged': 1,
                      'electric': 1,
                      'jammed': 2,
                      'muttered': 3,
                      'secretaries': 1,
                      'absurd': 3,
                      ')': 21,
                      'forgive': 1,
                      'corners': 1,
                      'around': 68,
                      'understands': 1,
                      'edges': 2,
                      "I'll": 53,
                      'mosaic': 1,
                      'parades': 1,
                      'maw': 1,
                      'monotonous': 1,
                      'overly': 1,
                      'tawny': 1,
                      'naked': 7,
                      'clothes': 26,
                      'merest': 1,
                      'lukewarm': 1,
                      'advancement': 1,
                      'dignified': 1,
                      'thankful': 3,
                      'released': 1,
                      'neatness': 1,
                      'rapping': 2,
                      'attracted': 1,
                      'letter': 19,
                      'hide': 1,
                      "Horne's": 1,
                      'frenziedly': 1,
                      'infected': 1,
                      'parked': 1,
                      'biggest': 1,
                      'from': 202,
                      'honestly': 2,
                      "four-o'clock": 1,
                      'Swallow': 1,
                      'ages': 2,
                      'warning': 2,
                      'appearance': 4,
                      'atheists': 1,
                      'Pollock': 1,
                      'ball': 18,
                      'paunch': 1,
                      'Mother': 5,
                      'half-murmured': 1,
                      'Fudo': 7,
                      'reality': 1,
                      "Concetta's": 2,
                      'stretch': 1,
                      'gambit': 1,
                      'highball': 2,
                      'studied': 7,
                      'twenty-five': 1,
                      'appreciate': 1,
                      'years': 34,
                      'sticks': 3,
                      'East': 4,
                      'probably': 13,
                      'brunettes': 1,
                      'frequently': 1,
                      'During': 2,
                      'Cold': 1,
                      'pilgrim': 1,
                      'tore': 3,
                      'Obviously': 1,
                      'cashmere': 1,
                      'what': 121,
                      'or': 150,
                      'alloy': 1,
                      "Don't": 24,
                      'states': 2,
                      "Jim's": 3,
                      'shaky': 2,
                      'put-upon': 1,
                      'city': 10,
                      'sex': 4,
                      'ginkgo': 1,
                      'stomach': 8,
                      'sensing': 1,
                      'skins': 1,
                      'backing': 1,
                      'easier': 3,
                      'consciousness': 1,
                      'thoughtfully': 1,
                      'accordance': 1,
                      'Cursed': 1,
                      'stickman': 1,
                      'general': 3,
                      'sprawled': 2,
                      'Sam': 18,
                      'invite': 1,
                      'heart': 17,
                      'bleak': 1,
                      'national': 5,
                      'tomorrow': 10,
                      'umbrella': 2,
                      'expanding': 2,
                      'grasp': 1,
                      'contracted': 1,
                      "lady's": 2,
                      'called': 31,
                      'namesake': 1,
                      'year': 14,
                      'victory': 2,
                      'Dunne': 4,
                      'Clouds': 4,
                      'Too': 2,
                      'courage': 4,
                      'subdued': 2,
                      'painted': 4,
                      'agricultural': 1,
                      'susceptible': 1,
                      'bring': 9,
                      'shopping': 1,
                      'uncomfortably': 2,
                      'showering': 1,
                      'hurled': 1,
                      'ripening': 1,
                      'wardroom': 1,
                      'protected': 2,
                      'wonder': 10,
                      'solution': 3,
                      'born': 1,
                      'sharply': 3,
                      'makeshift': 1,
                      'chance': 16,
                      'tenant': 1,
                      'spirits': 8,
                      'Methodist': 1,
                      'blackjack': 2,
                      'ailment': 1,
                      'fellow': 5,
                      'halfway': 1,
                      'swears': 1,
                      'graham': 2,
                      'Shades': 2,
                      'not': 250,
                      'Adrien': 2,
                      'sons': 2,
                      'taught': 2,
                      'fuss': 1,
                      'basket': 2,
                      'isolating': 1,
                      'suffer': 1,
                      'complained': 1,
                      'Cross': 1,
                      'varicolored': 1,
                      'college': 12,
                      'envisioned': 1,
                      'placed': 4,
                      'Handsomest': 1,
                      'boat': 2,
                      'crumbling': 1,
                      'noble': 1,
                      'illustrated': 1,
                      'Nor': 3,
                      'Unlike': 1,
                      'sliding': 1,
                      'glanced': 3,
                      'worried': 7,
                      'hated': 6,
                      'soda': 1,
                      'arrangement': 1,
                      'limping': 1,
                      'laced': 1,
                      'Anniston': 15,
                      'five': 6,
                      'Rosa': 4,
                      'bankruptcy': 1,
                      'common': 3,
                      'Arcilla': 1,
                      'mash': 1,
                      'less': 12,
                      'movement': 1,
                      'gleeful': 1,
                      'scraping': 1,
                      'thrusting': 2,
                      "mustn't": 2,
                      'participate': 1,
                      'jocular': 1,
                      'capital': 1,
                      'cords': 1,
                      'application': 2,
                      'sulked': 1,
                      'temperament': 1,
                      'infection': 1,
                      'smaller': 2,
                      'cheerful': 3,
                      'AA': 1,
                      'dragooned': 1,
                      'sting': 2,
                      'word': 17,
                      'jobless': 1,
                      'magnificent': 2,
                      'Mattie': 1,
                      'restriction': 1,
                      'thinks': 4,
                      'ground': 8,
                      "We'd": 3,
                      'primitive': 1,
                      'bouts': 1,
                      'shrubs': 1,
                      'sloppy': 1,
                      'Walter': 2,
                      'stir': 1,
                      'triangular': 1,
                      'plodding': 2,
                      'blood-soaked': 1,
                      'disquiet': 1,
                      'business': 21,
                      'strength': 8,
                      'leads': 1,
                      'wreck': 2,
                      'adults': 1,
                      'Cooper': 8,
                      'Freed': 1,
                      'conclusions': 1,
                      'swim': 1,
                      'unusual': 5,
                      'Young': 2,
                      'clasped': 2,
                      'talons': 1,
                      'white': 30,
                      'dreamed': 4,
                      'Named': 1,
                      'Road': 1,
                      'Pink': 1,
                      'familial': 1,
                      'hopped': 1,
                      'stumbled': 5,
                      'spit': 1,
                      'carved': 1,
                      'stones': 2,
                      'sort': 10,
                      'bookish': 1,
                      'temptation': 2,
                      'cats': 2,
                      'mastiff': 1,
                      'stoop': 1,
                      'partially': 1,
                      'Petty': 1,
                      'trudged': 1,
                      'caves': 1,
                      'artificial': 1,
                      'safely': 2,
                      'depending': 1,
                      'depression': 3,
                      'assume': 1,
                      'Eugenia': 15,
                      'bear': 4,
                      'wicked': 2,
                      'chapel-like': 1,
                      'co-operate': 1,
                      'potential': 2,
                      'schools': 2,
                      'darted': 1,
                      'recalled': 2,
                      'Byron': 1,
                      'casually': 1,
                      'pleasantly': 1,
                      'handspikes': 3,
                      'russe': 1,
                      'visitors': 1,
                      'Hotel': 1,
                      'risky': 1,
                      'bless': 3,
                      'swift': 1,
                      'Chinaman': 1,
                      'remoteness': 1,
                      'hangers': 1,
                      'cremate': 1,
                      'censure': 1,
                      'crates': 1,
                      'obligated': 1,
                      'traffic': 3,
                      'ear': 3,
                      'tea': 3,
                      'wire-haired': 1,
                      'shackles': 2,
                      'best': 12,
                      'sweetly': 2,
                      'loyal': 1,
                      'trestles': 1,
                      'unquenched': 1,
                      'lingerie': 1,
                      'white-columned': 1,
                      ...})})

In [35]:
cfd.conditions()


Out[35]:
['romance', 'news']

each condition is just a frequency distribution


In [36]:
print(cfd['news'])
print(cfd['romance'])


<FreqDist with 14394 samples and 100554 outcomes>
<FreqDist with 8452 samples and 70022 outcomes>

In [37]:
cfd['romance'].most_common(20)


Out[37]:
[(',', 3899),
 ('.', 3736),
 ('the', 2758),
 ('and', 1776),
 ('to', 1502),
 ('a', 1335),
 ('of', 1186),
 ('``', 1045),
 ("''", 1044),
 ('was', 993),
 ('I', 951),
 ('in', 875),
 ('he', 702),
 ('had', 692),
 ('?', 690),
 ('her', 651),
 ('that', 583),
 ('it', 573),
 ('his', 559),
 ('she', 496)]

In [38]:
cfd['romance']['could']


Out[38]:
193

Plotting and Tabulating Distributions

Generating (condition, event) pairs for the Inaugural Address corpus -- using the fact that all filenames start with the year, such as 1865-Lincoln.txt


In [39]:
from nltk.corpus import inaugural
inaugural.fileids()


Out[39]:
['1789-Washington.txt',
 '1793-Washington.txt',
 '1797-Adams.txt',
 '1801-Jefferson.txt',
 '1805-Jefferson.txt',
 '1809-Madison.txt',
 '1813-Madison.txt',
 '1817-Monroe.txt',
 '1821-Monroe.txt',
 '1825-Adams.txt',
 '1829-Jackson.txt',
 '1833-Jackson.txt',
 '1837-VanBuren.txt',
 '1841-Harrison.txt',
 '1845-Polk.txt',
 '1849-Taylor.txt',
 '1853-Pierce.txt',
 '1857-Buchanan.txt',
 '1861-Lincoln.txt',
 '1865-Lincoln.txt',
 '1869-Grant.txt',
 '1873-Grant.txt',
 '1877-Hayes.txt',
 '1881-Garfield.txt',
 '1885-Cleveland.txt',
 '1889-Harrison.txt',
 '1893-Cleveland.txt',
 '1897-McKinley.txt',
 '1901-McKinley.txt',
 '1905-Roosevelt.txt',
 '1909-Taft.txt',
 '1913-Wilson.txt',
 '1917-Wilson.txt',
 '1921-Harding.txt',
 '1925-Coolidge.txt',
 '1929-Hoover.txt',
 '1933-Roosevelt.txt',
 '1937-Roosevelt.txt',
 '1941-Roosevelt.txt',
 '1945-Roosevelt.txt',
 '1949-Truman.txt',
 '1953-Eisenhower.txt',
 '1957-Eisenhower.txt',
 '1961-Kennedy.txt',
 '1965-Johnson.txt',
 '1969-Nixon.txt',
 '1973-Nixon.txt',
 '1977-Carter.txt',
 '1981-Reagan.txt',
 '1985-Reagan.txt',
 '1989-Bush.txt',
 '1993-Clinton.txt',
 '1997-Clinton.txt',
 '2001-Bush.txt',
 '2005-Bush.txt',
 '2009-Obama.txt']

In [40]:
inaugural.words('2009-Obama.txt')


Out[40]:
['My', 'fellow', 'citizens', ':', 'I', 'stand', 'here', ...]

Loop through all words in all fileids and create pairs that are of the form:

  • ('america', '1997')
  • ('citizen', '1993')

In [41]:
cfd = nltk.ConditionalFreqDist((target, fileid[:4]) 
                               for fileid in inaugural.fileids()
                               for w in inaugural.words(fileid)
                               for target in ['america', 'citizen']
                               if w.lower().startswith(target))
cfd


Out[41]:
defaultdict(nltk.probability.FreqDist,
            {'america': Counter({'1789': 2,
                      '1793': 1,
                      '1797': 8,
                      '1805': 1,
                      '1813': 1,
                      '1817': 1,
                      '1821': 2,
                      '1833': 2,
                      '1837': 2,
                      '1841': 7,
                      '1849': 2,
                      '1853': 2,
                      '1857': 3,
                      '1861': 2,
                      '1865': 1,
                      '1877': 1,
                      '1881': 2,
                      '1885': 4,
                      '1889': 6,
                      '1893': 9,
                      '1897': 9,
                      '1901': 7,
                      '1909': 12,
                      '1917': 4,
                      '1921': 24,
                      '1925': 11,
                      '1929': 12,
                      '1933': 2,
                      '1937': 5,
                      '1941': 12,
                      '1945': 2,
                      '1949': 4,
                      '1953': 6,
                      '1957': 7,
                      '1961': 7,
                      '1965': 10,
                      '1969': 10,
                      '1973': 23,
                      '1977': 5,
                      '1981': 16,
                      '1985': 21,
                      '1989': 11,
                      '1993': 33,
                      '1997': 31,
                      '2001': 20,
                      '2005': 30,
                      '2009': 15}),
             'citizen': Counter({'1789': 5,
                      '1793': 1,
                      '1797': 6,
                      '1801': 7,
                      '1805': 10,
                      '1809': 1,
                      '1813': 4,
                      '1817': 14,
                      '1821': 15,
                      '1825': 3,
                      '1829': 2,
                      '1833': 3,
                      '1837': 7,
                      '1841': 38,
                      '1845': 11,
                      '1849': 2,
                      '1853': 4,
                      '1857': 7,
                      '1861': 7,
                      '1869': 5,
                      '1873': 3,
                      '1877': 9,
                      '1881': 9,
                      '1885': 13,
                      '1889': 12,
                      '1893': 10,
                      '1897': 10,
                      '1901': 2,
                      '1905': 1,
                      '1909': 6,
                      '1917': 3,
                      '1921': 6,
                      '1925': 5,
                      '1929': 12,
                      '1933': 1,
                      '1937': 2,
                      '1941': 1,
                      '1945': 1,
                      '1949': 1,
                      '1953': 7,
                      '1961': 5,
                      '1965': 4,
                      '1969': 1,
                      '1973': 1,
                      '1981': 3,
                      '1985': 6,
                      '1989': 3,
                      '1993': 2,
                      '1997': 10,
                      '2001': 11,
                      '2005': 7,
                      '2009': 2})})

A different conditional frequency distribution. Condition is the name of the language and counts are word lengths


In [42]:
from nltk.corpus import udhr
languages = ['Chickasaw', 'English', 'German_Deutsch', 
             'Greenlandic_Inuktikut', 'Hungarian_Magyar', 'Ibibio_Efik']
udhr.words('Chickasaw'+'-Latin1')


Out[42]:
['Hattak', 'Móma', 'Iholisso', 'Ishtaa', '-', 'aya', ...]

In [43]:
cfd = nltk.ConditionalFreqDist((lang, len(word))
                               for lang in languages
                               for word in udhr.words(lang + '-Latin1'))

plot() and tabulate() methods allow a conditions parameter to specify a subset of conditions


In [44]:
cfd.tabulate(conditions=['English', 'German_Deutsch'],
             samples=range(10),
             cumulative=True)


                  0    1    2    3    4    5    6    7    8    9 
       English    0  185  525  883  997 1166 1283 1440 1558 1638 
German_Deutsch    0  171  263  614  717  894 1013 1110 1213 1275 

(leaving out conditions returns all conditions)


In [45]:
cfd.tabulate(samples=range(10),
             cumulative=True)


                         0    1    2    3    4    5    6    7    8    9 
            Chickasaw    0  411  510  551  619  710  799  876  946  995 
              English    0  185  525  883  997 1166 1283 1440 1558 1638 
       German_Deutsch    0  171  263  614  717  894 1013 1110 1213 1275 
Greenlandic_Inuktikut    0  139  150  151  154  175  182  241  259  283 
     Hungarian_Magyar    0  302  431  503  655  767  881  972 1081 1171 
          Ibibio_Efik    0  228  440  915 1418 1705 1867 1974 2049 2074 

Your Turn: Working with the news and romance genres from the Brown Corpus, find out which days of the week are most newsworthy, and which are most romantic. Define a variable called days containing a list of days of the week, i.e. ['Monday', ...]. Now tabulate the counts for these words using cfd.tabulate(samples=days). Now try the same thing using plot in place of tabulate. You may control the output order of days with the help of an extra parameter: samples=['Monday', ...].


In [46]:
from nltk.corpus import brown
days = ['Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday']
genres = ['news', 'romance']
cfd = nltk.ConditionalFreqDist((genre, word)
                               for genre in genres
                               for word in brown.words(categories=genre)
                               if word in days)
cfd


Out[46]:
defaultdict(nltk.probability.FreqDist,
            {'news': Counter({'Friday': 41,
                      'Monday': 54,
                      'Thursday': 20,
                      'Tuesday': 43,
                      'Wednesday': 22}),
             'romance': Counter({'Friday': 3,
                      'Monday': 2,
                      'Thursday': 1,
                      'Tuesday': 3,
                      'Wednesday': 3})})

In [47]:
cfd.tabulate(samples=days)


        Monday Tuesday Wednesday Thursday Friday 
   news   54   43   22   20   41 
romance    2    3    3    1    3 

In [48]:
plot = plt.figure(figsize=(18,10))
cfd.plot(samples=days)


Generating Random Text with Bigrams

Creating a table of bigrams (word pairs)

bigrams() builds a list of consecutive word pairs from a list of words (returns a generator)


In [49]:
sent = ['In', 'the', 'beginning', 'God', 'created', 'the', 'heaven',
        'and', 'the', 'earth', '.']
list(nltk.bigrams(sent))


Out[49]:
[('In', 'the'),
 ('the', 'beginning'),
 ('beginning', 'God'),
 ('God', 'created'),
 ('created', 'the'),
 ('the', 'heaven'),
 ('heaven', 'and'),
 ('and', 'the'),
 ('the', 'earth'),
 ('earth', '.')]

write a function generate_model() that runs a loop to generate text based on maximum frequencies


In [50]:
def generate_model(cfdist, word, num=15):
    for i in range(num):
        print(word, end=' ')
        word = cfdist[word].max() # choose next word based on max frequency following another word

In [51]:
text = nltk.corpus.genesis.words('english-kjv.txt')
bigrams = nltk.bigrams(text)
cfd = nltk.ConditionalFreqDist(bigrams)
cfd


Out[51]:
defaultdict(nltk.probability.FreqDist,
            {'Beerlahairoi': Counter({';': 1}),
             'lighted': Counter({'off': 1, 'upon': 1}),
             'pillows': Counter({',': 2}),
             'laden': Counter({'with': 2}),
             'na': Counter({'and': 1}),
             'Escape': Counter({'for': 1}),
             'blood': Counter({',': 2,
                      ';': 1,
                      '?': 1,
                      'be': 1,
                      'crieth': 1,
                      'from': 1,
                      'is': 1,
                      'of': 2,
                      'thereof': 1}),
             'Eber': Counter({',': 1,
                      '.': 1,
                      ':': 1,
                      'four': 1,
                      'lived': 2,
                      'were': 1}),
             'drove': Counter({'.': 1,
                      'and': 1,
                      'by': 1,
                      'out': 1,
                      'them': 1,
                      'which': 1}),
             'speak': Counter({',': 1,
                      '?': 1,
                      'a': 1,
                      'not': 2,
                      'peaceably': 1,
                      'unto': 6,
                      'yet': 1}),
             'inn': Counter({',': 2}),
             'dost': Counter({'ask': 1, 'overtake': 1}),
             'grow': Counter({'every': 1, 'into': 1}),
             'strive': Counter({'with': 2}),
             'Ararat': Counter({'.': 1}),
             'house': Counter({',': 38,
                      '.': 8,
                      ':': 2,
                      ';': 5,
                      '?': 1,
                      'also': 1,
                      'for': 2,
                      'in': 1,
                      'into': 1,
                      'is': 2,
                      'of': 12,
                      'round': 1,
                      'silver': 1,
                      'than': 1,
                      'there': 1,
                      'these': 1,
                      'to': 2,
                      'with': 2}),
             'desolate': Counter({'.': 1}),
             'company': Counter({',': 1, '.': 2, 'of': 2, 'which': 1}),
             'feel': Counter({'me': 1, 'thee': 1}),
             'fast': Counter({'closed': 1}),
             'Rebek': Counter({'he': 1}),
             'dainties': Counter({'.': 1}),
             'Jabbok': Counter({'.': 1}),
             'waters': Counter({',': 1,
                      '.': 3,
                      'asswaged': 1,
                      'bring': 1,
                      'brought': 1,
                      'called': 1,
                      'decreased': 1,
                      'from': 1,
                      'in': 1,
                      'increased': 1,
                      'of': 3,
                      'prevail': 1,
                      'prevailed': 3,
                      'returned': 1,
                      'shall': 1,
                      'under': 1,
                      'upon': 1,
                      'was': 1,
                      'were': 6,
                      'which': 2}),
             'generation': Counter({'.': 1, ':': 1, 'they': 1}),
             'toil': Counter({',': 1, 'of': 1}),
             'smell': Counter({'of': 3}),
             'walked': Counter({'with': 3}),
             'Mibzar': Counter({',': 1}),
             'sevenfold': Counter({',': 1, '.': 2}),
             'than': Counter({'I': 3,
                      'Leah': 1,
                      'all': 3,
                      'any': 1,
                      'he': 1,
                      'that': 2,
                      'the': 1,
                      'thou': 1,
                      'we': 1,
                      'with': 1}),
             'worse': Counter({'with': 1}),
             'Succoth': Counter({',': 1, '.': 1}),
             'wept': Counter({'.': 4,
                      ';': 2,
                      'aloud': 1,
                      'for': 1,
                      'on': 1,
                      'there': 1,
                      'upon': 3,
                      'when': 1}),
             'experience': Counter({'that': 1}),
             'ea': Counter({'and': 3}),
             'rest': Counter({'for': 1, 'of': 1, 'was': 1, 'yourselves': 1}),
             'endure': Counter({',': 1}),
             'linen': Counter({',': 1}),
             'served': Counter({'Chedorlaomer': 1,
                      'him': 1,
                      'seven': 1,
                      'th': 1,
                      'thee': 3,
                      'with': 1,
                      'your': 1}),
             'dead': Counter({',': 4,
                      '.': 3,
                      ';': 1,
                      'man': 1,
                      'out': 2,
                      'there': 1}),
             'ewe': Counter({'lambs': 3}),
             'kn': Counter({'and': 1}),
             'hid': Counter({';': 1, 'myself': 1, 'them': 1, 'themselves': 1}),
             'beyond': Counter({'Jordan': 2, 'the': 1}),
             'greatly': Counter({';': 2,
                      'afraid': 1,
                      'multiply': 1,
                      'upon': 1}),
             'Pinon': Counter({',': 1}),
             'colts': Counter({',': 1}),
             'Perizzit': Counter({'and': 1}),
             'thoroughly': Counter({'.': 1}),
             'shouldest': Counter({'have': 1, 'not': 1, 'say': 1, 'thou': 1}),
             'he': Counter({',': 6,
                      '.': 1,
                      ';': 1,
                      'And': 1,
                      'Ephraim': 1,
                      'Se': 1,
                      'a': 1,
                      'abode': 1,
                      'alone': 1,
                      'also': 6,
                      'an': 1,
                      'and': 6,
                      'armed': 1,
                      'arose': 1,
                      'asked': 3,
                      'asses': 1,
                      'be': 1,
                      'became': 1,
                      'began': 1,
                      'begat': 18,
                      'beget': 1,
                      'believed': 2,
                      'besought': 1,
                      'blessed': 7,
                      'bought': 1,
                      'bowed': 2,
                      'brought': 6,
                      'builded': 3,
                      'built': 1,
                      'called': 15,
                      'came': 9,
                      'carried': 1,
                      'charged': 1,
                      'cometh': 1,
                      'comforted': 1,
                      'commanded': 4,
                      'communed': 1,
                      'couched': 1,
                      'could': 2,
                      'counted': 1,
                      'cried': 2,
                      'deal': 1,
                      'delivered': 2,
                      'departed': 1,
                      'did': 10,
                      'die': 1,
                      'died': 11,
                      'discerned': 1,
                      'divided': 3,
                      'divineth': 1,
                      'drank': 2,
                      'dreamed': 3,
                      'drew': 1,
                      'drove': 1,
                      'dwelt': 4,
                      'entered': 1,
                      'entreated': 1,
                      'erected': 1,
                      'espied': 1,
                      'feared': 2,
                      'fed': 2,
                      'fell': 2,
                      'felt': 1,
                      'fled': 3,
                      'forget': 1,
                      'found': 2,
                      'gathered': 2,
                      'gave': 11,
                      'given': 1,
                      'giveth': 1,
                      'goats': 2,
                      'gotten': 1,
                      'grew': 1,
                      'had': 49,
                      'halted': 1,
                      'hanged': 2,
                      'hasted': 1,
                      'hath': 13,
                      'heard': 3,
                      'hearkened': 1,
                      'held': 1,
                      'him': 2,
                      'interpreted': 2,
                      'is': 13,
                      'it': 1,
                      'kept': 1,
                      'kissed': 2,
                      'knew': 5,
                      'lay': 1,
                      'left': 6,
                      'lift': 1,
                      'lifted': 4,
                      'lighted': 1,
                      'lingered': 1,
                      'lived': 1,
                      'lodged': 1,
                      'looked': 2,
                      'love': 1,
                      'loved': 3,
                      'made': 11,
                      'man': 1,
                      'may': 5,
                      'might': 1,
                      'not': 3,
                      'opened': 1,
                      'out': 1,
                      'overthrew': 2,
                      'overtook': 1,
                      'passed': 2,
                      'perceived': 2,
                      'placed': 1,
                      'planted': 1,
                      'poured': 2,
                      'pressed': 1,
                      'prevailed': 1,
                      'put': 9,
                      'ran': 2,
                      'refused': 2,
                      'removed': 4,
                      'rent': 1,
                      'rested': 1,
                      'restored': 2,
                      'returned': 2,
                      'rightly': 1,
                      'rose': 2,
                      'said': 115,
                      'saith': 1,
                      'sat': 1,
                      'saw': 5,
                      'searched': 2,
                      'seemed': 1,
                      'seeth': 1,
                      'sent': 8,
                      'served': 2,
                      'set': 3,
                      'shall': 14,
                      'shaved': 1,
                      'sheweth': 1,
                      'should': 2,
                      'sle': 1,
                      'slept': 1,
                      'slew': 1,
                      'smelled': 1,
                      'sold': 1,
                      'sought': 1,
                      'spake': 4,
                      'spilled': 1,
                      'stayed': 2,
                      'stood': 5,
                      'stooped': 1,
                      'sware': 3,
                      'talked': 2,
                      'that': 6,
                      'the': 1,
                      'them': 2,
                      'thought': 1,
                      'told': 4,
                      'took': 17,
                      'touched': 2,
                      'turned': 2,
                      'ungirded': 1,
                      'unto': 1,
                      'up': 1,
                      'urged': 1,
                      'was': 23,
                      'washed': 2,
                      'well': 1,
                      'went': 9,
                      'wept': 1,
                      'whom': 1,
                      'will': 6,
                      'with': 2,
                      'worshipped': 1,
                      'would': 2,
                      'wrestled': 1,
                      'yet': 3}),
             'are': Counter({',': 1,
                      'absent': 1,
                      'against': 1,
                      'all': 1,
                      'among': 1,
                      'an': 1,
                      'at': 1,
                      'brethren': 1,
                      'circumcised': 1,
                      'co': 1,
                      'come': 5,
                      'departed': 1,
                      'embalm': 1,
                      'five': 2,
                      'fulfilled': 2,
                      'here': 1,
                      'in': 3,
                      'mine': 1,
                      'my': 5,
                      'no': 3,
                      'not': 2,
                      'of': 1,
                      'peaceable': 1,
                      'ringstraked': 1,
                      'seven': 3,
                      'shepherds': 2,
                      'spi': 1,
                      'spies': 2,
                      'tender': 1,
                      'the': 43,
                      'their': 2,
                      'therein': 1,
                      'these': 5,
                      'they': 1,
                      'thine': 1,
                      'those': 1,
                      'three': 2,
                      'thy': 1,
                      'to': 1,
                      'true': 4,
                      'twelve': 1,
                      'verily': 1,
                      'we': 3,
                      'with': 3,
                      'ye': 1}),
             'baker': Counter({':': 1,
                      'among': 1,
                      'had': 1,
                      'of': 1,
                      'saw': 1}),
             'east': Counter({',': 2,
                      '.': 2,
                      'country': 1,
                      'of': 4,
                      'wind': 3}),
             'our': Counter({"'": 1,
                      'asses': 1,
                      'bodies': 1,
                      'brethren': 1,
                      'brother': 4,
                      'children': 1,
                      'country': 1,
                      'daughter': 1,
                      'daughters': 3,
                      'dreams': 1,
                      'fathe': 1,
                      'father': 13,
                      'fathers': 1,
                      'flesh': 1,
                      'hand': 2,
                      'hands': 2,
                      'herds': 1,
                      'image': 1,
                      'journey': 1,
                      'kindred': 1,
                      'lan': 1,
                      'land': 3,
                      'likene': 1,
                      'little': 1,
                      'lives': 1,
                      'money': 4,
                      'sacks': 4,
                      'sepulchres': 1,
                      'sister': 3,
                      'state': 1,
                      'work': 1,
                      'youngest': 2,
                      'youth': 1}),
             'hide': Counter({'from': 1, 'it': 1}),
             'certain': Counter({'Adullamite': 1,
                      'Canaanite': 1,
                      'man': 1,
                      'place': 1}),
             'wages': Counter({',': 1, ';': 1, 'be': 1, 'ten': 2}),
             'Jac': Counter({'all': 1, 'and': 1}),
             'God': Counter({"'": 3,
                      ',': 18,
                      '.': 4,
                      ':': 3,
                      ';': 1,
                      '?': 3,
                      'Almighty': 4,
                      'after': 1,
                      'amongst': 1,
                      'and': 2,
                      'appeared': 2,
                      'ascending': 1,
                      'be': 1,
                      'blessed': 5,
                      'brought': 1,
                      'called': 5,
                      'came': 3,
                      'caused': 2,
                      'commanded': 2,
                      'created': 6,
                      'destroyed': 1,
                      'did': 2,
                      'divided': 1,
                      'doth': 1,
                      'ended': 1,
                      'face': 1,
                      'forbid': 2,
                      'formed': 2,
                      'gave': 1,
                      'give': 1,
                      'had': 10,
                      'hath': 21,
                      'healed': 1,
                      'heard': 1,
                      'hearkened': 2,
                      'is': 5,
                      'looked': 1,
                      'made': 7,
                      'make': 2,
                      'meant': 1,
                      'met': 1,
                      'moved': 1,
                      'of': 27,
                      'opened': 1,
                      'planted': 1,
                      'remembered': 3,
                      'said': 26,
                      'saw': 9,
                      'seest': 1,
                      'sent': 2,
                      'set': 1,
                      'shall': 3,
                      'spake': 5,
                      'suffered': 1,
                      'talked': 1,
                      'that': 1,
                      'to': 1,
                      'took': 2,
                      'unto': 1,
                      'walking': 1,
                      'was': 2,
                      'went': 2,
                      'which': 1,
                      'will': 5}),
             'My': Counter({'LORD': 1,
                      'brethren': 2,
                      'father': 5,
                      'lord': 3,
                      'money': 1,
                      'punishment': 1,
                      's': 1,
                      'son': 2,
                      'spirit': 1,
                      'wrong': 1}),
             'Elbethel': Counter({':': 1}),
             'covered': Counter({'.': 2, 'her': 2, 'herself': 1, 'the': 1}),
             'shaved': Counter({'himself': 1}),
             'more': Counter({'.': 4,
                      ';': 1,
                      'Jacob': 2,
                      'be': 2,
                      'become': 1,
                      'by': 1,
                      'every': 1,
                      'for': 2,
                      'honourable': 1,
                      'righteous': 1,
                      'subtil': 1,
                      'than': 4}),
             'sure': Counter({'Unto': 1, 'unto': 1}),
             'tarry': Counter({'all': 1, 'n': 1, 'with': 1}),
             'Hamathite': Counter({':': 1}),
             'Only': Counter({'herein': 1, 'the': 1}),
             'foolishly': Counter({'in': 1}),
             'fetch': Counter({'a': 1, 'me': 2, 'thee': 1, 'your': 1}),
             'messes': Counter({'unto': 1}),
             'obeyed': Counter({'his': 1, 'my': 2}),
             'beautiful': Counter({'and': 1}),
             'sou': Counter({'all': 1, 'and': 1}),
             'sixth': Counter({'day': 1, 'son': 1}),
             'Arodi': Counter({',': 1}),
             'Eden': Counter({',': 1,
                      '.': 1,
                      ';': 1,
                      'Cherubims': 1,
                      'to': 2}),
             'repenteth': Counter({'me': 1}),
             'marriages': Counter({'with': 1}),
             'camest': Counter({',': 1, '?': 1, 'thou': 1}),
             'first': Counter({'.': 2,
                      'born': 3,
                      'came': 1,
                      'day': 3,
                      'famine': 1,
                      'is': 1,
                      'month': 1,
                      'seven': 1,
                      'time': 2,
                      'year': 1}),
             'balm': Counter({',': 1, 'and': 1}),
             'is': Counter({',': 2,
                      '.': 1,
                      '?': 1,
                      'Abel': 1,
                      'Beersheba': 1,
                      'Bethlehem': 2,
                      'Edom': 3,
                      'Esau': 1,
                      'Euphrates': 1,
                      'Gihon': 1,
                      'God': 1,
                      'Hebron': 3,
                      'Hiddekel': 1,
                      'Jacob': 2,
                      'Kadesh': 1,
                      'Pison': 1,
                      'Sarah': 1,
                      'Zoar': 2,
                      'a': 12,
                      'about': 2,
                      'an': 3,
                      'as': 1,
                      'bdellium': 1,
                      'because': 1,
                      'become': 2,
                      'before': 6,
                      'behind': 2,
                      'better': 1,
                      'between': 2,
                      'beyond': 2,
                      'born': 2,
                      'bought': 1,
                      'bound': 1,
                      'brought': 1,
                      'by': 2,
                      'called': 1,
                      'come': 2,
                      'corn': 1,
                      'dead': 2,
                      'eaten': 1,
                      'eight': 1,
                      'enough': 1,
                      'established': 1,
                      'even': 1,
                      'evil': 1,
                      'filled': 1,
                      'fle': 1,
                      'found': 3,
                      'gold': 1,
                      'good': 2,
                      'governor': 1,
                      'great': 1,
                      'greater': 1,
                      'he': 1,
                      'his': 1,
                      'in': 15,
                      'it': 10,
                      'large': 1,
                      'left': 3,
                      'life': 1,
                      'mi': 1,
                      'mine': 1,
                      'my': 15,
                      'near': 1,
                      'ninety': 1,
                      'no': 1,
                      'none': 4,
                      'not': 17,
                      'now': 2,
                      'old': 1,
                      'on': 1,
                      'one': 3,
                      'ou': 1,
                      'our': 1,
                      'ours': 1,
                      'pleasant': 1,
                      'preserved': 1,
                      'required': 1,
                      'restored': 1,
                      'said': 2,
                      'seed': 1,
                      'si': 1,
                      'sore': 1,
                      'spent': 1,
                      'that': 1,
                      'the': 26,
                      'there': 1,
                      'therein': 3,
                      'thine': 2,
                      'this': 14,
                      'thy': 5,
                      'to': 1,
                      'torn': 1,
                      'uncircumcised': 1,
                      'upon': 6,
                      'very': 1,
                      'waxen': 1,
                      'well': 1,
                      'with': 8,
                      'without': 1,
                      'witness': 1,
                      'worth': 2,
                      'yet': 4,
                      'your': 3}),
             'Kedar': Counter({',': 1}),
             'sackcloth': Counter({'upon': 1}),
             'clave': Counter({'the': 1, 'unto': 1}),
             'deed': Counter({'is': 1}),
             'messenger': Counter({'unto': 1}),
             'seed': Counter({',': 4,
                      '.': 3,
                      ':': 1,
                      ';': 3,
                      'after': 10,
                      'alive': 1,
                      'all': 1,
                      'also': 1,
                      'and': 1,
                      'as': 3,
                      'be': 2,
                      'brought': 1,
                      'exceedingly': 1,
                      'for': 3,
                      'have': 1,
                      'instead': 1,
                      'is': 1,
                      'of': 3,
                      'possess': 1,
                      'shall': 7,
                      'should': 1,
                      'to': 3,
                      'was': 1,
                      'will': 2,
                      'with': 2}),
             'blasted': Counter({'with': 3}),
             'wall': Counter({'.': 1}),
             'under': Counter({',': 1,
                      'an': 1,
                      'heaven': 1,
                      'her': 1,
                      'his': 1,
                      'my': 2,
                      'one': 1,
                      'the': 9}),
             'Shur': Counter({',': 2, '.': 1}),
             'Goshen': Counter({',': 2, '.': 4, ';': 3, 'let': 1}),
             'Peace': Counter({'be': 1}),
             'communing': Counter({'with': 1}),
             'lo': Counter({',': 11}),
             'boys': Counter({'grew': 1}),
             'died': Counter({')': 1,
                      ',': 12,
                      '.': 10,
                      ';': 2,
                      'before': 1,
                      'by': 1,
                      'in': 5,
                      'that': 1}),
             'pluckt': Counter({'o': 1}),
             'confederate': Counter({'with': 1}),
             'Seeing': Counter({'that': 1}),
             'aw': Counter({'all': 1, 'and': 1}),
             'childless': Counter({',': 1}),
             'divide': Counter({'the': 4, 'them': 1}),
             'hear': Counter({',': 1,
                      ';': 1,
                      '?': 1,
                      'I': 1,
                      'me': 2,
                      'will': 1}),
             'northward': Counter({',': 1}),
             'man': Counter({"'": 12,
                      ',': 21,
                      '.': 3,
                      ':': 1,
                      ';': 7,
                      '?': 1,
                      'a': 1,
                      'according': 3,
                      'and': 2,
                      'as': 1,
                      'asked': 2,
                      'became': 1,
                      'bowed': 1,
                      'brought': 2,
                      'came': 1,
                      'can': 1,
                      'changes': 1,
                      'child': 3,
                      'deferred': 1,
                      'did': 2,
                      'discreet': 1,
                      'found': 1,
                      'from': 1,
                      'his': 7,
                      'in': 6,
                      'increased': 1,
                      'is': 3,
                      'known': 1,
                      'leave': 1,
                      'lift': 1,
                      'of': 3,
                      'on': 1,
                      'or': 1,
                      'said': 3,
                      'shall': 1,
                      'should': 1,
                      'to': 4,
                      'took': 1,
                      'unto': 1,
                      'was': 1,
                      'waxed': 1,
                      'whether': 1,
                      'whom': 2,
                      'with': 2,
                      'wondering': 1}),
             'redeemed': Counter({'me': 1}),
             'lentiles': Counter({';': 1}),
             'slayeth': Counter({'Cain': 1}),
             'through': Counter({'all': 1, 'the': 3, 'them': 1}),
             'moveth': Counter({',': 1, 'upon': 2}),
             'Know': Counter({'of': 1, 'ye': 1}),
             'conspired': Counter({'against': 1}),
             'Isa': Counter({'And': 1, 'and': 1, 'the': 1}),
             'each': Counter({'man': 6, 'piece': 1}),
             'Jetheth': Counter({',': 1}),
             'meanest': Counter({'thou': 1}),
             'Because': Counter({'I': 3,
                      'of': 1,
                      'that': 1,
                      'the': 3,
                      'thou': 3}),
             'battle': Counter({'with': 1}),
             'entreated': Counter({'Abram': 1}),
             'Arbah': Counter({',': 1}),
             'plain': Counter({',': 4, ';': 1, 'in': 1, 'man': 1, 'of': 5}),
             'forget': Counter({'all': 1, 'that': 1}),
             'slay': Counter({',': 1,
                      'also': 1,
                      'him': 2,
                      'his': 1,
                      'me': 3,
                      'my': 1,
                      'our': 1,
                      'the': 1}),
             'ceased': Counter({'to': 1}),
             'Uz': Counter({',': 2}),
             'ye': Counter({',': 2,
                      ';': 2,
                      '?': 2,
                      'Laban': 1,
                      'a': 1,
                      'another': 1,
                      'are': 6,
                      'be': 1,
                      'bereaved': 1,
                      'bring': 1,
                      'come': 1,
                      'die': 1,
                      'eat': 1,
                      'fall': 1,
                      'find': 1,
                      'fruitful': 1,
                      'go': 1,
                      'had': 1,
                      'hate': 1,
                      'have': 3,
                      'here': 1,
                      'know': 1,
                      'look': 1,
                      'marriages': 1,
                      'may': 1,
                      'moreover': 1,
                      'not': 3,
                      'rewarded': 1,
                      'say': 1,
                      'shall': 23,
                      'so': 2,
                      'sold': 2,
                      'sons': 1,
                      'spake': 2,
                      'speak': 2,
                      'take': 1,
                      'the': 1,
                      'therein': 1,
                      'thought': 1,
                      'to': 2,
                      'touch': 1,
                      'will': 4,
                      'wives': 1,
                      'would': 1,
                      'your': 1}),
             'archer': Counter({'.': 1}),
             'Pison': Counter({':': 1}),
             'love': Counter({',': 1, 'And': 1, 'he': 1, 'me': 1}),
             'rise': Counter({'up': 2}),
             'grave': Counter({'.': 3, ':': 1, 'unto': 2, 'which': 1}),
             'tarried': Counter({'all': 2, 'there': 1}),
             'sprung': Counter({'up': 2}),
             'so': Counter({',': 4,
                      '.': 7,
                      ':': 2,
                      'And': 1,
                      'I': 2,
                      'Noah': 1,
                      'Reuben': 1,
                      'and': 1,
                      'are': 1,
                      'commanded': 1,
                      'did': 1,
                      'discreet': 1,
                      'doing': 2,
                      'done': 1,
                      'he': 1,
                      'hotly': 1,
                      'ill': 1,
                      'it': 1,
                      'much': 2,
                      'now': 1,
                      'quickly': 1,
                      'sadly': 1,
                      'shall': 1,
                      'sore': 1,
                      'that': 8,
                      'the': 3,
                      'therefore': 1,
                      'they': 1,
                      'wickedly': 1,
                      'will': 1}),
             'fall': Counter({'backward': 1, 'not': 1, 'upon': 2}),
             'weig': Counter({'and': 1}),
             'ungirded': Counter({'his': 1}),
             'service': Counter({'which': 2}),
             'day': Counter({',': 18,
                      '.': 13,
                      ';': 2,
                      '?': 2,
                      'Abraham': 1,
                      'God': 1,
                      'and': 3,
                      'breaketh': 1,
                      'by': 1,
                      'entered': 1,
                      'from': 3,
                      'of': 7,
                      'on': 1,
                      'that': 5,
                      'the': 3,
                      'thy': 1,
                      'unto': 2,
                      'was': 1,
                      'were': 1,
                      'when': 1,
                      'with': 2,
                      'ye': 1}),
             'hard': Counter({'for': 1, 'labour': 2}),
             'At': Counter({'the': 1}),
             ')': Counter({'Even': 1, 'that': 1}),
             'seas': Counter({',': 1}),
             'Masrekah': Counter({'reigned': 1}),
             'grove': Counter({'in': 1}),
             'daughers': Counter({'of': 2}),
             'die': Counter({',': 5,
                      '.': 10,
                      ':': 6,
                      'also': 1,
                      'before': 1,
                      'for': 1,
                      'in': 1}),
             'vessels': Counter({',': 1}),
             'jewels': Counter({'of': 2}),
             'Take': Counter({',': 1,
                      'also': 1,
                      'heed': 1,
                      'me': 1,
                      'now': 1,
                      'thou': 1}),
             'toucheth': Counter({'this': 1}),
             'mourn': Counter({'for': 1}),
             'Chesed': Counter({',': 1}),
             'built': Counter({'an': 1, 'him': 1, 'there': 2}),
             'bakemeats': Counter({'for': 1}),
             'Laban': Counter({"'": 4,
                      ',': 8,
                      ':': 2,
                      ';': 1,
                      'all': 1,
                      'and': 1,
                      'answered': 1,
                      'called': 1,
                      'departed': 1,
                      'doeth': 1,
                      'gathered': 1,
                      'gave': 4,
                      'had': 1,
                      'heard': 1,
                      'his': 3,
                      'my': 1,
                      'on': 1,
                      'overtook': 1,
                      'ran': 1,
                      'rose': 1,
                      'said': 9,
                      'searched': 1,
                      'the': 4,
                      'thy': 1,
                      'went': 2,
                      'with': 1}),
             'numbering': Counter({';': 1}),
             'husband': Counter({',': 1,
                      '?': 1,
                      'Abram': 1,
                      'be': 1,
                      'dwell': 1,
                      'will': 1,
                      'with': 1}),
             'friends': Counter({',': 1}),
             'Horites': Counter({',': 1, ';': 1, 'in': 1}),
             'Padanaram': Counter({',': 6, '.': 1, ';': 2, 'unto': 1}),
             'fallen': Counter({'?': 1}),
             'home': Counter({',': 2, '.': 1}),
             'tithes': Counter({'of': 1}),
             'killed': Counter({'a': 1}),
             'smelled': Counter({'a': 1, 'the': 1}),
             'small': Counter({'and': 1, 'matter': 1}),
             'younger': Counter({',': 4,
                      '.': 1,
                      'arose': 1,
                      'before': 1,
                      'brother': 2,
                      'daughter': 1,
                      's': 1,
                      'son': 2,
                      'was': 1}),
             'Phuvah': Counter({',': 1}),
             'storehouses': Counter({',': 1}),
             'smooth': Counter({'m': 1, 'of': 1}),
             'red': Counter({',': 1, 'pottage': 1, 'with': 1}),
             'tidings': Counter({'of': 1}),
             'did': Counter({',': 1,
                      '.': 1,
                      'Noah': 1,
                      'Sarah': 1,
                      'according': 2,
                      'as': 1,
                      'bake': 1,
                      'bear': 1,
                      'command': 1,
                      'conceive': 1,
                      'displeased': 1,
                      'e': 1,
                      'eat': 18,
                      'he': 2,
                      'interpret': 1,
                      'not': 3,
                      'send': 1,
                      'separate': 1,
                      'so': 3,
                      'solemnly': 1,
                      'strive': 1,
                      'tempt': 1,
                      'the': 3,
                      'there': 2,
                      'thy': 1,
                      'to': 1,
                      'unto': 4,
                      'walk': 1,
                      'yearn': 1}),
             'feebler': Counter({'were': 1}),
             'mocked': Counter({'unto': 1}),
             'king': Counter({"'": 2, 'of': 29, 'over': 1}),
             'knife': Counter({';': 1, 'to': 1}),
             'overthrew': Counter({'the': 1, 'those': 1}),
             'gavest': Counter({'to': 1}),
             'devoured': Counter({'also': 1, 'h': 1, 'him': 1, 'the': 2}),
             'Baalhanan': Counter({'the': 2}),
             'Stand': Counter({'back': 1}),
             'content': Counter({'.': 1}),
             'next': Counter({'year': 1}),
             'Zeboim': Counter({',': 1}),
             'shall': Counter({'I': 12,
                      'Israel': 1,
                      'Pharaoh': 2,
                      'Sarah': 1,
                      'a': 1,
                      'add': 1,
                      'afflict': 1,
                      'all': 6,
                      'arise': 1,
                      'be': 77,
                      'bear': 4,
                      'become': 2,
                      'befall': 1,
                      'bless': 1,
                      'bow': 1,
                      'bring': 3,
                      'bruise': 1,
                      'call': 1,
                      'carry': 1,
                      'circumcise': 1,
                      'cleave': 1,
                      'come': 16,
                      'comfort': 1,
                      'consume': 1,
                      'devour': 1,
                      'die': 1,
                      'dine': 1,
                      'divide': 1,
                      'dwell': 4,
                      'eat': 2,
                      'enlarge': 1,
                      'escape': 1,
                      'fall': 1,
                      'gather': 1,
                      'give': 3,
                      'go': 1,
                      'hang': 1,
                      'haste': 1,
                      'have': 2,
                      'he': 2,
                      'help': 1,
                      'her': 1,
                      'his': 1,
                      'inherit': 1,
                      'it': 1,
                      'judge': 1,
                      'keep': 2,
                      'lack': 1,
                      'lie': 1,
                      'live': 2,
                      'my': 2,
                      'neither': 1,
                      'no': 2,
                      'not': 17,
                      'overcome': 2,
                      'pass': 1,
                      'possess': 1,
                      'praise': 1,
                      'pray': 1,
                      'put': 1,
                      'ravin': 1,
                      'rise': 1,
                      'rouse': 1,
                      'rule': 1,
                      'say': 7,
                      'see': 3,
                      'seem': 1,
                      'send': 1,
                      'serve': 3,
                      'slay': 1,
                      'sow': 1,
                      'surely': 2,
                      'tell': 2,
                      'the': 2,
                      'there': 1,
                      'they': 1,
                      'thirty': 1,
                      'this': 1,
                      'thy': 5,
                      'traffick': 1,
                      'we': 4,
                      'withhold': 1,
                      'ye': 6,
                      'yield': 1,
                      'your': 1}),
             'swear': Counter({',': 2, '.': 2, 'by': 1, 'unto': 1}),
             'Aholibamah': Counter({',': 2, 'Esau': 1, 'bare': 1, 'the': 3}),
             'Zeboiim': Counter({',': 2}),
             'Shepho': Counter({',': 1}),
             'yonder': Counter({'and': 1}),
             'might': Counter({',': 1,
                      'be': 1,
                      'conceive': 1,
                      'dwell': 2,
                      'have': 2,
                      'lightly': 1,
                      'live': 1,
                      'not': 1,
                      'rid': 1}),
             'Machpelah': Counter({',': 5, 'before': 1}),
             'eyes': Counter({',': 18,
                      '.': 4,
                      'and': 2,
                      'of': 7,
                      'see': 1,
                      'shall': 2,
                      'upon': 2,
                      'were': 1}),
             'mountains': Counter({'of': 1, 'seen': 1, 'were': 1, 'which': 1}),
             'goat': Counter({'of': 1}),
             'female': Counter({',': 1,
                      '.': 2,
                      ';': 1,
                      'created': 2,
                      'of': 1}),
             'horses': Counter({',': 1}),
             'lad': Counter({"'": 1,
                      ',': 3,
                      ';': 2,
                      'a': 1,
                      'be': 2,
                      'cannot': 1,
                      'drink': 1,
                      'go': 1,
                      'is': 1,
                      'unto': 1,
                      'was': 1,
                      'where': 1,
                      'will': 1,
                      'with': 1}),
             'fai': Counter({'And': 1, 'therefore': 1}),
             'clothed': Counter({'them': 1}),
             'tongue': Counter({',': 1}),
             'thin': Counter({',': 1, 'and': 1, 'ears': 3}),
             'continually': Counter({'.': 1, ':': 1, 'until': 1}),
             'truth': Counter({',': 1, 'in': 1}),
             'Out': Counter({'of': 2}),
             'seasons': Counter({',': 1}),
             'Merari': Counter({'.': 1}),
             'trembled': Counter({'very': 1}),
             'Jachin': Counter({',': 1}),
             'evening': Counter({',': 3, ';': 1, 'and': 6}),
             'prisoners': Counter({'that': 1, 'were': 1}),
             'them': Counter({',': 75,
                      '.': 25,
                      ':': 2,
                      ';': 11,
                      'Simeon': 1,
                      'a': 3,
                      'abroad': 2,
                      'after': 1,
                      'again': 1,
                      'alive': 2,
                      'all': 1,
                      'are': 1,
                      'as': 1,
                      'away': 3,
                      'be': 2,
                      'because': 1,
                      'both': 2,
                      'bread': 1,
                      'by': 2,
                      'dwe': 1,
                      'dwell': 1,
                      'for': 1,
                      'forth': 1,
                      'four': 1,
                      'from': 3,
                      'gather': 1,
                      'greatly': 1,
                      'grow': 1,
                      'have': 1,
                      'he': 1,
                      'his': 1,
                      'in': 9,
                      'into': 3,
                      'is': 1,
                      'keep': 1,
                      'made': 1,
                      'near': 2,
                      'not': 5,
                      'of': 4,
                      'on': 1,
                      'one': 1,
                      'opened': 1,
                      'our': 1,
                      'out': 6,
                      'over': 1,
                      'provision': 2,
                      'rose': 1,
                      'rulers': 1,
                      'savoury': 1,
                      'say': 1,
                      'seven': 1,
                      'take': 1,
                      'that': 3,
                      'the': 1,
                      'these': 1,
                      'thoroughly': 1,
                      'to': 5,
                      'together': 2,
                      'under': 2,
                      'unto': 8,
                      'up': 1,
                      'upon': 1,
                      'wagons': 1,
                      'was': 1,
                      'water': 1,
                      'were': 1,
                      'with': 3,
                      'wives': 2}),
             'throughout': Counter({'all': 3}),
             'twelve': Counter({'brethren': 2,
                      'princes': 2,
                      'tribes': 1,
                      'years': 1}),
             'Jaalam': Counter({',': 3}),
             'Isaac': Counter({"'": 4,
                      ',': 11,
                      '.': 6,
                      ';': 3,
                      'all': 1,
                      'and': 3,
                      'answered': 1,
                      'being': 1,
                      'brought': 1,
                      'called': 1,
                      'came': 1,
                      'departed': 1,
                      'did': 1,
                      'digged': 1,
                      'dwelt': 2,
                      'gave': 1,
                      'had': 2,
                      'his': 11,
                      'intreated': 1,
                      'loved': 1,
                      'said': 5,
                      'sent': 2,
                      'shall': 1,
                      'sojourned': 1,
                      'sowed': 1,
                      'spake': 2,
                      'trembled': 1,
                      'was': 7,
                      'went': 2,
                      'were': 1}),
             'comforted': Counter({',': 1, ';': 1, 'after': 1, 'them': 1}),
             'fear': Counter({'G': 1, 'him': 1, 'not': 4, 'of': 4, 'ye': 1}),
             'Ashbel': Counter({',': 1}),
             'Arphaxad': Counter({',': 1,
                      'begat': 1,
                      'five': 1,
                      'lived': 2,
                      'two': 1}),
             'dowry': Counter({';': 1, 'and': 1}),
             'wives': Counter({',': 7,
                      ':': 2,
                      'and': 1,
                      'beside': 1,
                      'of': 4,
                      'took': 1,
                      'upon': 1,
                      'which': 1,
                      'with': 4}),
             'with': Counter({'Abel': 1,
                      'Abrah': 1,
                      'Abram': 3,
                      'Bera': 1,
                      'Bilhah': 1,
                      'Birsha': 1,
                      'God': 4,
                      'Isaac': 3,
                      'Jacob': 2,
                      'Joseph': 2,
                      'Laban': 2,
                      'Rebekah': 1,
                      'Sarah': 1,
                      'Tidal': 1,
                      'a': 5,
                      'all': 4,
                      'an': 1,
                      'and': 1,
                      'blessings': 1,
                      'blindness': 1,
                      'bread': 2,
                      'chi': 1,
                      'child': 3,
                      'corn': 3,
                      'earth': 1,
                      'every': 1,
                      'five': 1,
                      'food': 1,
                      'great': 2,
                      'h': 3,
                      'harp': 1,
                      'her': 11,
                      'him': 48,
                      'his': 11,
                      'lower': 1,
                      'man': 1,
                      'me': 30,
                      'men': 1,
                      'milk': 1,
                      'mirth': 1,
                      'money': 2,
                      'my': 12,
                      'our': 3,
                      'pitch': 1,
                      'songs': 1,
                      'sorrow': 3,
                      'subtilty': 1,
                      'tabret': 1,
                      'th': 1,
                      'that': 1,
                      'the': 24,
                      'thee': 25,
                      'their': 3,
                      'them': 11,
                      'this': 1,
                      'thy': 3,
                      'us': 13,
                      'violence': 2,
                      'water': 1,
                      'whom': 2,
                      'wine': 1,
                      'you': 13,
                      'young': 1,
                      'your': 1,
                      'yourselves': 1}),
             'remained': Counter({'alive': 1, 'fled': 1}),
             'entered': Counter({'Noah': 1, 'into': 4}),
             'departed': Counter({',': 3,
                      '.': 1,
                      ':': 1,
                      ';': 1,
                      'from': 2,
                      'hence': 1,
                      'out': 1,
                      'thence': 2}),
             'stead': Counter({',': 1, '.': 5, 'of': 1}),
             'ask': Counter({'after': 1}),
             'begotten': Counter({'Seth': 1}),
             'found': Counter({',': 2,
                      '.': 1,
                      'a': 1,
                      'an': 1,
                      'favour': 2,
                      'grace': 6,
                      'her': 3,
                      'him': 1,
                      'in': 3,
                      'it': 1,
                      'mandrakes': 1,
                      'no': 1,
                      'not': 1,
                      'of': 1,
                      'out': 1,
                      'shall': 1,
                      'the': 1,
                      'them': 3,
                      'there': 5,
                      'water': 1}),
             'wine': Counter({',': 5,
                      ':': 1,
                      'and': 1,
                      'have': 1,
                      'that': 2,
                      'this': 1}),
             'war': Counter({'with': 1}),
             'Rephaims': Counter({',': 1, 'in': 1}),
             'boldly': Counter({',': 1}),
             'What': Counter({'God': 1,
                      'aileth': 1,
                      'deed': 1,
                      'hast': 3,
                      'is': 10,
                      'man': 1,
                      'mean': 1,
                      'meanest': 1,
                      'needeth': 1,
                      'pledge': 1,
                      'profit': 1,
                      'sawest': 1,
                      'seekest': 1,
                      'shall': 2,
                      'wilt': 1}),
             'spotted': Counter({',': 2,
                      '.': 1,
                      'among': 1,
                      'and': 1,
                      'cattle': 1}),
             'henceforth': Counter({'yield': 1}),
             'mother': Counter({"'": 9,
                      ',': 4,
                      '.': 1,
                      ':': 1,
                      ';': 1,
                      'Leah': 1,
                      'Sarah': 1,
                      'and': 1,
                      'made': 1,
                      'of': 3,
                      'precious': 1,
                      'said': 2,
                      'took': 1,
                      'with': 1}),
             'begat': Counter({'Abram': 2,
                      'Almodad': 1,
                      'Arphaxad': 2,
                      'Cainan': 2,
                      'Eber': 3,
                      'Eno': 1,
                      'Enoch': 1,
                      'Enos': 2,
                      'Isa': 1,
                      'Jared': 2,
                      'Lamech': 3,
                      'Lot': 1,
                      'Ludim': 1,
                      'Mahalaleel': 2,
                      'Mehujael': 1,
                      'Methusa': 1,
                      'Methuselah': 2,
                      'Nahor': 2,
                      'Nimrod': 1,
                      'Noah': 1,
                      'Peleg': 2,
                      'Rebekah': 1,
                      'Reu': 2,
                      'Salah': 3,
                      'Serug': 2,
                      'Sheba': 1,
                      'Shem': 1,
                      'Sidon': 1,
                      'Terah': 2,
                      'a': 2,
                      'sons': 17,
                      'three': 1}),
             'things': Counter({',': 6,
                      '.': 5,
                      'are': 1,
                      'in': 1,
                      'of': 1,
                      'that': 1,
                      'the': 1}),
             'heifer': Counter({'of': 1}),
             'Bethlehem': Counter({'.': 2}),
             'night': Counter({',': 11,
                      '.': 3,
                      ':': 1,
                      ';': 4,
                      '?': 1,
                      'also': 2,
                      'for': 1,
                      'he': 1,
                      'in': 2,
                      'shall': 1}),
             'fruits': Counter({'in': 1}),
             'can': Counter({'I': 2,
                      'bear': 1,
                      'carry': 1,
                      'certainly': 1,
                      'interpret': 1,
                      'number': 1}),
             'ram': Counter({',': 1, 'caught': 1, 'of': 1}),
             'master': Counter({"'": 7,
                      ',': 4,
                      '.': 3,
                      'Abraham': 5,
                      'greatly': 1,
                      'heard': 1,
                      'made': 1,
                      'of': 1,
                      'saw': 1,
                      'the': 1,
                      'took': 1,
                      'were': 1,
                      'when': 1,
                      'wotteth': 1}),
             'arrayed': Counter({'him': 1}),
             'divided': Counter({';': 1,
                      'he': 1,
                      'himself': 1,
                      'in': 2,
                      'the': 4,
                      'them': 1}),
             ';': Counter({'(': 2,
                      'Adah': 1,
                      'Alvan': 1,
                      'And': 34,
                      'Ashkenaz': 1,
                      'Because': 1,
                      'Bilhan': 1,
                      'But': 1,
                      'Cush': 1,
                      'Dan': 1,
                      'Dishon': 1,
                      'Elam': 1,
                      'Eliphaz': 1,
                      'Elishah': 1,
                      'Ephah': 1,
                      'Er': 1,
                      'Every': 1,
                      'Gershon': 1,
                      'Gomer': 1,
                      'Hanoch': 1,
                      'Heber': 1,
                      'Hemdan': 1,
                      'Hushim': 1,
                      'Huz': 1,
                      'I': 5,
                      'In': 1,
                      'It': 1,
                      'Jahzeel': 1,
                      'Jemuel': 1,
                      'Jimnah': 1,
                      'Joseph': 4,
                      'Lotan': 1,
                      'Male': 1,
                      'Nahath': 1,
                      'Reuben': 1,
                      'Seba': 1,
                      'Seeing': 1,
                      'Sered': 1,
                      'Sheba': 1,
                      'That': 2,
                      'The': 2,
                      'Then': 2,
                      'There': 1,
                      'They': 1,
                      'Thorns': 1,
                      'Thy': 1,
                      'Tola': 1,
                      'Two': 1,
                      'Until': 1,
                      'Unto': 1,
                      'Uz': 2,
                      'When': 1,
                      'With': 1,
                      'Ziphion': 1,
                      'a': 3,
                      'according': 1,
                      'after': 2,
                      'an': 1,
                      'and': 262,
                      'arise': 1,
                      'as': 2,
                      'at': 3,
                      'because': 13,
                      'behold': 1,
                      'both': 2,
                      'bring': 1,
                      'bury': 1,
                      'but': 16,
                      'do': 1,
                      'doth': 1,
                      'duke': 5,
                      'dwell': 2,
                      'escape': 1,
                      'even': 1,
                      'every': 2,
                      'for': 67,
                      'four': 1,
                      'from': 1,
                      'he': 4,
                      'hear': 1,
                      'his': 1,
                      'in': 7,
                      'instruments': 1,
                      'into': 1,
                      'it': 4,
                      'keep': 1,
                      'kings': 1,
                      'lade': 1,
                      'leave': 1,
                      'lest': 3,
                      'let': 3,
                      'look': 1,
                      'male': 1,
                      'me': 1,
                      'my': 1,
                      'neither': 4,
                      'none': 1,
                      'now': 2,
                      'of': 2,
                      'one': 1,
                      'or': 1,
                      'peradventure': 2,
                      'put': 1,
                      'rooms': 1,
                      'saying': 1,
                      'see': 1,
                      'seeing': 1,
                      'send': 1,
                      'she': 2,
                      'so': 1,
                      'take': 3,
                      'ten': 1,
                      'that': 12,
                      'the': 5,
                      'then': 7,
                      'there': 2,
                      'therefore': 2,
                      'these': 2,
                      'they': 1,
                      'thou': 2,
                      'thy': 2,
                      'to': 4,
                      'twelve': 2,
                      'unto': 1,
                      'upon': 1,
                      'walk': 1,
                      'we': 3,
                      'what': 2,
                      'wherefore': 1,
                      'which': 1,
                      'who': 2,
                      'whose': 1,
                      'with': 1,
                      'ye': 1}),
             'physicians': Counter({'embalmed': 1, 'to': 1}),
             'feed': Counter({'and': 1,
                      'cattle': 1,
                      'the': 1,
                      'their': 2,
                      'them': 1}),
             'poured': Counter({'a': 1, 'oil': 2}),
             'Eve': Counter({';': 1, 'his': 1}),
             'brown': Counter({'among': 2, 'cattle': 1, 'in': 1}),
             'tops': Counter({'of': 1}),
             'add': Counter({'to': 1}),
             'sorrow': Counter({'and': 1, 'shalt': 1, 'thou': 1, 'to': 3}),
             'lights': Counter({';': 1, 'in': 2}),
             'mischief': Counter({'befall': 3}),
             'mind': Counter({'that': 1, 'unto': 1}),
             'sir': Counter({',': 1}),
             'field': Counter({',': 15,
                      '.': 1,
                      ';': 6,
                      'and': 1,
                      'at': 1,
                      'before': 2,
                      'for': 1,
                      'give': 1,
                      'in': 1,
                      'of': 8,
                      'to': 2,
                      'unto': 1,
                      'when': 1,
                      'which': 3}),
             'Hezron': Counter({',': 1, 'and': 1}),
             'scarlet': Counter({'thread': 2}),
             'pleasant': Counter({';': 1, 'to': 2}),
             'earrings': Counter({'which': 1}),
             'brethren': Counter({',': 36,
                      '.': 8,
                      ':': 3,
                      ';': 3,
                      'also': 1,
                      'and': 1,
                      'are': 2,
                      'away': 1,
                      'be': 1,
                      'came': 2,
                      'could': 1,
                      'did': 1,
                      'discern': 1,
                      'envied': 1,
                      'feed': 1,
                      'have': 1,
                      'here': 1,
                      'in': 1,
                      'indeed': 1,
                      'pitched': 1,
                      'said': 1,
                      'saw': 2,
                      'shall': 1,
                      'talked': 1,
                      'to': 2,
                      'went': 2,
                      'were': 1,
                      'with': 1,
                      'without': 1}),
             'sto': Counter({'and': 1}),
             'leave': Counter({'his': 3, 'one': 1, 'thee': 1, 'with': 1}),
             'shrank': Counter({',': 1, '.': 1}),
             'Hebrew': Counter({',': 1, ';': 1, 'servant': 1, 'unto': 1}),
             'whensoever': Counter({'the': 1}),
             'power': Counter({'I': 1, 'of': 1, 'with': 1}),
             'yielded': Counter({'up': 1}),
             'till': Counter({'Shelah': 1, 'the': 2, 'they': 1, 'thou': 3}),
             'shadow': Counter({'of': 1}),
             'any': Counter({'beast': 1,
                      'besides': 1,
                      'finding': 1,
                      'king': 1,
                      'man': 1,
                      'men': 1,
                      'more': 7,
                      'of': 1,
                      'portion': 1,
                      'stranger': 1,
                      'thi': 1,
                      'thing': 5,
                      'truth': 1}),
             'created': Counter({',': 1,
                      '.': 1,
                      'and': 1,
                      'from': 1,
                      'great': 1,
                      'he': 3,
                      'man': 2,
                      'the': 1}),
             'plains': Counter({'of': 1}),
             'In': Counter({'my': 2, 'the': 9, 'thee': 1}),
             'Mesopotamia': Counter({',': 1}),
             'ways': Counter({'.': 1}),
             'Nod': Counter({',': 1}),
             'season': Counter({'in': 1}),
             'Be': Counter({'fruitful': 3}),
             'lamb': Counter({'for': 2}),
             'spe': Counter({'Peradventure': 1}),
             'fowls': Counter({',': 1,
                      'after': 1,
                      'also': 1,
                      'came': 1,
                      'of': 1}),
             'proved': Counter({',': 1, ':': 1}),
             'foot': Counter({',': 1, 'in': 1}),
             'Beno': Counter({'but': 1}),
             'beasts': Counter({',': 3, 'I': 1, 'that': 2}),
             'breadth': Counter({'of': 2}),
             'drink': Counter({',': 5,
                      '.': 4,
                      ';': 2,
                      'a': 1,
                      'al': 2,
                      'also': 1,
                      'offering': 1,
                      'thou': 1,
                      'wine': 4}),
             'hunting': Counter({'.': 1}),
             'burning': Counter({'lamp': 1}),
             'establish': Counter({'my': 5, 'with': 1}),
             'run': Counter({'over': 1}),
             'persons': Counter({',': 1, 'of': 1}),
             'Lift': Counter({'up': 2}),
             'remaineth': Counter({',': 1}),
             'stand': Counter({'by': 1, 'here': 1}),
             'Sarai': Counter({"'": 1,
                      ',': 2,
                      '.': 2,
                      ';': 1,
                      'Abram': 3,
                      'dealt': 1,
                      'his': 3,
                      'said': 2,
                      'thy': 1,
                      'was': 1}),
             'Potiphar': Counter({',': 2}),
             'summer': Counter({'and': 1}),
             'womenservants': Counter({',': 2}),
             'tim': Counter({'he': 1}),
             'sowed': Counter({'in': 1}),
             'knowledge': Counter({'of': 2}),
             'Peradventure': Counter({'ten': 1,
                      'the': 2,
                      'there': 5,
                      'thou': 1}),
             'laugh': Counter({',': 2, '.': 1, 'with': 1}),
             'befall': Counter({'him': 3, 'you': 1}),
             'thousand': Counter({'pieces': 1}),
             'dust': Counter({'and': 1, 'of': 4, 'shalt': 2, 'thou': 1}),
             'Jokshan': Counter({',': 1, 'begat': 1}),
             'smite': Counter({'any': 1, 'it': 1, 'me': 1}),
             'sold': Counter({'Joseph': 1,
                      'every': 1,
                      'him': 1,
                      'his': 1,
                      'into': 1,
                      'me': 1,
                      'not': 1,
                      'to': 1,
                      'unto': 1,
                      'us': 1}),
             'eyed': Counter({';': 1}),
             'visit': Counter({'you': 2}),
             'lightly': Counter({'have': 1}),
             'fine': Counter({'linen': 1, 'meal': 1}),
             'males': Counter({'.': 1}),
             'right': Counter({';': 1, '?': 1, 'hand': 7, 'way': 1}),
             'badne': Counter({'And': 1}),
             'speckl': Counter({'and': 1}),
             'withheld': Counter({'from': 1, 'thee': 1, 'thy': 2}),
             'Return': Counter({'to': 1, 'unto': 2}),
             'saddled': Counter({'his': 1}),
             'nativity': Counter({',': 1}),
             'herb': Counter({'bearing': 1,
                      'for': 1,
                      'have': 1,
                      'of': 2,
                      'yielding': 2}),
             'anguish': Counter({'of': 1}),
             'Beersheba': Counter({',': 3, '.': 3, ':': 2, ';': 2, 'unto': 1}),
             'Abram': Counter({"'": 7,
                      ',': 13,
                      '.': 2,
                      ';': 1,
                      'a': 1,
                      'and': 2,
                      'called': 2,
                      'departed': 1,
                      'drove': 1,
                      'dwelled': 1,
                      'fell': 1,
                      'had': 1,
                      'heard': 1,
                      'hearkened': 1,
                      'his': 1,
                      'in': 1,
                      'journeyed': 1,
                      'of': 1,
                      'passed': 1,
                      'removed': 1,
                      'ri': 1,
                      'said': 5,
                      'the': 1,
                      'to': 1,
                      'took': 1,
                      'was': 5,
                      'well': 1,
                      'went': 2}),
             'Almodad': Counter({',': 1}),
             'dry': Counter({'.': 1, 'land': 3}),
             'si': Counter({'and': 1}),
             'Set': Counter({'on': 1}),
             'displeased': Counter({'h': 1, 'the': 1}),
             'breaking': Counter({'of': 1}),
             'strange': Counter({'gods': 2, 'unto': 1}),
             'strong': Counter({'ass': 1, 'by': 1}),
             'mercy': Counter({',': 2, 'and': 1, 'before': 1}),
             'visited': Counter({'Sarah': 1}),
             'uncovered': Counter({'within': 1}),
             'Shaveh': Counter({',': 1, 'Kiriathaim': 1}),
             'folly': Counter({'in': 1}),
             'Amalek': Counter({':': 1}),
             'Also': Counter({'he': 1, 'regard': 1}),
             'ringstraked': Counter({',': 4, '.': 1, 'and': 1, 'shall': 1}),
             'coat': Counter({',': 2, ';': 1, 'in': 1, 'of': 3, 'or': 1}),
             'Serug': Counter({':': 1, 'lived': 2, 'two': 1}),
             'eighteen': Counter({',': 1}),
             'fearest': Counter({'God': 1}),
             'trees': Counter({'of': 2, 'that': 1}),
             'findeth': Counter({'me': 1}),
             'beginning': Counter({',': 1, '.': 1, 'God': 1, 'of': 2}),
             'Ard': Counter({'.': 1}),
             'quiver': Counter({'and': 1}),
             'Some': Counter({'evil': 1}),
             'anointedst': Counter({'the': 1}),
             'goodly': Counter({'person': 1, 'raiment': 1, 'words': 1}),
             'exchange': Counter({'for': 1}),
             'whither': Counter({'goest': 1,
                      'shall': 1,
                      'thou': 1,
                      'we': 1,
                      'wilt': 1}),
             'Ashteroth': Counter({'Karnaim': 1}),
             'duke': Counter({'Alvah': 1,
                      'Amalek': 1,
                      'Anah': 1,
                      'Dishan': 1,
                      'Elah': 1,
                      'Ezer': 1,
                      'Gatam': 1,
                      'Iram': 1,
                      'Jaalam': 1,
                      'Jetheth': 1,
                      'Jeush': 1,
                      'Kenaz': 1,
                      'Kor': 1,
                      'Lotan': 1,
                      'Mibzar': 1,
                      'Mizz': 1,
                      'Nahath': 1,
                      'Omar': 1,
                      'Pinon': 1,
                      'Shammah': 1,
                      'Shobal': 1,
                      'Teman': 2,
                      'Timnah': 1,
                      'Zepho': 1,
                      'Zerah': 1,
                      'Zibeon': 1}),
             'sheweth': Counter({'unto': 1}),
             'cut': Counter({'off': 2}),
             'watering': Counter({'troughs': 1}),
             'men': Counter({"'": 2,
                      ',': 7,
                      '.': 2,
                      ';': 3,
                      'are': 2,
                      'began': 1,
                      'builded': 1,
                      'concerning': 1,
                      'consent': 1,
                      'do': 1,
                      'have': 1,
                      'home': 1,
                      'into': 2,
                      'laid': 1,
                      'marvelled': 1,
                      'of': 16,
                      'put': 1,
                      'rose': 1,
                      'said': 1,
                      'shall': 1,
                      'should': 1,
                      'stood': 1,
                      'that': 3,
                      'there': 1,
                      'to': 1,
                      'took': 1,
                      'turned': 1,
                      'were': 4,
                      'which': 3,
                      'with': 2}),
             'furniture': Counter({',': 1}),
             'sixty': Counter({'and': 6}),
             'marry': Counter({'her': 1}),
             'serve': Counter({',': 1,
                      'me': 1,
                      'the': 1,
                      'thee': 2,
                      'them': 1,
                      'thy': 1,
                      'with': 2}),
             'while': Counter({'.': 1, 'Joseph': 1, 'he': 3}),
             'Noah': Counter({"'": 2,
                      ',': 12,
                      '.': 1,
                      ':': 2,
                      ';': 1,
                      'and': 1,
                      'awoke': 1,
                      'began': 1,
                      'begat': 2,
                      'builded': 1,
                      'did': 1,
                      'five': 1,
                      'found': 1,
                      'into': 2,
                      'knew': 1,
                      'lived': 1,
                      'only': 1,
                      'opened': 1,
                      'removed': 1,
                      'walked': 1,
                      'was': 3,
                      'went': 2,
                      'were': 1}),
             'milch': Counter({'camels': 1}),
             'wash': Counter({'his': 1, 'your': 2}),
             'of': Counter({'.': 2,
                      ':': 1,
                      ';': 1,
                      'Abel': 1,
                      'Abimelech': 1,
                      'Abrah': 1,
                      'Abraham': 12,
                      'Abram': 2,
                      'Achbor': 2,
                      'Adah': 3,
                      'Adam': 2,
                      'Admah': 2,
                      'Aholibamah': 3,
                      'Ammon': 1,
                      'Amraphel': 1,
                      'An': 1,
                      'Anah': 5,
                      'Aram': 2,
                      'Ararat': 1,
                      'Arbah': 1,
                      'Asher': 2,
                      'Assyria': 1,
                      'Atad': 2,
                      'Bashemath': 3,
                      'Bedad': 1,
                      'Beeri': 1,
                      'Beersheba': 1,
                      'Bela': 2,
                      'Benjamin': 1,
                      'Beor': 1,
                      'Beriah': 1,
                      'Bethel': 2,
                      'Bethuel': 5,
                      'Bilhah': 3,
                      'Bozrah': 1,
                      'Cainan': 1,
                      'Cana': 3,
                      'Canaan': 38,
                      'Chedorlaomer': 1,
                      'Cush': 1,
                      'Damascus': 2,
                      'Dan': 1,
                      'Dedan': 1,
                      'Dishan': 1,
                      'Dishon': 1,
                      'Eber': 1,
                      'Edar': 1,
                      'Eden': 5,
                      'Edom': 6,
                      'Egy': 1,
                      'Egypt': 46,
                      'Elam': 2,
                      'Eliphaz': 3,
                      'Ellasar': 2,
                      'Elon': 2,
                      'Enoch': 1,
                      'Enos': 1,
                      'Ephraim': 1,
                      'Ephrath': 1,
                      'Ephron': 5,
                      'Es': 1,
                      'Esau': 13,
                      'Eshcol': 1,
                      'Ethiopia': 1,
                      'Ezer': 1,
                      'Gad': 1,
                      'Gerar': 3,
                      'Gilead': 1,
                      'God': 16,
                      'Gomer': 1,
                      'Gomorrah': 2,
                      'Goshen': 8,
                      'Hagar': 1,
                      'Ham': 2,
                      'Hamor': 2,
                      'Haran': 3,
                      'Havilah': 1,
                      'He': 1,
                      'Hebron': 1,
                      'Heth': 11,
                      'Hori': 1,
                      'Isa': 1,
                      'Isaac': 4,
                      'Iscah': 1,
                      'Ishmael': 6,
                      'Ishmeelites': 1,
                      'Isra': 1,
                      'Israel': 10,
                      'Issachar': 1,
                      'Jacob': 17,
                      'Japheth': 2,
                      'Jared': 1,
                      'Javan': 1,
                      'Joktan': 1,
                      'Jordan': 2,
                      'Joseph': 5,
                      'Judah': 1,
                      'Keturah': 1,
                      'Laban': 8,
                      'Lamech': 2,
                      'Leah': 4,
                      'Levi': 1,
                      'Lot': 2,
                      'Lotan': 1,
                      'Machir': 1,
                      'Machpelah': 5,
                      'Mahalaleel': 1,
                      'Mamre': 3,
                      'Man': 1,
                      'Manasseh': 1,
                      'Masrekah': 1,
                      'Matred': 1,
                      'Methuselah': 1,
                      'Mezahab': 1,
                      'Midian': 1,
                      'Milcah': 3,
                      'Moab': 1,
                      'Moreh': 1,
                      'Moriah': 1,
                      'Nahor': 5,
                      'Naphtali': 1,
                      'Nebajoth': 2,
                      'Noah': 8,
                      'Nod': 1,
                      'On': 3,
                      'Padanaram': 2,
                      'Paran': 1,
                      'Pharaoh': 15,
                      'Pharez': 1,
                      'Potipherah': 3,
                      'Raamah': 1,
                      'Rachel': 4,
                      'Rameses': 1,
                      'Rebekah': 2,
                      'Rehoboth': 1,
                      'Reuben': 1,
                      'Reuel': 3,
                      'Salem': 1,
                      'Sarah': 3,
                      'Sarai': 2,
                      'Seir': 4,
                      'Seth': 1,
                      'Shaveh': 1,
                      'Shechem': 3,
                      'Shem': 5,
                      'Shinar': 4,
                      'Shobal': 1,
                      'Shuah': 1,
                      'Sichem': 1,
                      'Siddim': 3,
                      'Simeon': 1,
                      'Sod': 1,
                      'Sodom': 10,
                      'Temani': 1,
                      'Terah': 2,
                      'Tubalcain': 1,
                      'Ur': 1,
                      'Zeboiim': 2,
                      'Zebulun': 1,
                      'Zerah': 1,
                      'Zibeon': 4,
                      'Zilpah': 3,
                      'Zoar': 1,
                      'Zohar': 2,
                      'a': 18,
                      'activity': 1,
                      'all': 36,
                      'any': 1,
                      'bakemeats': 1,
                      'beast': 1,
                      'beasts': 3,
                      'blessing': 1,
                      'bread': 1,
                      'cattle': 4,
                      'commanding': 1,
                      'corn': 2,
                      'cruelty': 1,
                      'cursed': 1,
                      'da': 1,
                      'dearth': 1,
                      'dignity': 1,
                      'every': 15,
                      'famine': 5,
                      'fine': 2,
                      'five': 1,
                      'flocks': 1,
                      'for': 1,
                      'forty': 1,
                      'fowl': 2,
                      'fowls': 1,
                      'gold': 2,
                      'good': 2,
                      'gopher': 1,
                      'grap': 1,
                      'great': 2,
                      'green': 1,
                      'half': 1,
                      'heaven': 16,
                      'her': 9,
                      'herds': 1,
                      'him': 4,
                      'his': 74,
                      'it': 21,
                      'joint': 1,
                      'knowledge': 1,
                      'lentiles': 1,
                      'life': 9,
                      'man': 4,
                      'many': 5,
                      'me': 4,
                      'men': 3,
                      'millions': 1,
                      'mind': 1,
                      'money': 3,
                      'mourning': 1,
                      'my': 52,
                      'nations': 5,
                      'old': 1,
                      'one': 4,
                      'our': 7,
                      'peace': 1,
                      'people': 4,
                      'plenteousness': 1,
                      'pow': 1,
                      'raiment': 2,
                      'renown': 1,
                      'salt': 1,
                      'servan': 1,
                      'servants': 1,
                      'sheep': 2,
                      'ships': 1,
                      'silv': 2,
                      'silver': 4,
                      'skins': 1,
                      'slimepits': 1,
                      'springing': 1,
                      'sto': 1,
                      'such': 3,
                      'ten': 1,
                      'that': 12,
                      'the': 372,
                      'thee': 8,
                      'their': 14,
                      'them': 12,
                      'these': 1,
                      'thine': 2,
                      'this': 5,
                      'those': 3,
                      'thousands': 1,
                      'three': 3,
                      'thy': 25,
                      'time': 2,
                      'two': 1,
                      'us': 2,
                      'water': 8,
                      'waters': 2,
                      'wheat': 1,
                      'which': 3,
                      'whom': 3,
                      'women': 2,
                      'years': 1,
                      'you': 5,
                      'your': 13}),
             'winter': Counter({',': 1}),
             'walking': Counter({'in': 1}),
             'commanded': Counter({',': 1,
                      '.': 1,
                      'Noah': 1,
                      'h': 1,
                      'he': 1,
                      'him': 3,
                      'his': 2,
                      'the': 3,
                      'thee': 2,
                      'them': 2,
                      'to': 1}),
             'Aran': Counter({'.': 1}),
             'sit': Counter({'and': 1}),
             'large': Counter({'enough': 1}),
             'Shaul': Counter({'the': 1}),
             'Kenites': Counter({',': 1}),
             'face': Counter({',': 10,
                      '.': 2,
                      ':': 1,
                      ';': 2,
                      'no': 1,
                      'of': 24,
                      'shall': 1,
                      'shalt': 1,
                      'to': 2,
                      'toward': 2,
                      'unto': 1}),
             'That': Counter({'I': 1,
                      'be': 1,
                      'he': 1,
                      'in': 1,
                      'is': 1,
                      'she': 1,
                      'the': 1,
                      'these': 1,
                      'thou': 1,
                      'which': 1,
                      'ye': 1}),
             'Bow': Counter({'the': 1}),
             'commended': Counter({'her': 1}),
             'denied': Counter({',': 1}),
             'drew': Counter({'and': 1,
                      'back': 1,
                      'for': 1,
                      'near': 1,
                      'nigh': 1,
                      'wat': 1}),
             'woman': Counter({"'": 1,
                      ',': 5,
                      '.': 1,
                      'he': 1,
                      'said': 2,
                      'saw': 1,
                      'that': 1,
                      'to': 1,
                      'was': 1,
                      'which': 1,
                      'whom': 2,
                      'will': 3}),
             'curseth': Counter({'th': 1, 'thee': 1}),
             'Ishmael': Counter({"'": 1,
                      ',': 7,
                      '.': 1,
                      ';': 1,
                      'Abraham': 1,
                      'buried': 1,
                      'his': 3,
                      'might': 1,
                      'to': 1}),
             'ribs': Counter({',': 1}),
             'abomination': Counter({'unto': 2}),
             'lie': Counter({'by': 1, 'with': 5}),
             'descending': Counter({'on': 1}),
             'Abimelech': Counter({"'": 1,
                      ',': 2,
                      ';': 1,
                      'and': 1,
                      'because': 1,
                      'called': 2,
                      'charged': 1,
                      'had': 1,
                      'in': 1,
                      'king': 3,
                      'rose': 2,
                      'said': 6,
                      'took': 1,
                      'went': 1}),
             'Say': Counter({',': 1, 'unto': 1}),
             'tillest': Counter({'the': 1}),
             'kings': Counter({'of': 2, 'shall': 2, 'that': 3, 'with': 1}),
             'guiding': Counter({'his': 1}),
             'Eri': Counter({',': 1}),
             'same': Counter({'.': 1,
                      'be': 2,
                      'became': 1,
                      'day': 4,
                      'is': 8,
                      'night': 2,
                      'red': 1,
                      'shall': 1,
                      'words': 1,
                      'year': 1}),
             'turtledove': Counter({',': 1}),
             'longeth': Counter({'for': 1}),
             'Zaavan': Counter({',': 1}),
             'provender': Counter({'.': 1, 'enough': 1, 'for': 1, 'in': 1}),
             'piece': Counter({'one': 1}),
             'worthy': Counter({'of': 1}),
             'elders': Counter({'of': 2}),
             'troop': Counter({'cometh': 1, 'shall': 1}),
             'knoweth': Counter({'that': 1}),
             'hardly': Counter({'with': 1}),
             'sle': Counter({'and': 1}),
             'bands': Counter({'.': 1, ';': 1}),
             'seventh': Counter({'day': 3, 'month': 1}),
             'Even': Counter({'as': 1, 'by': 1}),
             'strakes': Counter({'in': 1}),
             'art': Counter({',': 1,
                      'a': 4,
                      'but': 1,
                      'commanded': 1,
                      'cursed': 1,
                      'even': 1,
                      'gone': 1,
                      'he': 1,
                      'his': 1,
                      'much': 1,
                      'my': 4,
                      'northward': 1,
                      'now': 1,
                      'our': 1,
                      'thou': 9,
                      'with': 1,
                      'yet': 1}),
             'husbandman': Counter({',': 1}),
             'the': Counter({'Adullamite': 2,
                      'Almighty': 2,
                      'Amalekites': 1,
                      'Amorite': 3,
                      'Amorites': 3,
                      'Arkite': 1,
                      'Arvadite': 1,
                      'Canaanite': 2,
                      'Canaanites': 7,
                      'Chaldees': 3,
                      'Edomites': 2,
                      'Egyptia': 1,
                      'Egyptian': 5,
                      'Egyptians': 12,
                      'Emins': 1,
                      'Gentiles': 1,
                      'Girgashites': 1,
                      'Girgasite': 1,
                      'God': 17,
                      'Hamathite': 1,
                      'Hebrew': 1,
                      'Hebrews': 2,
                      'Hitti': 1,
                      'Hittite': 7,
                      'Hittites': 1,
                      'Hivite': 3,
                      'Horite': 1,
                      'Horites': 3,
                      'Ishmeelites': 3,
                      'Jebusite': 1,
                      'Jebusites': 1,
                      'Judge': 1,
                      'Kadmonites': 1,
                      'Kenizzites': 1,
                      'LO': 4,
                      'LORD': 154,
                      'Midianites': 1,
                      'Moabites': 1,
                      'Perizzit': 1,
                      'Perizzite': 1,
                      'Perizzites': 1,
                      'Philistines': 7,
                      'Rephaims': 2,
                      'Sinite': 1,
                      'Spirit': 2,
                      'Syrian': 5,
                      'Zemarite': 1,
                      'Zuzims': 1,
                      'a': 1,
                      'air': 8,
                      'altar': 3,
                      'and': 3,
                      'angel': 8,
                      'angels': 3,
                      'anguish': 1,
                      'ark': 24,
                      'arms': 1,
                      'ass': 2,
                      'asses': 1,
                      'audience': 3,
                      'baker': 1,
                      'bakers': 1,
                      'bank': 1,
                      'basket': 1,
                      'beast': 1,
                      'bed': 3,
                      'beginning': 5,
                      'best': 3,
                      'birds': 3,
                      'blame': 2,
                      'blessed': 1,
                      'blessing': 3,
                      'blessings': 1,
                      'blood': 3,
                      'bondwoman': 1,
                      'book': 1,
                      'border': 1,
                      'borders': 2,
                      'bottle': 2,
                      'bow': 2,
                      'boys': 1,
                      'bracelets': 1,
                      'bread': 2,
                      'breadth': 2,
                      'breaking': 1,
                      'breasts': 1,
                      'breath': 4,
                      'brink': 1,
                      'brook': 1,
                      'brother': 2,
                      'brown': 3,
                      'bundles': 1,
                      'burnt': 2,
                      'butler': 2,
                      'butlers': 1,
                      'calf': 1,
                      'camel': 2,
                      'camels': 9,
                      'captain': 4,
                      'carcases': 1,
                      'cattle': 16,
                      'cave': 10,
                      'chief': 14,
                      'child': 5,
                      'children': 30,
                      'choice': 2,
                      'citi': 1,
                      'cities': 6,
                      'city': 23,
                      'clo': 1,
                      'cloud': 2,
                      'clusters': 1,
                      'coat': 2,
                      'commandment': 1,
                      'company': 1,
                      'concubines': 1,
                      'cool': 1,
                      'corn': 3,
                      'countenance': 1,
                      'country': 8,
                      'covenant': 3,
                      'covering': 1,
                      'creeping': 2,
                      'crown': 1,
                      'cry': 3,
                      'cup': 5,
                      'custom': 1,
                      'd': 1,
                      'damsel': 8,
                      'darkne': 1,
                      'darkness': 2,
                      'daughers': 2,
                      'daughter': 25,
                      'daughters': 14,
                      'day': 14,
                      'days': 26,
                      'dearth': 1,
                      'death': 3,
                      'deep': 3,
                      'dew': 2,
                      'doer': 1,
                      'dominion': 1,
                      'door': 9,
                      'dove': 4,
                      'dread': 1,
                      'dream': 2,
                      'dreams': 1,
                      'drought': 1,
                      'droves': 1,
                      'dry': 3,
                      'dukes': 8,
                      'dunge': 1,
                      'dungeon': 1,
                      'dust': 4,
                      'ea': 1,
                      'ear': 7,
                      'earring': 2,
                      'ears': 1,
                      'earth': 105,
                      'east': 12,
                      'edge': 1,
                      'elder': 3,
                      'elders': 2,
                      'eldest': 1,
                      'eleven': 1,
                      'end': 4,
                      'evening': 10,
                      'eventide': 1,
                      'everlasting': 3,
                      'evil': 2,
                      'excellency': 2,
                      'eyes': 9,
                      'face': 23,
                      'faces': 1,
                      'fame': 1,
                      'families': 3,
                      'famine': 15,
                      'fashion': 1,
                      'fat': 2,
                      'father': 13,
                      'fatness': 2,
                      'fear': 4,
                      'feebler': 1,
                      'female': 2,
                      'fie': 2,
                      'field': 39,
                      'fifth': 5,
                      'fifty': 2,
                      'fir': 1,
                      'fire': 2,
                      'firmame': 1,
                      'firmament': 6,
                      'first': 12,
                      'firstborn': 10,
                      'firstlings': 1,
                      'fish': 2,
                      'fishes': 1,
                      'flesh': 5,
                      'flo': 1,
                      'floc': 1,
                      'flock': 8,
                      'flocks': 10,
                      'flood': 7,
                      'floor': 1,
                      'folk': 1,
                      'food': 4,
                      'ford': 1,
                      'foremost': 1,
                      'former': 1,
                      'fountain': 1,
                      'fountains': 1,
                      'fourteenth': 1,
                      'fourth': 3,
                      'fowl': 5,
                      'fowls': 2,
                      'frost': 1,
                      'fruit': 7,
                      'gard': 1,
                      'garden': 12,
                      'garments': 1,
                      'gate': 9,
                      'gathering': 2,
                      'generations': 11,
                      'ghost': 4,
                      'goa': 1,
                      'goats': 4,
                      'gold': 1,
                      'good': 3,
                      'goods': 4,
                      'governor': 1,
                      'grapes': 1,
                      'grave': 4,
                      'gray': 1,
                      'great': 2,
                      'greater': 1,
                      'green': 1,
                      'ground': 24,
                      'guard': 6,
                      'gutters': 2,
                      'hand': 14,
                      'handmaidens': 1,
                      'handmaids': 1,
                      'hands': 4,
                      'harlot': 2,
                      'harp': 1,
                      'haven': 1,
                      'hazel': 1,
                      'he': 1,
                      'head': 4,
                      'heap': 1,
                      'hearth': 1,
                      'heat': 1,
                      'heaven': 7,
                      'heavens': 3,
                      'height': 1,
                      'herb': 2,
                      'herd': 1,
                      'herdmen': 3,
                      'herds': 1,
                      'high': 1,
                      'hollow': 4,
                      'horse': 1,
                      'host': 1,
                      'house': 27,
                      'hundred': 1,
                      'ill': 2,
                      'image': 2,
                      'images': 3,
                      'imagination': 1,
                      'increase': 1,
                      'inhabitants': 3,
                      'iniquity': 3,
                      'inn': 2,
                      'integrity': 2,
                      'interpretation': 5,
                      'isles': 1,
                      'keeper': 2,
                      'kid': 1,
                      'kids': 1,
                      'kindness': 1,
                      'king': 15,
                      'kings': 4,
                      'kn': 1,
                      'knife': 1,
                      'knowledge': 1,
                      'la': 1,
                      'labour': 1,
                      'lad': 17,
                      'lads': 1,
                      'lamb': 1,
                      'lambs': 1,
                      'land': 156,
                      'language': 1,
                      'last': 2,
                      'lean': 1,
                      'least': 2,
                      'left': 4,
                      'length': 1,
                      'lesser': 1,
                      'life': 7,
                      'light': 4,
                      'likeness': 1,
                      'living': 1,
                      'lord': 2,
                      'loss': 1,
                      'love': 1,
                      'm': 1,
                      'magicians': 2,
                      'male': 4,
                      'males': 1,
                      'man': 34,
                      'manner': 2,
                      'men': 36,
                      'merchant': 1,
                      'mercies': 1,
                      'messengers': 1,
                      'midst': 6,
                      'midwife': 2,
                      'mighty': 2,
                      'money': 6,
                      'month': 5,
                      'moon': 1,
                      'more': 2,
                      'morning': 20,
                      'morrow': 1,
                      'most': 4,
                      'mother': 3,
                      'mou': 1,
                      'mount': 6,
                      'mountain': 4,
                      'mountains': 4,
                      'mourning': 1,
                      'mouth': 2,
                      'moving': 1,
                      'mules': 1,
                      'nakedness': 4,
                      'name': 42,
                      'names': 5,
                      'nations': 4,
                      'neck': 1,
                      'next': 1,
                      'nig': 1,
                      'night': 3,
                      'north': 1,
                      'oak': 1,
                      'oath': 1,
                      'old': 1,
                      'one': 6,
                      'onyx': 1,
                      'open': 1,
                      'other': 6,
                      'overthrow': 1,
                      'path': 1,
                      'people': 15,
                      'persons': 2,
                      'physicians': 2,
                      'pillar': 2,
                      'pit': 4,
                      'place': 22,
                      'plain': 10,
                      'plains': 1,
                      'plenty': 2,
                      'point': 1,
                      'portion': 1,
                      'possessor': 1,
                      'power': 1,
                      'presence': 8,
                      'present': 4,
                      'prey': 2,
                      'priest': 1,
                      'priests': 3,
                      'prison': 8,
                      'prisoners': 1,
                      'rain': 2,
                      'ram': 1,
                      'rams': 3,
                      'rest': 1,
                      'rib': 1,
                      'riches': 1,
                      'right': 4,
                      'righteous': 3,
                      'ringstraked': 1,
                      'riv': 1,
                      'river': 9,
                      'rods': 5,
                      'ruler': 1,
                      'sack': 1,
                      'salt': 1,
                      'same': 18,
                      'sand': 3,
                      'savoury': 1,
                      'saying': 1,
                      'scarlet': 1,
                      'sea': 7,
                      'seas': 1,
                      'second': 10,
                      'seed': 1,
                      'selfsame': 3,
                      'serpent': 4,
                      'servant': 9,
                      'servants': 2,
                      'service': 1,
                      'set': 1,
                      'seven': 14,
                      'seventeenth': 2,
                      'seventh': 4,
                      'shadow': 1,
                      'she': 1,
                      'sheep': 8,
                      'shepherd': 1,
                      'shrubs': 1,
                      'side': 1,
                      'sight': 7,
                      'signet': 1,
                      'silver': 2,
                      'sinew': 2,
                      'sister': 3,
                      'six': 2,
                      'sixth': 2,
                      'skins': 1,
                      'slain': 1,
                      'slaughter': 1,
                      'smell': 3,
                      'smoke': 2,
                      'smooth': 1,
                      'sole': 1,
                      'son': 19,
                      'sons': 69,
                      'sou': 1,
                      'souls': 7,
                      'south': 5,
                      'space': 1,
                      'speckled': 1,
                      'spirit': 1,
                      'spoil': 1,
                      'spotted': 1,
                      'stars': 4,
                      'stead': 1,
                      'steward': 3,
                      'stone': 6,
                      'stones': 1,
                      'storehouses': 1,
                      'strange': 2,
                      'stranger': 1,
                      'street': 1,
                      'stronger': 2,
                      'sun': 5,
                      'sweat': 1,
                      'sword': 2,
                      'tenor': 1,
                      'tent': 6,
                      'tenth': 3,
                      'tents': 1,
                      'terror': 1,
                      'thigh': 2,
                      'thin': 1,
                      'thing': 6,
                      'third': 9,
                      'thirteenth': 1,
                      'thoughts': 1,
                      'three': 2,
                      'threshingfloor': 1,
                      'throne': 1,
                      'tidings': 1,
                      'time': 9,
                      'token': 2,
                      'top': 2,
                      'tops': 1,
                      'tower': 2,
                      'tr': 1,
                      'tree': 12,
                      'trees': 3,
                      'trespass': 2,
                      'tribes': 1,
                      'trough': 1,
                      'truth': 1,
                      'twelve': 1,
                      'two': 2,
                      'uncircumcised': 1,
                      'uppermost': 1,
                      'utmost': 1,
                      'vale': 4,
                      'valley': 3,
                      'vine': 2,
                      'virgin': 1,
                      'visions': 1,
                      'voice': 6,
                      'w': 1,
                      'wa': 1,
                      'wagons': 2,
                      'ward': 1,
                      'water': 1,
                      'watering': 1,
                      'waters': 30,
                      'way': 18,
                      'well': 18,
                      'wells': 2,
                      'west': 2,
                      'which': 5,
                      'white': 1,
                      'whole': 10,
                      'wick': 1,
                      'wicked': 2,
                      'wickedness': 1,
                      'wife': 3,
                      'wilderness': 7,
                      'window': 1,
                      'windows': 2,
                      'wine': 1,
                      'wise': 1,
                      'wives': 1,
                      'wo': 2,
                      'woman': 16,
                      'womb': 1,
                      'wombs': 1,
                      'women': 2,
                      'wood': 4,
                      'word': 3,
                      'words': 6,
                      'years': 7,
                      'young': 2,
                      'younge': 1,
                      'younger': 8,
                      'youngest': 4}),
             'riv': Counter({'And': 1}),
             'blessed': Counter({'.': 3,
                      ';': 2,
                      'Abraham': 1,
                      'Jacob': 1,
                      'Joseph': 1,
                      'Noah': 1,
                      'Pharaoh': 2,
                      'Rebekah': 1,
                      'be': 2,
                      'h': 1,
                      'him': 10,
                      'his': 1,
                      'in': 1,
                      'me': 2,
                      'my': 1,
                      'of': 2,
                      'th': 1,
                      'the': 3,
                      'thee': 1,
                      'them': 6}),
             'birthday': Counter({',': 1}),
             'obeisance': Counter({'.': 1, 'to': 2}),
             'overdrive': Counter({'them': 1}),
             'greater': Counter({'in': 1, 'light': 1, 'than': 3}),
             'Wherefore': Counter({'come': 1,
                      'dealt': 1,
                      'did': 1,
                      'didst': 1,
                      'have': 1,
                      'he': 1,
                      'is': 1,
                      'look': 1,
                      'saith': 1,
                      'shall': 1,
                      'she': 1,
                      'the': 1}),
             'delight': Counter({'in': 1}),
             'dew': Counter({'of': 2}),
             'Kiriathaim': Counter({',': 1}),
             'help': Counter({'meet': 2, 'thee': 1}),
             'think': Counter({'on': 1}),
             'could': Counter({'declare': 1,
                      'interpret': 1,
                      'not': 8,
                      'we': 1}),
             'Bless': Counter({'me': 1}),
             'Moabites': Counter({'unto': 1}),
             'audience': Counter({'of': 3}),
             'rich': Counter({'in': 1}),
             'stopped': Counter({',': 1, 'them': 2}),
             'ev': Counter({'For': 1, 'Therefore': 1, 'and': 2}),
             'marvelled': Counter({'one': 1}),
             'aloud': Counter({':': 1}),
             'dealt': Counter({'graciously': 1, 'hardly': 1, 'ye': 1}),
             'Sell': Counter({'me': 1}),
             'uncircumcised': Counter({';': 1, 'man': 1}),
             'Leah': Counter({"'": 5,
                      ',': 7,
                      '.': 2,
                      ':': 1,
                      ';': 1,
                      'Zilpah': 1,
                      'also': 1,
                      'and': 1,
                      'answered': 1,
                      'conceived': 2,
                      'his': 2,
                      'said': 4,
                      'saw': 1,
                      'to': 1,
                      'was': 2,
                      'went': 1}),
             '?)': Counter({'and': 1}),
             'perceived': Counter({'not': 2}),
             'here': Counter({';': 1,
                      'I': 1,
                      'also': 1,
                      'am': 1,
                      'any': 1,
                      'before': 1,
                      'by': 2,
                      'is': 1,
                      'looked': 1,
                      'with': 2}),
             'ran': Counter({',': 1,
                      'again': 1,
                      'and': 1,
                      'out': 1,
                      'to': 4,
                      'unto': 1}),
             'wagons': Counter({',': 1, 'out': 1, 'which': 2}),
             'instruments': Counter({'of': 1}),
             'Babel': Counter({',': 1, ';': 1}),
             'meant': Counter({'it': 1}),
             'shield': Counter({',': 1}),
             'steal': Counter({'away': 1, 'out': 1}),
             'Mash': Counter({'.': 1}),
             'according': Counter({'as': 6, 'to': 23, 'unto': 3}),
             'Zohar': Counter({',': 2, 'the': 1}),
             'discern': Counter({'thou': 1}),
             'burnt': Counter({'.': 1,
                      'offeri': 1,
                      'offering': 5,
                      'offerings': 1}),
             'son': Counter({"'": 12,
                      ',': 35,
                      '.': 22,
                      ':': 4,
                      ';': 10,
                      '?': 3,
                      'Abram': 1,
                      'Esau': 3,
                      'Isaac': 5,
                      'Jacob': 1,
                      'Joseph': 3,
                      'Shechem': 1,
                      'again': 1,
                      'al': 1,
                      'also': 2,
                      'be': 1,
                      'came': 1,
                      'from': 2,
                      'had': 1,
                      'hearkened': 1,
                      'in': 4,
                      'indeed': 1,
                      'is': 2,
                      'many': 1,
                      'mourning': 1,
                      'of': 25,
                      'shall': 1,
                      'that': 1,
                      'thither': 2,
                      'to': 1,
                      'was': 1,
                      'were': 1,
                      'with': 1}),
             'Remain': Counter({'a': 1}),
             'enter': Counter({'into': 1}),
             'Pathrusim': Counter({',': 1}),
             'eaten': Counter({',': 2, '.': 1, 'of': 3, 'them': 2, 'up': 1}),
             'Tubal': Counter({',': 1}),
             'morrow': Counter({',': 1}),
             'for': Counter({',': 1,
                      'Adam': 1,
                      'Er': 1,
                      'Esau': 1,
                      'God': 3,
                      'I': 18,
                      'Ishmael': 1,
                      'Joseph': 1,
                      'Manasseh': 1,
                      'Phara': 1,
                      'Pharaoh': 2,
                      'Rachel': 3,
                      'Rebekah': 1,
                      'Sarah': 1,
                      'Sarai': 1,
                      'a': 15,
                      'age': 1,
                      'all': 5,
                      'am': 1,
                      'an': 8,
                      'as': 2,
                      'badne': 1,
                      'because': 1,
                      'bondmen': 1,
                      'bre': 1,
                      'bread': 1,
                      'days': 1,
                      'dust': 1,
                      'ev': 2,
                      'ever': 2,
                      'every': 1,
                      'food': 4,
                      'forty': 1,
                      'good': 1,
                      'harm': 1,
                      'he': 15,
                      'her': 4,
                      'him': 7,
                      'his': 11,
                      'horses': 1,
                      'if': 1,
                      'in': 5,
                      'it': 5,
                      'lack': 1,
                      'lights': 1,
                      'man': 1,
                      'me': 8,
                      'meat': 1,
                      'mine': 1,
                      'morter': 1,
                      'multitude': 2,
                      'my': 7,
                      'nought': 1,
                      'now': 1,
                      'out': 2,
                      'perpetual': 1,
                      'righteousness': 1,
                      'seasons': 1,
                      'seed': 1,
                      'servants': 1,
                      'she': 8,
                      'signs': 1,
                      'so': 1,
                      'spies': 1,
                      'stone': 1,
                      'store': 1,
                      'surely': 1,
                      'ten': 1,
                      'that': 7,
                      'the': 43,
                      'thee': 3,
                      'their': 4,
                      'them': 4,
                      'therefore': 3,
                      'these': 1,
                      'they': 3,
                      'this': 1,
                      'thou': 3,
                      'thy': 13,
                      'to': 3,
                      'twenty': 2,
                      'unto': 1,
                      'us': 4,
                      'venison': 1,
                      'we': 2,
                      'whom': 1,
                      'why': 1,
                      'with': 1,
                      'wives': 1,
                      'yet': 1,
                      'you': 4,
                      'your': 6}),
             'prayed': Counter({'unto': 1}),
             'between': Counter({'Bethel': 1,
                      'God': 1,
                      'Kadesh': 2,
                      'Nineveh': 1,
                      'his': 2,
                      'me': 12,
                      'my': 1,
                      'the': 1,
                      'thee': 1,
                      'those': 1,
                      'thy': 1,
                      'two': 1}),
             'Ephra': Counter({'and': 2}),
             'sh': Counter({'for': 1}),
             'host': Counter({',': 1, ':': 1, 'of': 1, 'spake': 1}),
             'shewed': Counter({'Pharaoh': 1,
                      'him': 1,
                      'kindness': 1,
                      'me': 1,
                      'thee': 1,
                      'unto': 2}),
             'Shuni': Counter({',': 1}),
             'alive': Counter({',': 2,
                      '.': 5,
                      ':': 1,
                      '?': 2,
                      'upon': 1,
                      'with': 1}),
             'Anah': Counter({',': 3,
                      '.': 1,
                      ':': 1,
                      'that': 1,
                      'the': 2,
                      'were': 1}),
             'Zebulun': Counter({'.': 1, ';': 1, 'shall': 1}),
             'report': Counter({'.': 1}),
             'ring': Counter({'from': 1}),
             'appear': Counter({'which': 1}),
             'state': Counter({',': 1}),
             'Elon': Counter({',': 1, 'the': 2}),
             'Tola': Counter({',': 1}),
             'inhabitants': Counter({'of': 3}),
             'bough': Counter({',': 1, 'by': 1}),
             'assigned': Counter({'them': 1}),
             'Slay': Counter({'my': 1}),
             'bakers': Counter({'.': 1}),
             'Jordan': Counter({',': 2, '.': 1, ';': 2}),
             'ki': Counter({'And': 1, 'and': 4}),
             'belly': Counter({'shalt': 1}),
             'sewed': Counter({'fig': 1}),
             'moreover': Counter({',': 1, 'unto': 1}),
             'filled': Counter({'her': 1, 'the': 1, 'them': 1, 'with': 2}),
             'hearth': Counter({'.': 1}),
             'Until': Counter({'thy': 1}),
             'dungeon': Counter({'.': 1}),
             'bitter': Counter({'cry': 1}),
             'Not': Counter({'so': 1}),
             'Sarah': Counter({"'": 2,
                      ',': 3,
                      '.': 2,
                      'Abraham': 1,
                      'after': 1,
                      'as': 2,
                      'bare': 1,
                      'conceived': 1,
                      'denied': 1,
                      'died': 1,
                      'hath': 1,
                      'he': 1,
                      'heard': 1,
                      'his': 5,
                      'laugh': 1,
                      'laughed': 1,
                      'my': 1,
                      'said': 1,
                      'saw': 1,
                      'shall': 3,
                      'should': 1,
                      'thy': 3,
                      'was': 1,
                      'were': 1}),
             'fruit': Counter({',': 1,
                      'after': 1,
                      'of': 5,
                      'thereof': 1,
                      'tree': 1}),
             'Naaman': Counter({',': 1}),
             'Shobal': Counter({',': 2, 'were': 1}),
             'habitations': Counter({'.': 1, 'in': 1}),
             'befell': Counter({'unto': 1}),
             'told': Counter({'Abraham': 1,
                      'Abram': 1,
                      'Isaac': 1,
                      'Jacob': 1,
                      'Joseph': 1,
                      'Judah': 1,
                      'Laban': 2,
                      'Pharaoh': 1,
                      'Rachel': 1,
                      'Tamar': 1,
                      'all': 1,
                      'her': 1,
                      'him': 10,
                      'his': 2,
                      'it': 3,
                      'mine': 1,
                      'thee': 1,
                      'them': 2,
                      'this': 1,
                      'to': 1}),
             'they': Counter({'and': 3,
                      'answered': 1,
                      'are': 2,
                      'bare': 2,
                      'begin': 1,
                      'blessed': 1,
                      'boug': 1,
                      'bowed': 3,
                      'brought': 6,
                      'buried': 2,
                      'called': 3,
                      'came': 10,
                      'can': 1,
                      'captive': 1,
                      'chose': 1,
                      'come': 1,
                      'communed': 1,
                      'conspired': 1,
                      'continued': 1,
                      'could': 1,
                      'cried': 1,
                      'delivered': 1,
                      'departed': 2,
                      'did': 10,
                      'digged': 2,
                      'drank': 1,
                      'dreamed': 1,
                      'drew': 1,
                      'dwelt': 2,
                      'embalmed': 1,
                      'emptied': 1,
                      'fed': 2,
                      'feed': 1,
                      'fell': 1,
                      'for': 1,
                      'found': 1,
                      'gave': 1,
                      'give': 1,
                      'had': 13,
                      'hated': 3,
                      'have': 8,
                      'heard': 3,
                      'joined': 1,
                      'journeyed': 3,
                      'knew': 3,
                      'laded': 1,
                      'lay': 1,
                      'left': 2,
                      'lifted': 1,
                      'made': 4,
                      'may': 4,
                      'might': 3,
                      'mourned': 1,
                      'overtook': 1,
                      'pressed': 1,
                      'rebelled': 1,
                      'rent': 1,
                      'returned': 2,
                      'rode': 1,
                      'roll': 1,
                      'rolled': 1,
                      'rose': 3,
                      'said': 30,
                      'sat': 2,
                      'saw': 2,
                      'seemed': 1,
                      'sent': 4,
                      'separated': 1,
                      'served': 1,
                      'set': 1,
                      'sewed': 1,
                      'shall': 10,
                      'should': 3,
                      'slew': 2,
                      'smote': 1,
                      'sold': 1,
                      'spake': 1,
                      'speedily': 1,
                      'stript': 1,
                      'strove': 2,
                      'sware': 1,
                      'that': 3,
                      'told': 1,
                      'took': 8,
                      'turned': 1,
                      'under': 1,
                      'washed': 1,
                      'watered': 1,
                      'wearied': 1,
                      'went': 6,
                      'wept': 1,
                      'were': 18,
                      'will': 4}),
             'done': Counter({',': 1,
                      '.': 4,
                      '?': 3,
                      'according': 1,
                      'altogether': 1,
                      'deeds': 1,
                      'drinking': 2,
                      'evil': 1,
                      'foolishly': 1,
                      'giving': 1,
                      'in': 1,
                      'nothing': 1,
                      'speaking': 2,
                      'that': 1,
                      'thee': 1,
                      'this': 5,
                      'to': 1,
                      'unto': 8}),
             'Edar': Counter({'.': 1}),
             'Korah': Counter({',': 1, '.': 1, ':': 1}),
             'find': Counter({'grace': 5,
                      'her': 1,
                      'him': 1,
                      'in': 1,
                      'such': 1,
                      'the': 1,
                      'there': 1,
                      'thirty': 1}),
             'may': Counter({'also': 1,
                      'be': 4,
                      'bless': 5,
                      'breed': 1,
                      'bring': 1,
                      'bury': 1,
                      'drink': 1,
                      'dwell': 1,
                      'eat': 4,
                      'feel': 1,
                      'find': 1,
                      'fly': 1,
                      'give': 1,
                      'go': 3,
                      'judge': 1,
                      'know': 1,
                      'live': 3,
                      'not': 2,
                      'obtain': 1,
                      'preserve': 2,
                      'reach': 1,
                      'remember': 1,
                      'seek': 1,
                      'send': 1,
                      'set': 1,
                      'tell': 1,
                      'turn': 1}),
             'The': Counter({'Angel': 1,
                      'God': 1,
                      'Hebrew': 1,
                      'Kenites': 1,
                      'LORD': 4,
                      'archers': 1,
                      'blessings': 1,
                      'child': 1,
                      'children': 4,
                      'days': 2,
                      'dream': 1,
                      'earth': 1,
                      'end': 1,
                      'field': 1,
                      'fountains': 1,
                      'keeper': 1,
                      'lad': 1,
                      'land': 1,
                      'length': 1,
                      'man': 3,
                      'name': 1,
                      'princes': 1,
                      'purchase': 1,
                      'ringstraked': 1,
                      'sceptre': 1,
                      'serpent': 1,
                      'seven': 1,
                      'sons': 5,
                      'soul': 1,
                      'speckled': 1,
                      'sun': 1,
                      'thing': 1,
                      'three': 2,
                      'voice': 1,
                      'water': 1,
                      'woman': 1}),
             'doeth': Counter({'unto': 1}),
             'garments': Counter({'in': 1, 'of': 1, 'off': 1}),
             'hotly': Counter({'pursued': 1}),
             'Gad': Counter({',': 2, '.': 1, ';': 1}),
             'lingered': Counter({',': 2}),
             'brick': Counter({',': 1, 'for': 1}),
             'Sodom': Counter({',': 6,
                      '.': 1,
                      ':': 1,
                      'and': 6,
                      'at': 1,
                      'fifty': 1,
                      'said': 1,
                      'went': 1,
                      'were': 1}),
             'portion': Counter({'.': 1,
                      'above': 1,
                      'assigned': 1,
                      'of': 1,
                      'or': 1,
                      'which': 1}),
             'parts': Counter({'shall': 1}),
             'Jacob': Counter({"'": 15,
                      ',': 26,
                      '.': 11,
                      ':': 5,
                      ';': 2,
                      '?': 1,
                      'a': 4,
                      'again': 1,
                      'all': 1,
                      'and': 1,
                      'answered': 3,
                      'asked': 1,
                      'awaked': 1,
                      'because': 1,
                      'beheld': 1,
                      'blessed': 2,
                      'buried': 1,
                      'called': 4,
                      'came': 6,
                      'did': 2,
                      'dwelt': 1,
                      'either': 2,
                      'fed': 1,
                      'gave': 1,
                      'had': 2,
                      'hath': 1,
                      'heard': 1,
                      'held': 1,
                      'her': 3,
                      'hid': 1,
                      'his': 2,
                      'in': 1,
                      'into': 1,
                      'is': 1,
                      'journeyed': 1,
                      'kissed': 1,
                      'knew': 1,
                      'laid': 1,
                      'lifted': 1,
                      'lived': 1,
                      'loved': 1,
                      'no': 1,
                      'obeyed': 1,
                      'offered': 1,
                      'rent': 1,
                      'rose': 3,
                      'said': 16,
                      'saith': 1,
                      'saw': 3,
                      'sent': 3,
                      'served': 1,
                      'set': 2,
                      'sod': 1,
                      'stole': 1,
                      'sware': 1,
                      'take': 1,
                      'the': 2,
                      'their': 5,
                      'to': 2,
                      'told': 1,
                      'took': 2,
                      'vowed': 1,
                      'was': 7,
                      'went': 6,
                      'were': 1}),
             'Resen': Counter({'between': 1}),
             'rained': Counter({'upon': 1}),
             'wombs': Counter({'of': 1}),
             'having': Counter({'Bethel': 1}),
             'Chaldees': Counter({',': 2, '.': 1}),
             'law': Counter({',': 5,
                      '.': 1,
                      '.)': 1,
                      'goeth': 1,
                      'hath': 1,
                      'over': 1}),
             'Asher': Counter({'.': 1, ':': 1, ';': 1, 'his': 1}),
             'barr': Counter({'and': 1}),
             'Mezahab': Counter({'.': 1}),
             'abroad': Counter({',': 2,
                      '.': 1,
                      'from': 1,
                      'to': 1,
                      'upon': 2}),
             'bury': Counter({'his': 2,
                      'me': 4,
                      'my': 4,
                      'therefore': 1,
                      'thy': 4}),
             'where': Counter({',': 1,
                      'Abraham': 1,
                      'God': 1,
                      'Joseph': 1,
                      'he': 5,
                      'his': 1,
                      'is': 2,
                      'it': 1,
                      'the': 1,
                      'there': 1,
                      'they': 1,
                      'thou': 3,
                      'to': 1}),
             'Dodanim': Counter({'.': 1}),
             'Ephron': Counter({';': 1,
                      'answered': 1,
                      'dwelt': 1,
                      'in': 1,
                      'the': 7,
                      'which': 1}),
             'observed': Counter({'the': 1}),
             'sixteen': Counter({'souls': 1}),
             'stalk': Counter({',': 2}),
             'rank': Counter({'and': 2}),
             'felt': Counter({'him': 1}),
             'sojourn': Counter({',': 1, 'in': 1, 'there': 1}),
             'sworn': Counter({',': 1}),
             'He': Counter({'if': 1, 'is': 3, 'that': 2, 'was': 1}),
             'lived': Counter({',': 2,
                      'after': 16,
                      'an': 6,
                      'five': 1,
                      'four': 1,
                      'in': 1,
                      'nine': 1,
                      'ninety': 1,
                      'seventy': 2,
                      'sixty': 2,
                      'thirty': 3,
                      'two': 1,
                      'were': 1}),
             'Beriah': Counter({',': 1, ';': 1}),
             'Ur': Counter({'of': 3}),
             'protest': Counter({'unto': 1}),
             'spoiled': Counter({'even': 1, 'the': 1}),
             'governor': Counter({'over': 2}),
             'hastened': Counter({'Lot': 1, 'into': 1}),
             'perish': Counter({'not': 1}),
             'feeding': Counter({'the': 1}),
             'fell': Counter({'.': 1,
                      'before': 1,
                      'down': 1,
                      'on': 3,
                      'there': 1,
                      'upon': 5}),
             'spee': Counter({'for': 1}),
             'lack': Counter({'five': 1, 'of': 1}),
             'Tidal': Counter({'king': 2}),
             'Sidon': Counter({',': 1, 'his': 1}),
             'Zo': Counter({'and': 1}),
             'maid': Counter({',': 2,
                      '.': 1,
                      ';': 1,
                      'Bilhah': 1,
                      'bare': 2,
                      'conceived': 1,
                      'for': 1,
                      'into': 1,
                      'is': 1,
                      'the': 1}),
             'couching': Counter({'down': 1}),
             'rewarded': Counter({'evil': 1}),
             'hate': Counter({'me': 1, 'them': 1, 'us': 1}),
             'forbid': Counter({'that': 2}),
             'hasted': Counter({',': 2, 'to': 1}),
             'Ephrath': Counter({',': 1, ';': 1}),
             'Jebusites': Counter({'.': 1}),
             'beari': Counter({'I': 1}),
             'weep': Counter({';': 1, 'for': 1}),
             'Obal': Counter({',': 1}),
             'too': Counter({'hard': 1}),
             'wherein': Counter({'his': 1,
                      'is': 2,
                      'there': 1,
                      'they': 1,
                      'thou': 3}),
             'reigned': Counter({'any': 1, 'in': 9}),
             'these': Counter({',': 2,
                      ';': 4,
                      '?': 1,
                      'are': 28,
                      'be': 1,
                      'before': 1,
                      'cattle': 1,
                      'children': 1,
                      'countries': 2,
                      'eight': 1,
                      'made': 1,
                      'men': 3,
                      'my': 1,
                      'same': 1,
                      'seven': 2,
                      'she': 1,
                      'things': 10,
                      'two': 2,
                      'unto': 1,
                      'were': 12,
                      'which': 1,
                      'wor': 1,
                      'words': 3}),
             'tower': Counter({',': 2, 'of': 1}),
             'cru': Counter({'I': 1}),
             'shear': Counter({'his': 2}),
             'Bedad': Counter({',': 1}),
             'Zemarite': Counter({',': 1}),
             'enquire': Counter({'at': 1, 'of': 1}),
             'language': Counter({',': 2, ';': 1, 'of': 1}),
             'book': Counter({'of': 1}),
             'every': Counter({'artificer': 1,
                      'beast': 10,
                      'bird': 1,
                      'city': 1,
                      'clean': 3,
                      'creeping': 6,
                      'drove': 1,
                      'fowl': 5,
                      'green': 1,
                      'herb': 2,
                      'imagination': 1,
                      'living': 12,
                      'm': 1,
                      'male': 4,
                      'man': 13,
                      'one': 6,
                      'place': 1,
                      'plant': 1,
                      'quart': 1,
                      'shepherd': 1,
                      'sort': 3,
                      'thing': 6,
                      'tree': 4,
                      'way': 1,
                      'where': 1,
                      'winged': 1}),
             'standest': Counter({'thou': 1}),
             'following': Counter({';': 1}),
             'droves': Counter({',': 1}),
             'therein': Counter({',': 5, '.': 2, ';': 1, '?': 1, 'was': 1}),
             'isles': Counter({'of': 1}),
             'rams': Counter({',': 1, 'of': 1, 'which': 2}),
             'married': Counter({'his': 1}),
             'plagued': Counter({'Pharaoh': 1}),
             'Ludim': Counter({',': 1}),
             'Midianites': Counter({'merchantmen': 1, 'sold': 1}),
             'oa': Counter({'only': 1}),
             'wherewith': Counter({'his': 1}),
             'bound': Counter({'.': 1,
                      'Isaac': 1,
                      'him': 1,
                      'in': 2,
                      'of': 1,
                      'up': 1,
                      'upon': 1}),
             'doing': Counter({'.': 2}),
             'counted': Counter({'it': 1, 'of': 1, 'stolen': 1}),
             'weighed': Counter({'to': 1}),
             'Kirjatharba': Counter({';': 1}),
             'bereaved': Counter({'.': 1, 'of': 2}),
             'above': Counter({',': 1,
                      ';': 2,
                      'all': 1,
                      'every': 1,
                      'it': 1,
                      'the': 4,
                      'thy': 1}),
             'Sabtech': Counter({'and': 1}),
             'mightier': Counter({'than': 1}),
             'breathed': Counter({'into': 1}),
             'give': Counter({'.': 1,
                      'I': 2,
                      'Pharaoh': 1,
                      'according': 1,
                      'all': 1,
                      'her': 3,
                      'his': 1,
                      'it': 8,
                      'light': 2,
                      'me': 9,
                      'not': 1,
                      'our': 2,
                      'seed': 1,
                      'the': 4,
                      'thee': 8,
                      'them': 2,
                      'this': 3,
                      'thy': 2,
                      'unto': 2,
                      'us': 1,
                      'you': 3,
                      'your': 1}),
             'quart': Counter({'And': 1}),
             'whereby': Counter({'indeed': 1, 'shall': 1}),
             'thirteenth': Counter({'year': 1}),
             'spicery': Counter({'and': 1}),
             'saidst': Counter({',': 1, 'thou': 2, 'unto': 3}),
             'deferred': Counter({'not': 1}),
             'Hiddekel': Counter({':': 1}),
             'Thus': Counter({'God': 1,
                      'I': 1,
                      'did': 1,
                      'dwelt': 1,
                      'have': 1,
                      'his': 1,
                      'saith': 1,
                      'shall': 1,
                      'spake': 1,
                      'the': 1,
                      'they': 1,
                      'were': 1}),
             'heart': Counter({',': 4,
                      '.': 1,
                      ';': 1,
                      'and': 1,
                      'failed': 1,
                      'fainted': 1,
                      'is': 1,
                      'was': 1}),
             'spi': Counter({'Hereby': 1}),
             'Blessed': Counter({'be': 3}),
             'eat': Counter({',': 6,
                      '.': 7,
                      ':': 1,
                      ';': 1,
                      '?': 1,
                      'all': 1,
                      'and': 3,
                      'bre': 1,
                      'bread': 5,
                      'not': 1,
                      'of': 10,
                      'the': 2,
                      'their': 1,
                      'them': 1,
                      'there': 1,
                      'thereof': 1,
                      'thy': 1,
                      'up': 2,
                      'with': 1}),
             'seeing': Counter({'I': 1,
                      'that': 2,
                      'the': 1,
                      'them': 1,
                      'thou': 1,
                      'ye': 1}),
             'harvest': Counter({',': 2, '.': 1}),
             'sorely': Counter({'grieved': 1}),
             'Kenaz': Counter({',': 2, '.': 1}),
             'sheepshearers': Counter({'to': 1}),
             'Hanoch': Counter({',': 2}),
             'far': Counter({'from': 2, 'off': 1}),
             'Gilead': Counter({'.': 3, 'with': 1}),
             'dreamer': Counter({'cometh': 1}),
             'Zibeon': Counter({',': 3, ';': 1, 'his': 1, 'the': 1}),
             '.': Counter({'A': 1,
                      'Abram': 1,
                      'After': 1,
                      'All': 4,
                      'Also': 2,
                      'And': 1038,
                      'Arise': 3,
                      'As': 1,
                      'Ask': 1,
                      'Behold': 5,
                      'Benjamin': 1,
                      'Binding': 1,
                      'Bring': 1,
                      'But': 23,
                      'By': 1,
                      'Come': 1,
                      'Cursed': 1,
                      'Dan': 2,
                      'Deliver': 1,
                      'Esau': 1,
                      'Every': 1,
                      'Except': 1,
                      'Fifteen': 1,
                      'For': 13,
                      'Fulfil': 1,
                      'Gad': 1,
                      'Gather': 1,
                      'Give': 1,
                      'Go': 2,
                      'God': 2,
                      'Haste': 2,
                      'He': 2,
                      'I': 4,
                      'If': 5,
                      'In': 5,
                      'Is': 1,
                      'Issachar': 1,
                      'It': 1,
                      'Joseph': 2,
                      'Judah': 2,
                      'Leah': 1,
                      'Let': 2,
                      'Make': 1,
                      'Moreover': 2,
                      'My': 1,
                      'Naphtali': 1,
                      'Neither': 1,
                      'Now': 24,
                      'O': 1,
                      'Of': 4,
                      'Only': 2,
                      'Out': 2,
                      'Reuben': 1,
                      'Say': 1,
                      'Send': 1,
                      'Shall': 1,
                      'She': 1,
                      'Simeon': 1,
                      'So': 17,
                      'Take': 1,
                      'That': 1,
                      'The': 15,
                      'Then': 34,
                      'There': 2,
                      'Therefore': 7,
                      'These': 16,
                      'They': 2,
                      'This': 4,
                      'Thus': 9,
                      'To': 1,
                      'Twelve': 1,
                      'Unto': 3,
                      'We': 1,
                      'When': 6,
                      'Wherefore': 2,
                      'While': 1,
                      'Whoso': 1,
                      'With': 1,
                      'Yet': 1,
                      'Zebulun': 1}),
             'shortly': Counter({'bring': 1}),
             'deliver': Counter({'Pharaoh': 1, 'him': 2, 'you': 1}),
             'doer': Counter({'of': 1}),
             'Jemuel': Counter({',': 1}),
             'Meshech': Counter({',': 1}),
             'Almighty': Counter({',': 1,
                      ':': 1,
                      'God': 1,
                      'appeared': 1,
                      'bless': 1,
                      'give': 1}),
             'walk': Counter({',': 2, 'before': 1, 'through': 1}),
             'wander': Counter({'from': 1}),
             'Sojourn': Counter({'in': 1}),
             'wroth': Counter({',': 3, '?': 1, 'against': 1, 'with': 1}),
             'dressed': Counter({',': 1}),
             'Sabtah': Counter({',': 1}),
             'renown': Counter({'.': 1}),
             'sea': Counter({',': 4, '.': 1, ';': 2, 'shore': 1}),
             'Behold': Counter({',': 40,
                      'my': 1,
                      'now': 9,
                      'the': 1,
                      'this': 1,
                      'thy': 1}),
             'thought': Counter({',': 1, 'evil': 1, 'her': 1, 'to': 1}),
             'ward': Counter({'.': 1, 'in': 2, 'of': 1, 'three': 1}),
             'reached': Counter({'to': 1}),
             'Casluhim': Counter({',': 1}),
             'on': Counter({',': 1,
                      '.': 1,
                      'Esau': 1,
                      'Peradventure': 1,
                      'a': 1,
                      'bread': 1,
                      'for': 1,
                      'her': 2,
                      'him': 2,
                      'his': 8,
                      'it': 1,
                      'me': 2,
                      'my': 3,
                      'softly': 1,
                      'still': 1,
                      'the': 28,
                      'them': 1,
                      'your': 1}),
             'open': Counter({'firmament': 1, 'place': 1}),
             'Our': Counter({'father': 1}),
             'grace': Counter({'in': 11}),
             'Shall': Counter({'I': 3, 'a': 1, 'not': 2}),
             'border': Counter({'of': 1, 'shall': 1}),
             'plant': Counter({'of': 1}),
             'draw': Counter({'for': 1, 'wat': 1, 'water': 4}),
             'lamentati': Counter({'and': 1}),
             'put': Counter({',': 1,
                      'a': 2,
                      'enmity': 1,
                      'every': 1,
                      'for': 1,
                      'forth': 3,
                      'her': 1,
                      'him': 2,
                      'his': 3,
                      'in': 1,
                      'into': 1,
                      'it': 1,
                      'me': 2,
                      'my': 1,
                      'on': 2,
                      'our': 1,
                      'out': 1,
                      'sackcloth': 1,
                      'the': 5,
                      'them': 7,
                      'thy': 1,
                      'to': 1}),
             'offended': Counter({'thee': 1, 'their': 1}),
             'bodies': Counter({',': 1}),
             'Yea': Counter({',': 2}),
             'nourished': Counter({'his': 1}),
             'Milcah': Counter({',': 5, 'bare': 1, 'did': 1}),
             'spread': Counter({'abroad': 2, 'his': 2}),
             'cold': Counter({'and': 1}),
             'nuts': Counter({',': 1}),
             'opened': Counter({',': 2,
                      '.': 1,
                      'all': 1,
                      'every': 1,
                      'her': 4,
                      'his': 1,
                      'our': 1,
                      'the': 1}),
             'leap': Counter({'upon': 1}),
             'honourable': Counter({'than': 1}),
             'feared': Counter({'to': 2}),
             'seemed': Counter({'as': 1, 'unto': 1}),
             'lade': Counter({'your': 1}),
             'Er': Counter({',': 2, '.': 1, 'and': 1, 'his': 1}),
             'They': Counter({',': 1,
                      'are': 2,
                      'be': 1,
                      'said': 1,
                      'took': 1}),
             'Naphtali': Counter({'.': 1, ':': 1, ';': 1, 'is': 1}),
             'everlasting': Counter({'God': 1,
                      'covenant': 4,
                      'hil': 1,
                      'possession': 2}),
             'rose': Counter({'early': 1, 'up': 19, 'upon': 1}),
             'former': Counter({'manner': 1}),
             'Phara': Counter({'and': 3, 'lo': 1}),
             'troughs': Counter({'when': 1}),
             'tar': Counter({'for': 1}),
             'street': Counter({'all': 1}),
             'rid': Counter({'him': 1}),
             'bow': Counter({',': 1,
                      '.': 1,
                      'abode': 1,
                      'down': 4,
                      'in': 1,
                      'sh': 1,
                      'shall': 2}),
             'issue': Counter({',': 1}),
             'leaf': Counter({'pluckt': 1}),
             'utmost': Counter({'bound': 1}),
             'him': Counter({',': 127,
                      '.': 56,
                      ':': 8,
                      ';': 19,
                      '?': 2,
                      'Benjamin': 1,
                      'Bilhah': 1,
                      'Rachel': 1,
                      'Sarah': 1,
                      'Zimran': 1,
                      'a': 10,
                      'according': 3,
                      'afar': 1,
                      'after': 1,
                      'all': 4,
                      'also': 2,
                      'an': 2,
                      'any': 1,
                      'appoint': 1,
                      'as': 2,
                      'at': 2,
                      'away': 2,
                      'before': 4,
                      'both': 1,
                      'but': 1,
                      'by': 3,
                      'concerning': 2,
                      'die': 1,
                      'down': 2,
                      'drink': 3,
                      'exceedingly': 1,
                      'favour': 1,
                      'fetch': 1,
                      'flocks': 1,
                      'for': 3,
                      'forth': 3,
                      'four': 1,
                      'from': 1,
                      'fruitful': 1,
                      'hastily': 1,
                      'hath': 1,
                      'he': 2,
                      'his': 1,
                      'in': 17,
                      'into': 11,
                      'mercy': 1,
                      'more': 1,
                      'no': 1,
                      'not': 8,
                      'of': 3,
                      'on': 2,
                      'out': 6,
                      'over': 1,
                      'overseer': 2,
                      'rods': 1,
                      'ruler': 1,
                      'seven': 1,
                      'sevenfold': 1,
                      'shall': 1,
                      'should': 1,
                      'six': 1,
                      'strangers': 1,
                      'that': 4,
                      'the': 5,
                      'there': 2,
                      'three': 1,
                      'threescore': 1,
                      'thy': 1,
                      'tithes': 1,
                      'to': 13,
                      'two': 1,
                      'until': 1,
                      'unto': 1,
                      'up': 2,
                      'went': 1,
                      'were': 1,
                      'wine': 1,
                      'with': 2,
                      'without': 1,
                      'yet': 4}),
             'current': Counter({'money': 1}),
             'fled': Counter({',': 3,
                      '.': 2,
                      'forth': 1,
                      'from': 2,
                      'out': 1,
                      'to': 1,
                      'with': 1}),
             'height': Counter({'of': 1}),
             'Dedan': Counter({'.': 2, 'were': 1}),
             'worth': Counter({'four': 1, 'he': 1}),
             'after': Counter({',': 2,
                      'Abram': 1,
                      'came': 1,
                      'he': 18,
                      'him': 5,
                      'his': 17,
                      'me': 1,
                      'my': 1,
                      'our': 1,
                      'seven': 1,
                      'th': 1,
                      'that': 6,
                      'the': 15,
                      'thee': 7,
                      'their': 14,
                      'them': 6,
                      'these': 5,
                      'this': 3,
                      'thy': 1,
                      'you': 1}),
             'thousands': Counter({'of': 1}),
             'overseer': Counter({'in': 1, 'over': 1}),
             'priest': Counter({'of': 4}),
             'slew': Counter({'.': 1, 'Hamor': 1, 'a': 1, 'all': 1, 'him': 3}),
             'Uzal': Counter({',': 1}),
             'iniquity': Counter({'of': 3}),
             'only': Counter({',': 1,
                      'bring': 1,
                      'evil': 1,
                      'in': 1,
                      'let': 1,
                      'obey': 1,
                      'remained': 1,
                      's': 1,
                      'son': 2,
                      'that': 1,
                      'their': 1,
                      'unto': 1}),
             'tenor': Counter({'of': 1}),
             'Elparan': Counter({',': 1}),
             'fulfilled': Counter({',': 2, 'for': 1, 'her': 1, 'the': 1}),
             'hands': Counter({',': 5,
                      '.': 2,
                      ';': 1,
                      'are': 1,
                      'have': 1,
                      'of': 4,
                      'to': 1,
                      'were': 2,
                      'wittingly': 1}),
             'Isui': Counter({',': 1}),
             'now': Counter({',': 17,
                      'I': 6,
                      'an': 1,
                      'arise': 1,
                      'art': 1,
                      'bone': 1,
                      'done': 1,
                      'empty': 1,
                      'he': 1,
                      'his': 1,
                      'if': 1,
                      'increased': 1,
                      'it': 1,
                      'leave': 1,
                      'nothing': 1,
                      'the': 2,
                      'then': 1,
                      'therefore': 4,
                      'thine': 2,
                      'thou': 1,
                      'thy': 2,
                      'to': 1,
                      'toward': 1,
                      'unto': 1,
                      'we': 1,
                      'when': 1,
                      'whether': 1,
                      'will': 2}),
             'Haggi': Counter({',': 1}),
             'wrong': Counter({'be': 1}),
             'reign': Counter({'over': 1}),
             'oak': Counter({'which': 1}),
             'A': Counter({'troop': 1, 'window': 1}),
             'Bilhah': Counter({',': 4,
                      'Rachel': 1,
                      'conceived': 1,
                      'her': 1,
                      'his': 2}),
             'Emins': Counter({'in': 1}),
             'thereon': Counter({',': 1, '.': 1}),
             'last': Counter({'.': 1, 'days': 1}),
             'grievous': Counter({'.': 1, ';': 1, 'in': 3, 'mourning': 1}),
             'proceedeth': Counter({'from': 1}),
             'Euphrat': Counter({'The': 1}),
             'torn': Counter({'in': 1, 'of': 1}),
             'lay': Counter({'down': 4,
                      'no': 1,
                      'up': 1,
                      'with': 5,
                      'yesternight': 1}),
             'themselv': Counter({'and': 1, 'because': 1}),
             'faces': Counter({'from': 1, 'of': 1, 'to': 1, 'were': 1}),
             'loss': Counter({'of': 1}),
             'afterward': Counter({'I': 1, 'came': 1, 'shall': 1, 'were': 1}),
             'tak': Counter({'for': 1}),
             'three': Counter({'.': 1,
                      'baskets': 1,
                      'branches': 2,
                      'da': 1,
                      'days': 5,
                      'flocks': 1,
                      'hundred': 6,
                      'measures': 1,
                      'men': 1,
                      'months': 1,
                      'so': 1,
                      'sons': 2,
                      'white': 1,
                      'wives': 1,
                      'years': 5}),
             'shut': Counter({'him': 1, 'the': 1, 'to': 1}),
             'itself': Counter({',': 2}),
             'nations': Counter({',': 1,
                      '.': 6,
                      ';': 2,
                      'are': 1,
                      'bow': 1,
                      'divided': 1,
                      'have': 1,
                      'of': 4,
                      'shall': 1}),
             'nati': Counter({'I': 1}),
             'rooms': Counter({'shalt': 1}),
             'Ajah': Counter({',': 1}),
             'direct': Counter({'his': 1}),
             'shed': Counter({':': 1}),
             'maiden': Counter({'to': 1}),
             'winged': Counter({'fowl': 1}),
             'dwell': Counter({';': 1,
                      'and': 1,
                      'at': 1,
                      'in': 9,
                      'the': 1,
                      'togeth': 1,
                      'together': 2,
                      'where': 1,
                      'with': 5}),
             'choice': Counter({'of': 1, 'vine': 1}),
             'Unstable': Counter({'as': 1}),
             'ford': Counter({'Jabbok': 1}),
             'twins': Counter({'in': 1, 'were': 1}),
             'stretched': Counter({'forth': 1, 'out': 1}),
             'prospered': Counter({'my': 1}),
             'token': Counter({'of': 4}),
             'beast': Counter({',': 4,
                      'after': 1,
                      'hath': 2,
                      'of': 11,
                      'thou': 1,
                      'will': 1}),
             'Gera': Counter({',': 1}),
             'nation': Counter({',': 4, '.': 2, '?': 1, 'and': 1}),
             'cloud': Counter({',': 1, ';': 1, 'over': 1}),
             'gro': Counter({'for': 1}),
             'west': Counter({',': 2}),
             'Egy': Counter({'And': 1, 'But': 1, 'come': 1}),
             'walketh': Counter({'in': 1}),
             'people': Counter({',': 7,
                      '.': 5,
                      ';': 4,
                      'alive': 1,
                      'be': 2,
                      'cried': 1,
                      'from': 1,
                      'give': 1,
                      'is': 1,
                      'might': 1,
                      'of': 5,
                      'serve': 1,
                      'shall': 3,
                      'that': 2}),
             'w': Counter({'and': 1, 'thus': 1}),
             'present': Counter({',': 2,
                      'against': 1,
                      'at': 1,
                      'for': 1,
                      'over': 1,
                      'sent': 1,
                      'that': 1,
                      'which': 1}),
             'comest': Counter({'to': 2, 'unto': 1}),
             'afterwards': Counter({'she': 1}),
             'unleavened': Counter({'bread': 1}),
             'as': Counter({'Ephraim': 1,
                      'God': 5,
                      'I': 5,
                      'Isaac': 1,
                      'Joseph': 3,
                      'Manass': 1,
                      'Nimrod': 1,
                      'Pharaoh': 2,
                      'Reuben': 1,
                      'a': 4,
                      'an': 1,
                      'any': 1,
                      'at': 1,
                      'before': 2,
                      'captives': 1,
                      'dwell': 1,
                      'for': 5,
                      'gods': 1,
                      'handle': 1,
                      'have': 1,
                      'he': 12,
                      'her': 1,
                      'his': 3,
                      'is': 1,
                      'it': 5,
                      'much': 2,
                      'one': 4,
                      'she': 1,
                      'soon': 2,
                      'the': 17,
                      'these': 1,
                      'they': 4,
                      'this': 1,
                      'thou': 9,
                      'though': 2,
                      'to': 1,
                      'touching': 1,
                      'water': 1,
                      'we': 3,
                      'with': 1,
                      'ye': 1}),
             'handle': Counter({'the': 1}),
             'treasure': Counter({'in': 1}),
             'be': Counter({',': 1,
                      '.': 3,
                      '?': 1,
                      'Abraham': 1,
                      'Abram': 1,
                      'Canaan': 1,
                      'God': 1,
                      'Pharaoh': 1,
                      'a': 15,
                      'able': 2,
                      'accepted': 1,
                      'according': 2,
                      'against': 1,
                      'alone': 1,
                      'an': 3,
                      'angry': 2,
                      'any': 1,
                      'as': 4,
                      'avenged': 1,
                      'before': 1,
                      'bereaved': 1,
                      'blameless': 1,
                      'blessed': 6,
                      'born': 1,
                      'bound': 1,
                      'brethren': 1,
                      'buried': 1,
                      'burnt': 1,
                      'called': 6,
                      'circumcis': 1,
                      'circumcised': 5,
                      'clean': 1,
                      'clear': 3,
                      'come': 1,
                      'comforted': 1,
                      'consumed': 2,
                      'counted': 1,
                      'cut': 2,
                      'delivered': 1,
                      'deprived': 1,
                      'desired': 1,
                      'destroyed': 1,
                      'done': 2,
                      'earing': 1,
                      'every': 1,
                      'far': 2,
                      'fat': 1,
                      'fetched': 1,
                      'fifty': 1,
                      'for': 8,
                      'forgotten': 1,
                      'forty': 1,
                      'found': 3,
                      'fruitful': 4,
                      'gathered': 4,
                      'go': 1,
                      'gone': 2,
                      'gracious': 1,
                      'great': 1,
                      'greater': 2,
                      'grievous': 1,
                      'gro': 1,
                      'he': 1,
                      'heir': 1,
                      'her': 1,
                      'hid': 1,
                      'his': 6,
                      'in': 5,
                      'joined': 1,
                      'kept': 1,
                      'known': 2,
                      'light': 1,
                      'lights': 1,
                      'lord': 1,
                      'male': 1,
                      'meat': 1,
                      'mine': 1,
                      'my': 6,
                      'named': 1,
                      'near': 1,
                      'no': 1,
                      'not': 5,
                      'now': 1,
                      'numbered': 3,
                      'of': 2,
                      'on': 1,
                      'one': 2,
                      'opened': 1,
                      'our': 1,
                      'over': 1,
                      'proved': 2,
                      'put': 1,
                      'red': 1,
                      'restrained': 1,
                      'rul': 1,
                      'scattered': 1,
                      'seen': 2,
                      'separated': 1,
                      'servants': 1,
                      'seven': 1,
                      'shamed': 1,
                      'she': 1,
                      'shed': 1,
                      'so': 3,
                      'stronger': 1,
                      'surety': 1,
                      'taken': 1,
                      'that': 1,
                      'the': 7,
                      'their': 2,
                      'thine': 3,
                      'thou': 2,
                      'three': 1,
                      'thy': 8,
                      'to': 2,
                      'true': 1,
                      'twelve': 1,
                      'twenty': 1,
                      'unto': 2,
                      'upon': 4,
                      'verified': 1,
                      'very': 1,
                      'well': 3,
                      'willing': 2,
                      'with': 11,
                      'witness': 2,
                      'ye': 2,
                      'your': 2}),
             'wealth': Counter({',': 1}),
             'must': Counter({'I': 1,
                      'be': 1,
                      'come': 1,
                      'die': 1,
                      'needs': 1,
                      'not': 1}),
             'names': Counter({',': 2,
                      ';': 1,
                      'after': 1,
                      'by': 1,
                      'of': 4,
                      'to': 1}),
             'ready': Counter({';': 1, 'his': 1, 'quickly': 1, 'the': 1}),
             'acknowledged': Counter({'them': 1}),
             'anything': Counter({'till': 1}),
             'an': Counter({'Egyptian': 2,
                      'Hebrew': 2,
                      'abomination': 2,
                      'adder': 1,
                      'altar': 10,
                      'answer': 1,
                      'archer': 1,
                      'ark': 1,
                      'end': 2,
                      'everlasting': 5,
                      'evil': 1,
                      'hairy': 1,
                      'handmaid': 2,
                      'harlot': 2,
                      'haven': 1,
                      'he': 1,
                      'heifer': 1,
                      'help': 2,
                      'horror': 1,
                      'house': 1,
                      'hundred': 20,
                      'hundredfo': 1,
                      'husbandman': 1,
                      'instructor': 1,
                      'interpreter': 1,
                      'o': 1,
                      'oath': 2,
                      'offering': 1,
                      'officer': 2,
                      'old': 3,
                      'olive': 1,
                      'open': 1,
                      'oversig': 1}),
             'Ithran': Counter({',': 1}),
             'Jehovahjireh': Counter({':': 1}),
             'Enoch': Counter({'.': 1,
                      ':': 1,
                      'eight': 1,
                      'lived': 1,
                      'walked': 2,
                      'was': 1,
                      'were': 1}),
             'Pharez': Counter({',': 1, '.': 1, 'were': 1}),
             'savour': Counter({';': 1}),
             'heap': Counter({',': 1,
                      '.': 1,
                      'and': 1,
                      'be': 1,
                      'is': 1,
                      'to': 1}),
             'requite': Counter({'us': 1}),
             'Moab': Counter({',': 1, ':': 1}),
             'ascending': Counter({'and': 1}),
             'Cainan': Counter({':': 1, 'eight': 1, 'lived': 2, 'were': 1}),
             'errand': Counter({'.': 1}),
             'being': Counter({'an': 1,
                      'eight': 1,
                      'few': 1,
                      'in': 1,
                      'merciful': 1,
                      'old': 2,
                      'seventeen': 1}),
             'confound': Counter({'the': 1, 'their': 1}),
             'arose': Counter({',': 6, '.': 2}),
             'dukes': Counter({'.': 1, 'in': 1, 'of': 3, 'that': 6}),
             'Send': Counter({'me': 2, 'one': 1, 'the': 1}),
             'Lasha': Counter({'.': 1}),
             'boug': Counter({'and': 1}),
             'touching': Counter({'thee': 1}),
             'adder': Counter({'in': 1}),
             'joined': Counter({'battle': 1, 'together': 1, 'unto': 1}),
             'were': Counter({'Asshurim': 1,
                      'Belah': 1,
                      'Hezron': 1,
                      'Hori': 1,
                      'Laban': 1,
                      'Shem': 1,
                      'Teman': 1,
                      'a': 3,
                      'abated': 3,
                      'above': 1,
                      'afraid': 3,
                      'all': 2,
                      'an': 1,
                      'at': 1,
                      'backward': 1,
                      'binding': 1,
                      'born': 10,
                      'both': 2,
                      'bou': 1,
                      'bought': 1,
                      'bound': 1,
                      'brought': 2,
                      'children': 1,
                      'circumcised': 1,
                      'come': 1,
                      'coming': 1,
                      'confederate': 1,
                      'content': 1,
                      'covered': 2,
                      'created': 2,
                      'destroyed': 1,
                      'dim': 2,
                      'dried': 2,
                      'dukes': 1,
                      'eight': 2,
                      'ended': 1,
                      'fair': 1,
                      'feeble': 1,
                      'finished': 1,
                      'fourteen': 1,
                      'fulfilled': 2,
                      'giants': 1,
                      'gone': 1,
                      'grieved': 1,
                      'hairy': 1,
                      'her': 1,
                      'in': 10,
                      'increased': 1,
                      'joined': 1,
                      'made': 3,
                      'merry': 1,
                      'more': 1,
                      'naked': 1,
                      'nine': 7,
                      'not': 1,
                      'of': 1,
                      'old': 1,
                      'on': 1,
                      'opened': 2,
                      'past': 1,
                      'ringstraked': 2,
                      'round': 1,
                      'sad': 1,
                      'sent': 1,
                      'seven': 2,
                      'sons': 1,
                      'sore': 2,
                      'speckled': 1,
                      'still': 1,
                      'stopped': 1,
                      'strangers': 1,
                      'the': 18,
                      'these': 2,
                      'thirty': 1,
                      'three': 3,
                      'threescore': 2,
                      'told': 1,
                      'troubled': 1,
                      'twel': 1,
                      'twins': 1,
                      'two': 2,
                      'under': 2,
                      'upon': 1,
                      'very': 1,
                      'wicked': 1,
                      'with': 10}),
             'haven': Counter({'of': 2}),
             'bowels': Counter({';': 1, 'did': 1, 'shall': 1}),
             'Ohad': Counter({',': 1}),
             'dried': Counter({'.': 1, 'up': 2}),
             'mistress': Counter({',': 1, 'Sarai': 1, 'was': 1}),
             'rent': Counter({'his': 2, 'in': 1, 'their': 1}),
             'covering': Counter({'of': 2}),
             'Javan': Counter({',': 1, ';': 1}),
             'stranger': Counter({',': 5, 'and': 1, 'in': 1}),
             'meditate': Counter({'in': 1}),
             'womenservan': Counter({'and': 1}),
             'Jahleel': Counter({'.': 1}),
             'womb': Counter({',': 1, '.': 3, ':': 1, '?': 1}),
             'Give': Counter({'me': 6, 'us': 1, 'your': 1}),
             'fail': Counter({'.': 1}),
             'Jared': Counter({':': 1, 'eight': 1, 'lived': 2, 'were': 1}),
             'consent': Counter({'unto': 3}),
             'dignity': Counter({',': 1}),
             'Hamul': Counter({'.': 1}),
             'Abide': Counter({'ye': 1}),
             ',)': Counter({'and': 1}),
             'midwife': Counter({'said': 1, 'took': 1}),
             'grass': Counter({',': 2}),
             'youth': Counter({';': 1, 'even': 1}),
             'Areli': Counter({'.': 1}),
             'judged': Counter({'me': 1}),
             'imagined': Counter({'to': 1}),
             'Bethuel': Counter({',': 2,
                      '.': 1,
                      'answered': 1,
                      'begat': 1,
                      'the': 3,
                      'thy': 1}),
             'Belah': Counter({',': 1}),
             'famished': Counter({',': 1}),
             'how': Counter({'I': 1,
                      'saidst': 1,
                      'shall': 2,
                      'that': 1,
                      'then': 2,
                      'thy': 1}),
             'ten': Counter({"'": 1,
                      '.': 1,
                      ';': 1,
                      'asses': 1,
                      'brethren': 1,
                      'bulls': 1,
                      'camels': 1,
                      'days': 1,
                      'foals': 1,
                      'shall': 1,
                      'she': 1,
                      'shekels': 1,
                      'times': 2,
                      'years': 4}),
             'honey': Counter({',': 1}),
             'Thahash': Counter({',': 1}),
             'Galeed': Counter({'.': 1, ';': 1}),
             'shalt': Counter({'afflict': 1,
                      'be': 7,
                      'bear': 1,
                      'break': 1,
                      'bring': 2,
                      'bruise': 1,
                      'call': 2,
                      'carry': 1,
                      'come': 1,
                      'deliver': 1,
                      'do': 1,
                      'dwell': 1,
                      'eat': 1,
                      'gather': 1,
                      'give': 1,
                      'go': 3,
                      'have': 2,
                      'keep': 1,
                      'li': 1,
                      'make': 1,
                      'not': 10,
                      'pitch': 1,
                      'rule': 1,
                      'say': 1,
                      'serve': 2,
                      'shew': 1,
                      'spread': 1,
                      'surely': 2,
                      'take': 4,
                      'thou': 19}),
             'hired': Counter({'thee': 1}),
             'alone': Counter({';': 2, 'is': 1}),
             'rightly': Counter({'named': 1}),
             'Rameses': Counter({',': 1}),
             'Feed': Counter({'me': 1}),
             'half': Counter({'a': 1}),
             'Ham': Counter({',': 8, ';': 1, 'is': 1}),
             'food': Counter({',': 3,
                      '.': 3,
                      ':': 1,
                      ';': 1,
                      'are': 1,
                      'for': 3,
                      'in': 2,
                      'of': 3,
                      'shall': 1,
                      'that': 1}),
             'whether': Counter({'it': 2,
                      'stolen': 1,
                      'the': 1,
                      'there': 1,
                      'they': 1,
                      'thou': 1,
                      'ye': 1}),
             'ride': Counter({'in': 1}),
             'beneath': Counter({'Bethel': 1}),
             'rib': Counter({',': 1}),
             'shoelatchet': Counter({',': 1}),
             'then': Counter({',': 1,
                      'Abimelech': 1,
                      'I': 5,
                      'all': 1,
                      'bare': 1,
                      'began': 1,
                      'can': 1,
                      'defiledst': 1,
                      'hast': 1,
                      'he': 1,
                      'in': 2,
                      'let': 1,
                      'make': 1,
                      'receive': 1,
                      'shall': 4,
                      'should': 1,
                      'the': 2,
                      'thou': 1,
                      'we': 1,
                      'why': 1,
                      'will': 3,
                      'your': 1}),
             'living': Counter({',': 1,
                      '.': 1,
                      'creature': 7,
                      'soul': 1,
                      'substance': 2,
                      'thing': 4}),
             'slimepits': Counter({';': 1}),
             'urged': Counter({'him': 1}),
             'intreat': Counter({'for': 1}),
             'lion': Counter({"'": 1, ',': 1, ';': 1}),
             'into': Counter({'Egypt': 17,
                      'Jacob': 1,
                      'Joseph': 3,
                      'Leah': 1,
                      'Pharaoh': 5,
                      'Rachel': 1,
                      'Zoar': 1,
                      'a': 2,
                      'four': 1,
                      'his': 7,
                      'my': 1,
                      'some': 1,
                      'the': 36,
                      'their': 1,
                      'this': 2,
                      'thy': 2,
                      'two': 1,
                      'ward': 1,
                      'your': 2}),
             'Zarah': Counter({'.': 1}),
             'southward': Counter({',': 1}),
             'yet': Counter({'a': 1,
                      'again': 2,
                      'alive': 6,
                      'another': 1,
                      'any': 1,
                      'before': 1,
                      'but': 1,
                      'far': 1,
                      'full': 1,
                      'heard': 1,
                      'high': 1,
                      'his': 1,
                      'indeed': 1,
                      'live': 1,
                      'lived': 1,
                      'other': 2,
                      'scarce': 1,
                      'seven': 3,
                      'spake': 1,
                      'the': 3,
                      'there': 3,
                      'wherefore': 1}),
             'exceedingly': Counter({',': 3, '.': 3, ';': 1, 'upon': 1}),
             'Get': Counter({'me': 1, 'thee': 1}),
             'trough': Counter({',': 1}),
             'Abraham': Counter({"'": 13,
                      ',': 28,
                      '.': 4,
                      ';': 1,
                      'a': 1,
                      'and': 6,
                      'answered': 1,
                      'begat': 1,
                      'bought': 2,
                      'bowed': 1,
                      'built': 1,
                      'buried': 2,
                      'called': 2,
                      'came': 1,
                      'circumcised': 2,
                      'drew': 1,
                      'dwelt': 1,
                      'fell': 1,
                      'for': 2,
                      'gat': 1,
                      'gave': 3,
                      'had': 1,
                      'hastened': 1,
                      'hearkened': 1,
                      'his': 4,
                      'in': 2,
                      'journeyed': 1,
                      'lifted': 2,
                      'made': 1,
                      'obeyed': 1,
                      'out': 1,
                      'planted': 1,
                      'prayed': 1,
                      'purchased': 1,
                      'ran': 1,
                      'reproved': 1,
                      'returned': 2,
                      'rose': 2,
                      'said': 8,
                      'set': 1,
                      'shall': 1,
                      'sojourned': 1,
                      'stood': 3,
                      'stretched': 1,
                      'that': 2,
                      'thy': 3,
                      'took': 4,
                      'was': 3,
                      'weighed': 1,
                      'went': 2}),
             'countenance': Counter({',': 1, 'fallen': 1, 'fell': 1, 'of': 1}),
             'tongues': Counter({',': 2}),
             'else': Counter({'I': 1, 'by': 1}),
             'bowed': Counter({'down': 5,
                      'himself': 6,
                      'his': 1,
                      'themselv': 1,
                      'themselves': 3}),
             'let': Counter({'Pharaoh': 1,
                      'down': 2,
                      'fowl': 1,
                      'her': 2,
                      'him': 4,
                      'it': 4,
                      'loose': 1,
                      'me': 8,
                      'my': 1,
                      'not': 4,
                      'one': 1,
                      'the': 4,
                      'thee': 1,
                      'them': 9,
                      'thy': 5,
                      'us': 16}),
             'elder': Counter({',': 1, 'shall': 1, 'son': 1, 'was': 1}),
             'conceived': Counter({',': 12,
                      '.': 1,
                      ':': 1,
                      'again': 6,
                      'before': 1,
                      'by': 1}),
             'dea': Counter({'Now': 1}),
             'thirty': Counter({'and': 2,
                      'be': 1,
                      'cubits': 1,
                      'there': 1,
                      'yea': 2,
                      'years': 10}),
             'gave': Counter({'Abraham': 1,
                      'Esau': 1,
                      'all': 1,
                      'also': 2,
                      'each': 1,
                      'gifts': 1,
                      'her': 3,
                      'him': 7,
                      'it': 3,
                      'me': 1,
                      'names': 1,
                      'straw': 1,
                      'th': 1,
                      'the': 4,
                      'their': 1,
                      'them': 9,
                      'three': 1,
                      'to': 2,
                      'unto': 4,
                      'up': 3}),
             'verified': Counter({',': 1}),
             'straw': Counter({'and': 2}),
             'We': Counter({'are': 3,
                      'be': 1,
                      'came': 1,
                      'cannot': 3,
                      'have': 4,
                      'know': 1,
                      'may': 1,
                      'saw': 1,
                      'will': 2}),
             'Huz': Counter({'his': 1}),
             'grown': Counter({',': 1}),
             'follow': Counter({'after': 1, 'me': 2, 'thee': 1}),
             'bondman': Counter({'to': 1}),
             'Should': Counter({'he': 1}),
             'kindled': Counter({'.': 1, 'against': 1}),
             'nourish': Counter({'thee': 1, 'you': 1}),
             'Deliver': Counter({'me': 1}),
             'sanctified': Counter({'it': 1}),
             'given': Counter({'all': 1,
                      'children': 1,
                      'every': 1,
                      'him': 1,
                      'me': 4,
                      'my': 2,
                      'no': 1,
                      'them': 1,
                      'this': 1,
                      'thy': 2,
                      'to': 2,
                      'unto': 1,
                      'you': 3}),
             'scatter': Counter({'them': 2}),
             'unawares': Counter({'to': 2}),
             'laws': Counter({'.': 1}),
             'goeth': Counter({'before': 2, 'toward': 1, 'up': 1}),
             'anoth': Counter({'and': 1, 'but': 1}),
             'gopher': Counter({'wood': 1}),
             'shepherds': Counter({',': 2}),
             'fifth': Counter({'day': 1, 'part': 3, 'son': 1}),
             'dwelled': Counter({'between': 1, 'in': 2, 'then': 1}),
             'bowing': Counter({'himself': 1}),
             'asked': Counter({'Pharaoh': 1,
                      'her': 1,
                      'him': 3,
                      'his': 1,
                      'the': 1,
                      'them': 1,
                      'us': 1}),
             'Kor': Counter({'these': 1}),
             'trespass': Counter({'?': 1, 'of': 2}),
             'sat': Counter({'before': 1,
                      'down': 1,
                      'her': 1,
                      'in': 3,
                      'over': 1,
                      'upon': 2}),
             'Mizz': Counter({'these': 2}),
             'sister': Counter({"'": 2,
                      ',': 4,
                      '.': 1,
                      ':': 2,
                      ';': 2,
                      '?': 3,
                      'as': 1,
                      'of': 3,
                      'to': 2,
                      'was': 1}),
             'Bring': Counter({'forth': 1,
                      'her': 1,
                      'him': 1,
                      'it': 1,
                      'me': 1,
                      'them': 1,
                      'these': 1,
                      'your': 1}),
             'ha': Counter({'And': 1, 'But': 1, 'and': 3, 'but': 1, 'for': 1}),
             'Gentiles': Counter({'divided': 1}),
             'blesseth': Counter({'thee': 1}),
             'merchant': Counter({'.': 1}),
             'firstlings': Counter({'of': 1}),
             'lien': Counter({'with': 1}),
             'purposing': Counter({'to': 1}),
             'bruise': Counter({'his': 1, 'thy': 1}),
             'abode': Counter({'in': 1, 'with': 1}),
             'Arise': Counter({',': 5}),
             'Sichem': Counter({',': 1}),
             'ears': Counter({',': 2,
                      '.': 1,
                      ':': 1,
                      ';': 1,
                      'and': 1,
                      'are': 1,
                      'blasted': 1,
                      'came': 1,
                      'devoured': 2,
                      'of': 2}),
             'righteous': Counter({':': 1,
                      'before': 1,
                      'nation': 1,
                      'should': 1,
                      'than': 1,
                      'that': 1,
                      'with': 2,
                      'within': 2}),
             'healed': Counter({'Abimelech': 1}),
             'second': Counter({',': 2,
                      'called': 1,
                      'chariot': 1,
                      'day': 1,
                      'month': 2,
                      'river': 1,
                      'son': 2,
                      'time': 3,
                      'year': 1}),
             'forgive': Counter({'the': 1}),
             'favoured': Counter({',': 1,
                      '.': 2,
                      ';': 1,
                      'and': 4,
                      'kine': 3}),
             'Look': Counter({'now': 1}),
             'many': Counter({'colours': 3, 'days': 2, 'nations': 2}),
             'gold': Counter({',': 2,
                      '.': 1,
                      ';': 2,
                      '?': 1,
                      'chain': 1,
                      'of': 1}),
             'number': Counter({',': 1, '.': 1, 'th': 1, 'the': 1}),
             'Amalekites': Counter({',': 1}),
             'followed': Counter({'the': 2}),
             'blossoms': Counter({'shot': 1}),
             '?': Counter({'And': 74,
                      'Are': 1,
                      'At': 1,
                      'But': 1,
                      'Come': 1,
                      'For': 1,
                      'God': 2,
                      'Hast': 1,
                      'If': 1,
                      'Is': 3,
                      'Now': 1,
                      'Peradventure': 1,
                      'Said': 1,
                      'Shall': 1,
                      'That': 2,
                      'The': 1,
                      'Then': 1,
                      'Whereas': 1,
                      'Wherefore': 2,
                      'Why': 1,
                      'With': 1,
                      'and': 10,
                      'bless': 1,
                      'bring': 1,
                      'bury': 1,
                      'buy': 1,
                      'come': 1,
                      'did': 1,
                      'fear': 1,
                      'for': 5,
                      'have': 1,
                      'lest': 1,
                      'let': 1,
                      'one': 1,
                      'only': 1,
                      'or': 2,
                      'separate': 1,
                      'set': 1,
                      'so': 1,
                      'son': 1,
                      'tell': 3,
                      'the': 1,
                      'therefore': 1,
                      'this': 1,
                      'thou': 2,
                      'what': 2,
                      'where': 1,
                      'wherefore': 1,
                      'why': 1,
                      'wot': 1,
                      'ye': 1,
                      'yea': 1}),
             'Magdiel': Counter({',': 1}),
             'garment': Counter({',': 2, ';': 1, 'by': 1, 'in': 2, 'with': 2}),
             'a': Counter({'Canaanitish': 1,
                      'God': 1,
                      'Thou': 1,
                      'and': 2,
                      'blessi': 1,
                      'blessing': 2,
                      'bondman': 1,
                      'bottle': 1,
                      'bow': 1,
                      'brother': 3,
                      'burning': 1,
                      'burnt': 4,
                      'buryingplace': 5,
                      'calf': 1,
                      'cave': 1,
                      'certain': 4,
                      'charge': 1,
                      'child': 3,
                      'city': 3,
                      'cloud': 1,
                      'coat': 1,
                      'coffin': 1,
                      'company': 2,
                      'covenant': 6,
                      'covering': 1,
                      'cubit': 1,
                      'cunning': 1,
                      'curse': 1,
                      'daughter': 2,
                      'dead': 1,
                      'deceiver': 1,
                      'deep': 2,
                      'dove': 1,
                      'dream': 13,
                      'drink': 1,
                      'fair': 1,
                      'famine': 2,
                      'father': 5,
                      'feast': 4,
                      'few': 3,
                      'field': 2,
                      'firmament': 1,
                      'flaming': 1,
                      'flood': 4,
                      'fountain': 1,
                      'fruitful': 2,
                      'fugitive': 2,
                      'furnace': 1,
                      'garden': 1,
                      'garment': 1,
                      'gold': 1,
                      'golden': 1,
                      'good': 5,
                      'goodly': 1,
                      'great': 12,
                      'grief': 1,
                      'grievous': 1,
                      'grove': 1,
                      'hairy': 1,
                      'hind': 1,
                      'jud': 1,
                      'just': 1,
                      'keeper': 1,
                      'kid': 2,
                      'knife': 1,
                      'ladder': 1,
                      'lamb': 1,
                      'land': 2,
                      'law': 1,
                      'lawgiver': 1,
                      'lion': 2,
                      'little': 12,
                      'living': 1,
                      'long': 1,
                      'loud': 1,
                      'man': 13,
                      'mark': 1,
                      'mead': 1,
                      'meadow': 1,
                      'messenger': 1,
                      'mighty': 3,
                      'mist': 1,
                      'month': 1,
                      'morsel': 1,
                      'mother': 1,
                      'mount': 1,
                      'mountain': 1,
                      'mourning': 1,
                      'multitude': 5,
                      'name': 1,
                      'nation': 2,
                      'one': 1,
                      'parcel': 1,
                      'people': 1,
                      'pillar': 7,
                      'pit': 1,
                      'place': 1,
                      'plain': 2,
                      'pledge': 1,
                      'portion': 1,
                      'possession': 7,
                      'posterity': 1,
                      'present': 3,
                      'prince': 1,
                      'prophet': 1,
                      'prosperous': 1,
                      'ram': 2,
                      'raven': 1,
                      'reproach': 1,
                      'righteous': 1,
                      'river': 1,
                      'ruler': 1,
                      's': 2,
                      'scarlet': 1,
                      'season': 1,
                      'second': 2,
                      'serpent': 1,
                      'servant': 2,
                      'she': 1,
                      'shekel': 1,
                      'shoelatchet': 1,
                      'small': 1,
                      'smoking': 1,
                      'smooth': 1,
                      'sojourner': 1,
                      'son': 24,
                      'space': 1,
                      'stone': 1,
                      'stranger': 5,
                      'strife': 1,
                      'strong': 1,
                      'surety': 3,
                      'sweet': 1,
                      'thicket': 1,
                      'thousand': 1,
                      'thread': 1,
                      'tiller': 1,
                      'token': 2,
                      'tower': 1,
                      'tree': 3,
                      'troop': 1,
                      'turtledove': 1,
                      'vagabond': 2,
                      'vail': 2,
                      'very': 1,
                      'vine': 1,
                      'vineyard': 1,
                      'virgin': 1,
                      'vision': 1,
                      'vow': 2,
                      'wall': 1,
                      'well': 7,
                      'widow': 1,
                      'wife': 14,
                      'wild': 1,
                      'wind': 1,
                      'window': 1,
                      'witness': 3,
                      'wolf': 1,
                      'woman': 1,
                      'word': 1,
                      'young': 4}),
             'praise': Counter({':': 1, 'the': 1}),
             'violence': Counter({'.': 1, 'through': 1}),
             'Go': Counter({',': 1,
                      'again': 2,
                      'forth': 1,
                      'from': 1,
                      'in': 1,
                      'not': 1,
                      'now': 1,
                      'to': 4,
                      'unto': 1,
                      'up': 1}),
             'fou': Counter({'know': 1}),
             'serva': Counter({'Let': 1, 'for': 1}),
             'refused': Counter({',': 2, 'to': 1}),
             'about': Counter({',': 2,
                      'cattle': 1,
                      'every': 1,
                      'from': 1,
                      'his': 1,
                      'them': 1,
                      'this': 1,
                      'three': 1,
                      'to': 2}),
             'naked': Counter({',': 1, ';': 2, '?': 1}),
             'Perizzite': Counter({'dwelled': 1}),
             'shot': Counter({'at': 1, 'forth': 1}),
             'fir': Counter({'and': 1}),
             'like': Counter({'an': 1, 'the': 1}),
             'herds': Counter({',': 9, 'of': 1, 'with': 1}),
             'Up': Counter({',': 2}),
             'guard': Counter({"'": 1, ',': 2, '.': 1, ';': 1, 'charged': 1}),
             'fleddest': Counter({'from': 1}),
             'week': Counter({',': 1, ':': 1}),
             'Onan': Counter({',': 2, '.': 1, 'died': 1, 'knew': 1}),
             'Kenizzites': Counter({',': 1}),
             'asswaged': Counter({';': 1}),
             'Edomites': Counter({'.': 1, 'in': 1}),
             'Eno': Counter({'And': 1}),
             'hearkened': Counter({'all': 1, 'not': 1, 'to': 2, 'unto': 3}),
             'Deborah': Counter({'Rebekah': 1}),
             'thy': Counter({'God': 1,
                      'affliction': 1,
                      'belly': 1,
                      'birthright': 1,
                      'blessing': 1,
                      'bondwoman': 1,
                      'bosom': 1,
                      'bow': 1,
                      'bowels': 1,
                      'bracelets': 1,
                      'brethren': 10,
                      'brother': 14,
                      'came': 1,
                      'camels': 3,
                      'catt': 1,
                      'cattle': 1,
                      'children': 2,
                      'conception': 1,
                      'countenance': 1,
                      'country': 2,
                      'curse': 1,
                      'daughter': 1,
                      'daughters': 2,
                      'dead': 4,
                      'desire': 1,
                      'dwelling': 1,
                      'ewes': 1,
                      'exceeding': 1,
                      'fa': 2,
                      'face': 4,
                      'fath': 1,
                      'father': 19,
                      'fathers': 2,
                      'first': 1,
                      'firstborn': 1,
                      'flesh': 1,
                      'flock': 3,
                      'flocks': 1,
                      'gods': 1,
                      'hand': 5,
                      'head': 2,
                      'heart': 1,
                      'herdmen': 1,
                      'herds': 1,
                      'hire': 1,
                      'house': 3,
                      'household': 2,
                      'husband': 1,
                      'issue': 1,
                      'kindness': 1,
                      'kindred': 4,
                      'li': 1,
                      'life': 2,
                      'loins': 1,
                      'lord': 2,
                      'maid': 1,
                      'master': 1,
                      'mercy': 1,
                      'mistress': 1,
                      'money': 1,
                      'mother': 4,
                      'na': 1,
                      'name': 6,
                      'neck': 1,
                      'pitcher': 3,
                      'pla': 1,
                      'presence': 1,
                      'quiver': 1,
                      'right': 1,
                      'sake': 3,
                      'salvation': 1,
                      'seed': 35,
                      'serva': 2,
                      'servan': 1,
                      'servant': 14,
                      'servants': 10,
                      'she': 1,
                      'shield': 1,
                      'sight': 6,
                      'son': 10,
                      'sons': 5,
                      'sorrow': 1,
                      'soul': 2,
                      'staff': 1,
                      'sword': 1,
                      'two': 3,
                      'voice': 1,
                      'wages': 3,
                      'way': 2,
                      'weapons': 1,
                      'wife': 12,
                      'womb': 1,
                      'word': 2,
                      'younger': 1}),
             'grew': Counter({',': 3, ':': 1, 'until': 1, 'upon': 1}),
             'Reuel': Counter({';': 2, 'Esau': 1, 'in': 1, 'the': 1}),
             'own': Counter({',': 1,
                      'bowels': 1,
                      'flocks': 1,
                      'house': 2,
                      'image': 1,
                      'likeness': 1,
                      'place': 1}),
             'bread': Counter({',': 8,
                      '.': 2,
                      ':': 1,
                      'and': 3,
                      'for': 1,
                      'in': 2,
                      'shall': 1,
                      'there': 1,
                      'to': 1,
                      'which': 1,
                      'with': 1}),
             'Ye': Counter({'are': 2, 'have': 1, 'know': 1, 'shall': 5}),
             'binding': Counter({'sheaves': 1}),
             'make': Counter({'a': 3,
                      'between': 1,
                      'brick': 1,
                      'cakes': 1,
                      'coats': 1,
                      'him': 5,
                      'in': 1,
                      'it': 2,
                      'man': 1,
                      'me': 3,
                      'mention': 1,
                      'my': 1,
                      'nations': 1,
                      'of': 3,
                      'one': 1,
                      'our': 1,
                      'ready': 1,
                      'thee': 5,
                      'them': 2,
                      'there': 3,
                      'thy': 5,
                      'to': 1,
                      'us': 1,
                      'ye': 1}),
             'substance': Counter({',': 1,
                      '.': 1,
                      'and': 1,
                      'that': 2,
                      'was': 2}),
             'way': Counter({',': 7,
                      '.': 7,
                      ';': 2,
                      'in': 1,
                      'of': 4,
                      'off': 1,
                      'side': 1,
                      'that': 1,
                      'to': 6,
                      'unto': 1,
                      'upon': 1,
                      'which': 2}),
             'another': Counter({"'": 1,
                      ',': 4,
                      '.': 2,
                      '?': 1,
                      'brother': 1,
                      'dream': 1,
                      'm': 1,
                      'seed': 1,
                      'son': 1,
                      'well': 2}),
             'mayest': Counter({'be': 1,
                      'bury': 1,
                      'come': 1,
                      'freely': 1,
                      'inherit': 1}),
             'from': Counter({'Abraham': 2,
                      'Beersheba': 2,
                      'Bethel': 1,
                      'Ephraim': 1,
                      'Gerar': 1,
                      'Gilead': 1,
                      'Havilah': 1,
                      'Isaac': 1,
                      'Judah': 1,
                      'Mesha': 1,
                      'Padan': 1,
                      'Padanaram': 1,
                      'Sidon': 1,
                      'Ur': 1,
                      'a': 1,
                      'above': 1,
                      'all': 4,
                      'another': 1,
                      'beari': 1,
                      'before': 3,
                      'behind': 1,
                      'between': 2,
                      'every': 1,
                      'heaven': 1,
                      'hence': 1,
                      'her': 4,
                      'him': 4,
                      'his': 7,
                      'if': 1,
                      'man': 1,
                      'me': 7,
                      'mine': 1,
                      'my': 4,
                      'off': 9,
                      'one': 1,
                      'our': 2,
                      'sinning': 1,
                      'th': 1,
                      'the': 43,
                      'thee': 4,
                      'them': 3,
                      'then': 1,
                      'thence': 16,
                      'this': 3,
                      'thy': 6,
                      'under': 1,
                      'us': 1,
                      'whence': 2,
                      'you': 1}),
             'it': Counter({',': 19,
                      '.': 16,
                      ':': 2,
                      ';': 3,
                      '?': 3,
                      'EleloheIsrael': 1,
                      'Galeed': 1,
                      'Jegarsahadutha': 1,
                      'Rehoboth': 1,
                      'Shebah': 1,
                      'Sitnah': 1,
                      'a': 2,
                      'above': 1,
                      'again': 2,
                      'all': 1,
                      'and': 2,
                      'be': 7,
                      'before': 1,
                      'bring': 1,
                      'budded': 1,
                      'called': 2,
                      'came': 63,
                      'ceased': 1,
                      'come': 1,
                      'could': 1,
                      'displeased': 1,
                      'divide': 1,
                      'down': 1,
                      'fifty': 1,
                      'for': 3,
                      'from': 2,
                      'gr': 1,
                      'grieved': 1,
                      'he': 1,
                      'her': 1,
                      'here': 1,
                      'his': 2,
                      'if': 1,
                      'in': 2,
                      'is': 15,
                      'may': 2,
                      'me': 2,
                      'might': 1,
                      'must': 1,
                      'near': 2,
                      'not': 4,
                      'of': 2,
                      'on': 2,
                      'pleased': 1,
                      'pleaseth': 2,
                      'reached': 1,
                      'repented': 1,
                      'repenteth': 1,
                      'shall': 18,
                      'so': 1,
                      'th': 1,
                      'that': 5,
                      'thee': 1,
                      'thirty': 1,
                      'time': 1,
                      'to': 14,
                      'unto': 5,
                      'up': 2,
                      'upon': 4,
                      'was': 41,
                      'wast': 1,
                      'were': 1,
                      'which': 2,
                      'within': 1}),
             'sweet': Counter({'savour': 1}),
             'freely': Counter({'e': 1}),
             'bondmen': Counter({',': 1, '.': 1}),
             'Fulfil': Counter({'her': 1}),
             'ought': Counter({'he': 1, 'left': 1, 'not': 2}),
             'seedtime': Counter({'and': 1}),
             'also': Counter({',': 9,
                      '.': 5,
                      ':': 2,
                      ';': 1,
                      '?': 3,
                      'Rachel': 1,
                      'Tebah': 1,
                      'a': 1,
                      'after': 1,
                      'and': 2,
                      'bare': 2,
                      'be': 1,
                      'born': 1,
                      'brought': 2,
                      'destroy': 2,
                      'draw': 1,
                      'for': 1,
                      'from': 1,
                      'had': 1,
                      'hath': 1,
                      'have': 2,
                      'he': 2,
                      'heard': 1,
                      'here': 1,
                      'his': 1,
                      'in': 1,
                      'is': 1,
                      'let': 1,
                      'of': 8,
                      'our': 4,
                      'shall': 2,
                      'stood': 1,
                      'surely': 1,
                      'that': 1,
                      'the': 2,
                      'there': 1,
                      'thy': 1,
                      'to': 1,
                      'unto': 2,
                      'was': 2,
                      'went': 1,
                      'will': 1,
                      'with': 2,
                      'withheld': 1,
                      'your': 1}),
             'al': Counter({'and': 1, 'let': 1, 'so': 1}),
             'back': Counter({'.': 1,
                      'all': 1,
                      'any': 1,
                      'from': 1,
                      'his': 1}),
             'putting': Counter({'it': 1}),
             'bracelets': Counter({',': 2, 'for': 1, 'upon': 2}),
             'prevail': Counter({';': 1, 'and': 1}),
             'Bilhan': Counter({',': 1}),
             'Whence': Counter({'come': 1}),
             'possessor': Counter({'of': 2}),
             'han': Counter({'so': 1}),
             'provision': Counter({'for': 2}),
             'space': Counter({'betwixt': 1, 'of': 1}),
             'rebelled': Counter({'.': 1}),
             'Tubalcain': Counter({',': 1, 'was': 1}),
             'fury': Counter({'turn': 1}),
             'deeds': Counter({'unto': 1}),
             'years': Counter({',': 35,
                      '.': 9,
                      ':': 6,
                      ';': 4,
                      'after': 1,
                      'and': 1,
                      'for': 4,
                      'hath': 1,
                      'have': 1,
                      'in': 2,
                      'of': 14,
                      'old': 21,
                      'that': 1,
                      'the': 1,
                      'they': 1}),
             'provide': Counter({'for': 1, 'himself': 1}),
             'compasseth': Counter({'the': 2}),
             ...})

In [52]:
cfd['living']


Out[52]:
Counter({',': 1, '.': 1, 'creature': 7, 'soul': 1, 'substance': 2, 'thing': 4})

In [53]:
generate_model(cfd, 'living')


living creature that he said , and the land of the land of the land 

This code generates bigrams from text, constructs a CFD for which words follow an input word -- this is used to generate text.

"this simple approach to text generation tends to get stuck in loops"


More Python: Reusing Code

skipping this section


Lexical Resources ('lexicon')

a collection of words and/or phrases along with associated information such as part of speech and sense definitions

A lexical entry consists of a headword (lemma) along with additional information such as the part of speech and the sense definition

Wordlist Corpora

the Words Corpus is the /usr/share/dict/words file from Unix systems


In [54]:
def unusual_words(text):
    text_vocab = set(w.lower() for w in text if w.isalpha())
    english_vocab = set(w.lower() for w in nltk.corpus.words.words()) ## Words Corpus
    unusual = text_vocab - english_vocab
    return sorted(unusual)

unusual_words(nltk.corpus.gutenberg.words('austen-sense.txt'))


Out[54]:
['abbeyland',
 'abhorred',
 'abilities',
 'abounded',
 'abridgement',
 'abused',
 'abuses',
 'accents',
 'accepting',
 'accommodations',
 'accompanied',
 'accounted',
 'accounts',
 'accustomary',
 'aches',
 'acknowledging',
 'acknowledgment',
 'acknowledgments',
 'acquaintances',
 'acquiesced',
 'acquitted',
 'acquitting',
 'acted',
 'actions',
 'adapted',
 'adding',
 'additions',
 'addressed',
 'addresses',
 'addressing',
 'adhering',
 'adieus',
 'adjusting',
 'administering',
 'admirers',
 'admires',
 'admitting',
 'adorned',
 'advances',
 'advantages',
 'affairs',
 'affections',
 'affects',
 'affixed',
 'afflictions',
 'afforded',
 'affording',
 'ages',
 'agitated',
 'agonies',
 'ailments',
 'aimed',
 'alarms',
 'alienated',
 'alighted',
 'alleged',
 'allenham',
 'allowances',
 'allowed',
 'allowing',
 'alluded',
 'alterations',
 'altered',
 'altering',
 'amended',
 'amounted',
 'amusements',
 'ankles',
 'annamaria',
 'annexed',
 'announced',
 'announcing',
 'annuities',
 'annum',
 'answered',
 'answering',
 'answers',
 'anticipated',
 'anticipating',
 'anticipations',
 'anymore',
 'apartments',
 'apologies',
 'apologising',
 'apologized',
 'appearances',
 'appeared',
 'appearing',
 'appeased',
 'appetites',
 'applauded',
 'applying',
 'appointed',
 'apprehended',
 'apprehensions',
 'approached',
 'approved',
 'arbour',
 'ardour',
 'arguments',
 'arranged',
 'arrangements',
 'arranging',
 'arrived',
 'arrives',
 'arriving',
 'ascended',
 'ascertained',
 'asked',
 'asking',
 'assembled',
 'assemblies',
 'asserted',
 'assertions',
 'assiduities',
 'assisted',
 'assisting',
 'associating',
 'assurances',
 'astonished',
 'atoned',
 'atoning',
 'attaching',
 'attachments',
 'attacked',
 'attacks',
 'attained',
 'attempted',
 'attempting',
 'attempts',
 'attendants',
 'attended',
 'attending',
 'attentions',
 'attracted',
 'attractions',
 'attributed',
 'attributing',
 'auditors',
 'augmenting',
 'austen',
 'authorised',
 'authors',
 'availed',
 'avignon',
 'avoided',
 'avoiding',
 'awaited',
 'awakened',
 'awaking',
 'bags',
 'balls',
 'banished',
 'barouches',
 'bathed',
 'bears',
 'beasts',
 'beauties',
 'became',
 'bedrooms',
 'beds',
 'befallen',
 'befalls',
 'befell',
 'began',
 'begged',
 'begins',
 'behaved',
 'beings',
 'believed',
 'believes',
 'belonged',
 'belongs',
 'benefited',
 'bequeathed',
 'berkeley',
 'bestowed',
 'betrayed',
 'betraying',
 'biased',
 'blackest',
 'blameable',
 'blessings',
 'blights',
 'blossoms',
 'blundered',
 'blushed',
 'blushes',
 'bolder',
 'bones',
 'bonomi',
 'books',
 'booksellers',
 'borrowed',
 'bottoms',
 'boys',
 'breakfasting',
 'bribing',
 'brightened',
 'brighter',
 'bringing',
 'brings',
 'broader',
 'brothers',
 'bruised',
 'buildings',
 'bursts',
 'buying',
 'called',
 'calls',
 'calming',
 'candles',
 'candour',
 'canvassing',
 'cards',
 'cares',
 'caresses',
 'careys',
 'carriages',
 'carries',
 'cases',
 'casts',
 'cats',
 'caused',
 'ceased',
 'ceasing',
 'censured',
 'centre',
 'certainties',
 'chagrined',
 'chairs',
 'chambers',
 'chanced',
 'changed',
 'changes',
 'changing',
 'characters',
 'charged',
 'charmed',
 'charms',
 'cheated',
 'checking',
 'cheeks',
 'cheerfuller',
 'cherished',
 'cherries',
 'children',
 'choked',
 'chuse',
 'chusing',
 'circles',
 'circumstances',
 'civilities',
 'claimed',
 'claiming',
 'claims',
 'clarke',
 'cleared',
 'cleveland',
 'clogged',
 'closing',
 'clouds',
 'coats',
 'collecting',
 'coloured',
 'colouring',
 'combe',
 'comforted',
 'comforts',
 'comings',
 'commanded',
 'commands',
 'commended',
 'comments',
 'commissioned',
 'commonest',
 'communicated',
 'companions',
 'compared',
 'compares',
 'comparisons',
 'complained',
 'complaining',
 'complaints',
 'completed',
 'compliments',
 'comprehended',
 'concealing',
 'concerns',
 'concessions',
 'concluded',
 'conclusions',
 'conditions',
 'conducted',
 'confessed',
 'confidante',
 'conforming',
 'congratulated',
 'congratulating',
 'congratulations',
 'conjectured',
 'conjectures',
 'conjecturing',
 'connections',
 'conquests',
 'consented',
 'consequences',
 'considerations',
 'considers',
 'consisted',
 'consists',
 'consoled',
 'conspired',
 'constantia',
 'consulted',
 'contained',
 'containing',
 'contend',
 'contenting',
 'continuing',
 'contradicted',
 'contrasted',
 'contributed',
 'contributing',
 'contrived',
 'contrives',
 'contriving',
 'controlled',
 'conveniences',
 'conversations',
 'conversed',
 'conversing',
 'conveyed',
 'conveying',
 'copying',
 'cordials',
 'cottages',
 'counsellor',
 'counteracted',
 'couples',
 'courted',
 'courting',
 'courtland',
 'cousins',
 'cowper',
 'cows',
 'coxcombs',
 'cramps',
 'created',
 'creating',
 'creatures',
 'cries',
 'crimsoned',
 'curtsying',
 'cutlets',
 'danced',
 'dances',
 'dared',
 'darker',
 'dartford',
 'dashwood',
 'dashwoods',
 'daughters',
 'davies',
 'dawdled',
 'dawlish',
 'dawned',
 'dearer',
 'dearest',
 'debated',
 'debts',
 'deceived',
 'deciding',
 'decisions',
 'declares',
 'declaring',
 'declining',
 'deemed',
 'deeper',
 'deepest',
 'defects',
 'defended',
 'deficiencies',
 'degrees',
 'delaford',
 'delayed',
 'delays',
 'deliberating',
 'delicacies',
 'delighful',
 'delineated',
 'delivered',
 'demanded',
 'demands',
 'demonstrations',
 'demur',
 'denied',
 'dennison',
 'denoted',
 'denoting',
 'departing',
 'depended',
 'depends',
 'deprived',
 'described',
 'describing',
 'deserts',
 'deserves',
 'designs',
 'desiring',
 'despatch',
 'despatching',
 'despised',
 'despising',
 'destroyed',
 'destroys',
 'detaining',
 'detected',
 'detecting',
 'determining',
 'deterred',
 'detested',
 'devolved',
 'died',
 'dies',
 'differed',
 'differing',
 'difficulties',
 'dimensions',
 'diminished',
 'dined',
 'dinners',
 'directing',
 'directions',
 'disagreements',
 'disappeared',
 'disappointments',
 'disapproved',
 'disapproves',
 'disapproving',
 'discarded',
 'discharged',
 'disclaiming',
 'disclosing',
 'discontents',
 'discovering',
 'discussions',
 'disgraced',
 'disinherited',
 'disliked',
 'dismissed',
 'dismounted',
 'dispatched',
 'dispatches',
 'dispersing',
 'disposing',
 'disputes',
 'disqualifications',
 'disregarded',
 'dissembling',
 'dissented',
 'distresses',
 'distrusts',
 'diverted',
 'doatingly',
 'donavan',
 'doomed',
 'dooming',
 'doors',
 'dorsetshire',
 'doubted',
 'doubts',
 'douceur',
 'downs',
 'dr',
 'drains',
 'drawings',
 'draws',
 'dreaded',
 'dreading',
 'dreaming',
 'dresses',
 'drives',
 'dropped',
 'drops',
 'drury',
 'duets',
 'duties',
 'earlier',
 'earliest',
 'earned',
 'ears',
 'echoed',
 'editions',
 'edtions',
 'effected',
 'effecting',
 'effusions',
 'ellison',
 'ellisons',
 'eloping',
 'eluded',
 'embellishments',
 'embraced',
 'embraces',
 'employments',
 'enabled',
 'enamoured',
 'encouraged',
 'encouragements',
 'encroachments',
 'encumbered',
 'endeavoring',
 'endeavors',
 'endeavour',
 'endeavoured',
 'endeavouring',
 'endeavours',
 'endowed',
 'ends',
 'endured',
 'enfeebled',
 'enforcing',
 'engagements',
 'england',
 'enjoyed',
 'enjoyments',
 'enquired',
 'enquiries',
 'enquiring',
 'ensued',
 'ensured',
 'entered',
 'entertained',
 'entitled',
 'entreated',
 'entreaties',
 'entrusted',
 'equalled',
 'equals',
 'erred',
 'errors',
 'escaped',
 'esq',
 'establishing',
 'esteemed',
 'esteeming',
 'esteems',
 'estimating',
 'estranged',
 'evenings',
 'events',
 'evils',
 'examined',
 'exceeded',
 'excellencies',
 'exchanged',
 'exclaimed',
 'exclamations',
 'excused',
 'excuses',
 'exercised',
 'exercising',
 'exerted',
 'exertions',
 'exeter',
 'exhilarated',
 'existed',
 'expectations',
 'expected',
 'expecting',
 'expects',
 'expenses',
 'experiencing',
 'explained',
 'explanations',
 'expressing',
 'expressions',
 'extolling',
 'extorted',
 'extorting',
 'extremest',
 'eyeing',
 'eyes',
 'faces',
 'facts',
 'failed',
 'falls',
 'familiarized',
 'families',
 'fancying',
 'fates',
 'fatigued',
 'fatigues',
 'faults',
 'favour',
 'favourable',
 'favourite',
 'favourites',
 'fearing',
 'fears',
 'features',
 'feelings',
 'feels',
 'feet',
 'felicitations',
 'females',
 'ferrars',
 'fetches',
 'fettered',
 'finds',
 'finest',
 'fingers',
 'flattered',
 'flatteries',
 'flowed',
 'fluctuating',
 'flushed',
 'foibles',
 'followed',
 'follows',
 'fond',
 'footsteps',
 'forebodings',
 'foreplanned',
 'foresaw',
 'foreseeing',
 'foreseen',
 'forfeited',
 'forfeiting',
 'forgave',
 'forgiven',
 'forms',
 'forsaking',
 'fortunes',
 'forwarded',
 'foundations',
 'founded',
 'fowls',
 'friendliest',
 'friends',
 'frightens',
 'froid',
 'frosts',
 'fulfil',
 'fulfilled',
 'fullest',
 'gained',
 'gales',
 'gardens',
 'garrets',
 'gates',
 'gathered',
 'generations',
 'gentlemen',
 'gigs',
 'gilberts',
 'girls',
 'gives',
 'glances',
 'gloried',
 'gloves',
 'godby',
 'goings',
 'goodby',
 'governed',
 'gowns',
 'graces',
 'grandmothers',
 'granted',
 'greatest',
 'grieves',
 'grows',
 'guardians',
 'guessed',
 'guests',
 'guided',
 'guineas',
 'habits',
 'hallooing',
 'hands',
 'handsomer',
 'handsomest',
 'hang',
 'hanover',
 'happened',
 'happens',
 'hardened',
 'hardships',
 'harley',
 'has',
 'hastened',
 'hastening',
 'hated',
 'hates',
 'hating',
 'having',
 'hazarded',
 'hazarding',
 'heads',
 'heard',
 'hears',
 'heightened',
 'heightening',
 'heights',
 'heirs',
 'held',
 'hens',
 'henshawe',
 'hesitated',
 'hiding',
 'hills',
 'hinted',
 'hints',
 'hoarded',
 'holborn',
 'holburn',
 'holds',
 'holidays',
 'homes',
 'honeysuckles',
 'honiton',
 'honour',
 'honourable',
 'honourably',
 'honoured',
 'honours',
 'hopes',
 'hoping',
 'horrors',
 'horses',
 'hours',
 'houses',
 'howsever',
 'humbled',
 'humiliations',
 'humored',
 'humoured',
 'humouring',
 'hunted',
 'hunters',
 'hunts',
 'hurrying',
 'husbands',
 'huswifes',
 'ideas',
 'idled',
 'idolized',
 'ii',
 'imaginations',
 'imagined',
 'imagining',
 'imbibed',
 'immoveable',
 'imparted',
 'imperfections',
 'implied',
 'implies',
 'impoverished',
 'impoverishing',
 'improved',
 'improvements',
 'imputed',
 'inclinations',
 'inclined',
 'inclosing',
 'including',
 'incommoded',
 'inconveniences',
 'increased',
 'incurred',
 'incurring',
 'indulged',
 'infants',
 'inflicted',
 'inflicting',
 'influenced',
 'inforce',
 'inforced',
 'informing',
 'inhabitants',
 'inhabiting',
 'inheriting',
 'injuries',
 'inquired',
 'inquiries',
 'insinuations',
 'insisted',
 'installed',
 'instigated',
 'instructions',
 'insulted',
 'intends',
 'intentions',
 'intents',
 'interests',
 'interposed',
 'interspersed',
 'intervals',
 'interviews',
 'intimated',
 'introduced',
 'introducing',
 'intruded',
 'invented',
 'inventing',
 'invitations',
 'invited',
 'irritated',
 'irritates',
 'issued',
 'jealousies',
 'jenning',
 'jennings',
 'jewels',
 'jilting',
 'joined',
 'joked',
 'jokes',
 'joking',
 'joys',
 'judged',
 'judging',
 'judgments',
 'jumbled',
 'justified',
 'keeps',
 'keys',
 'kicked',
 'kinder',
 'kindest',
 'kingham',
 'kissed',
 'kisses',
 'knees',
 'knives',
 'knows',
 'laboured',
 'lamentations',
 'lamps',
 'lanes',
 'languages',
 'larger',
 'largest',
 'lasted',
 'laughed',
 'laughs',
 'leagued',
 'legacies',
 'lengthened',
 'lengths',
 'lessened',
 'lessening',
 'letters',
 'letting',
 'lies',
 'lifted',
 'lightened',
 'liked',
 'likes',
 'limbs',
 'limits',
 'lines',
 'lingered',
 'lingering',
 'lips',
 'listened',
 'lives',
 'livings',
 'll',
 'lodges',
 'loitered',
 'lombardy',
 'london',
 'longed',
 'longest',
 'longstaple',
 'looked',
 'looks',
 'loved',
 'lovers',
 'loves',
 'lowered',
 'lurking',
 'magna',
 'maids',
 'maintained',
 'makes',
 'mama',
 'managed',
 'marlborough',
 'marriages',
 'marries',
 'matters',
 'maxims',
 'meadows',
 'meals',
 'means',
 'meantime',
 'measures',
 'medicines',
 'meditated',
 'meditations',
 'meetings',
 'mentioned',
 'mentioning',
 'merest',
 'merits',
 'merrier',
 'messages',
 'middleton',
 'middletons',
 'militated',
 'minds',
 'minutes',
 'misapplied',
 'misinformed',
 'missed',
 'misses',
 'mistakes',
 'mixing',
 'modestest',
 'mohrs',
 'moments',
 'months',
 'mosquitoes',
 'mothers',
 'motives',
 'moved',
 'murmurings',
 'muttered',
 'nabobs',
 'named',
 'names',
 'natured',
 'nearer',
 'needed',
 'neglected',
 'neighbour',
 'neighbourhood',
 'neighbouring',
 'neighbourly',
 'neighbours',
 'nerves',
 'nests',
 'nettles',
 'newer',
 'newspapers',
 'nicest',
 'nieces',
 'nipped',
 'nodded',
 'nods',
 'noisier',
 'notes',
 'noticed',
 'noticing',
 'notions',
 'nt',
 'nurses',
 'obeyed',
 'objected',
 'objections',
 'objects',
 'obligations',
 'observations',
 'observed',
 'obstacles',
 'obstructed',
 'obtained',
 'obtaining',
 'obviated',
 'obviating',
 'occasioned',
 'occasions',
 'occupations',
 'occupied',
 'occurred',
 'oddest',
 'offence',
 'offences',
 'offending',
 'offered',
 'offices',
 'oftener',
 'oftenest',
 'oldest',
 'olives',
 'omitted',
 'ones',
 'opened',
 'opinions',
 'opportunities',
 'ordained',
 'orders',
 'originated',
 'ornamented',
 'ornaments',
 'others',
 'outdone',
 ...]

In [55]:
unusual_words(nltk.corpus.nps_chat.words())


Out[55]:
['aaaaaaaaaaaaaaaaa',
 'aaahhhh',
 'abortions',
 'abou',
 'abourted',
 'abs',
 'ack',
 'acros',
 'actualy',
 'adams',
 'adds',
 'adduser',
 'adjusts',
 'adoted',
 'adreniline',
 'ads',
 'adults',
 'afe',
 'affairs',
 'affari',
 'affects',
 'afk',
 'agaibn',
 'ages',
 'aggravated',
 'agurlwithbigguns',
 'ahah',
 'ahahah',
 'ahahh',
 'ahahha',
 'ahh',
 'ahhah',
 'ahhahahaha',
 'ahhh',
 'ahhhh',
 'ahhhhhh',
 'ahhhhhhhhhhhhhh',
 'aiiiiiiiiiiiiiiiiiiiiiiii',
 'aiken',
 'aime',
 'akdt',
 'akon',
 'akron',
 'akst',
 'aligator',
 'allergies',
 'allo',
 'allowed',
 'aloha',
 'alohaaa',
 'alohas',
 'alot',
 'alotta',
 'alternatives',
 'alterz',
 'alwys',
 'alzheimers',
 'amazingness',
 'americans',
 'anithing',
 'ans',
 'answering',
 'answers',
 'antidepressants',
 'anygirl',
 'anymore',
 'anythin',
 'anytime',
 'anyyyyyyyyyyyyyyyyy',
 'aok',
 'apoligize',
 'appearently',
 'appears',
 'applaudes',
 'appleton',
 'appologise',
 'appologize',
 'aqwesome',
 'arggghhh',
 'argh',
 'armtnpeat',
 'arrested',
 'arrived',
 'arround',
 'asked',
 'askin',
 'asking',
 'asks',
 'asl',
 'asnwer',
 'asses',
 'asshole',
 'assholes',
 'asss',
 'assumes',
 'aterry',
 'atl',
 'attempted',
 'attracted',
 'aussies',
 'awesomee',
 'awesomes',
 'awrighty',
 'awsome',
 'aww',
 'awww',
 'awwww',
 'awwwwww',
 'awwwwwww',
 'awwwwwwwwww',
 'aynawy',
 'az',
 'azerbaijan',
 'baaaaalllllllliiiiiiinnnnnnnnnnn',
 'babay',
 'babblein',
 'babes',
 'babies',
 'babiess',
 'babycakeses',
 'bachelorette',
 'backatchya',
 'backfrontsidewaysandallaroundtheworld',
 'backroom',
 'bacl',
 'bagels',
 'bahahahaa',
 'bak',
 'balad',
 'balck',
 'ballin',
 'balls',
 'bandito',
 'bandsaw',
 'banjoes',
 'banned',
 'baord',
 'barbie',
 'barbieee',
 'bares',
 'barfights',
 'barks',
 'bbbbbyyyyyyyeeeeeeeee',
 'bbiam',
 'bbl',
 'bbs',
 'bc',
 'beachhhh',
 'beams',
 'beans',
 'bears',
 'beatles',
 'beats',
 'beattles',
 'beckley',
 'beeeeehave',
 'beeehave',
 'bein',
 'beleive',
 'belongings',
 'benz',
 'beuty',
 'bf',
 'bi',
 'biatch',
 'biebsa',
 'bied',
 'bigest',
 'biiiatch',
 'biiiiiitch',
 'bikes',
 'bio',
 'biographys',
 'birdgang',
 'birfday',
 'bishes',
 'bitches',
 'bitdh',
 'bites',
 'biyatch',
 'bj',
 'blankie',
 'blazed',
 'blech',
 'blessings',
 'blew',
 'bloe',
 'blondes',
 'bloooooooood',
 'bloooooooooood',
 'bloooooooooooood',
 'blowjob',
 'bodies',
 'boed',
 'boght',
 'boi',
 'boing',
 'boinked',
 'bones',
 'boning',
 'booboo',
 'boobs',
 'books',
 'boooooooooooglyyyyyy',
 'bootay',
 'booyah',
 'borat',
 'bored',
 'bothering',
 'bounced',
 'bouncers',
 'bouts',
 'boyfriend',
 'boys',
 'boyz',
 'brady',
 'brakes',
 'brb',
 'brbbb',
 'breaks',
 'brightened',
 'brings',
 'bro',
 'brooklyn',
 'brothers',
 'brrrrrrr',
 'bruises',
 'brwn',
 'btw',
 'bucks',
 'buddyyyyyy',
 'bugs',
 'buh',
 'builds',
 'bulls',
 'bumber',
 'bumped',
 'burger',
 'burito',
 'burns',
 'burpin',
 'burps',
 'burried',
 'burryed',
 'buses',
 'buying',
 'bwahahahahahahahahahaha',
 'bwhaha',
 'byb',
 'byeee',
 'byeeee',
 'byeeeeeeee',
 'byeeeeeeeeeeeee',
 'byes',
 'caan',
 'caint',
 'caiuse',
 'cakes',
 'cali',
 'called',
 'callifornia',
 'calls',
 'cams',
 'canadaian',
 'canadain',
 'canehda',
 'caps',
 'cardinals',
 'cardnials',
 'cards',
 'cares',
 'cars',
 'casts',
 'catches',
 'categories',
 'catterick',
 'caused',
 'cdt',
 'cepn',
 'cereals',
 'chamillionaire',
 'changed',
 'changing',
 'chanop',
 'chanowner',
 'chathide',
 'chatland',
 'chatr',
 'chatroom',
 'chats',
 'chatt',
 'chatted',
 'chattin',
 'chcken',
 'cheaking',
 'checkin',
 'checking',
 'checks',
 'cheeeez',
 'cheers',
 'cheking',
 'chews',
 'chica',
 'chickens',
 'chics',
 'chik',
 'children',
 'chineese',
 'chingy',
 'choc',
 'chocha',
 'chococake',
 'choices',
 'chokes',
 'chops',
 'chp',
 'chuckles',
 'chunks',
 'churches',
 'ciao',
 'ciara',
 'cigars',
 'ciggareets',
 'ck',
 'claws',
 'cleared',
 'cleveland',
 'clicked',
 'clients',
 'clinicals',
 'clooney',
 'closes',
 'clubs',
 'cmon',
 'cnnecticut',
 'co',
 'coconuts',
 'coem',
 'coffe',
 'coggieeee',
 'com',
 'combo',
 'comenting',
 'comin',
 'commanded',
 'comments',
 'comon',
 'comp',
 'complains',
 'completly',
 'compliments',
 'compoud',
 'comprende',
 'comps',
 'computers',
 'computor',
 'concernin',
 'confessed',
 'confusing',
 'confusting',
 'congrat',
 'congrats',
 'connecticutt',
 'constituents',
 'contast',
 'contemplating',
 'controll',
 'controllers',
 'controllin',
 'controlling',
 'convo',
 'conway',
 'cookie',
 'cookies',
 'coolcat',
 'coonarsee',
 'cooooooooookiiiiiiiiiiiieeeeeeeeeeee',
 'copone',
 'cops',
 'costumes',
 'cottons',
 'coudl',
 'coughed',
 'coughs',
 'cougs',
 'couldnt',
 'counts',
 'coupons',
 'courst',
 'covers',
 'coworkers',
 'cpr',
 'cramps',
 'crashed',
 'creme',
 'cries',
 'cripos',
 'crosses',
 'csi',
 'cst',
 'ct',
 'ctrl',
 'cuddlicious',
 'cuffed',
 'cums',
 'cuppers',
 'curls',
 'currious',
 'cus',
 'cusion',
 'cutes',
 'cuz',
 'cya',
 'cyas',
 'cyber',
 'czeching',
 'daamn',
 'dahlin',
 'damnit',
 'dances',
 'danes',
 'dangit',
 'darlin',
 'darlings',
 'darwin',
 'dat',
 'dated',
 'daughters',
 'daveeee',
 'davis',
 'dawg',
 'dawnstar',
 'dayum',
 'dd',
 'deals',
 'deaths',
 'decades',
 'deceived',
 'deciding',
 'declaw',
 'declawed',
 'deeper',
 'definately',
 'definitley',
 'defragging',
 'defrags',
 'degrees',
 'deleware',
 'delivers',
 'dem',
 'democrats',
 'denver',
 'denzel',
 'deop',
 'depeche',
 'depends',
 'descriminate',
 'despie',
 'despises',
 'detroit',
 'didnts',
 'died',
 'dies',
 'diggin',
 'dik',
 'dipset',
 'dirrrrty',
 'diseast',
 'disocvered',
 'dissing',
 'divorced',
 'dj',
 'dl',
 'dman',
 'docs',
 'doggies',
 'doin',
 'dojn',
 'dokey',
 'dokken',
 'dollars',
 'dolls',
 'donno',
 'dontcha',
 'donuts',
 'dood',
 'doody',
 'doors',
 'dork',
 'dotn',
 'doublewide',
 'douchebag',
 'doupt',
 'downloaded',
 'downloading',
 'downnnnnn',
 'downs',
 'dr',
 'drags',
 'dratts',
 'dreaded',
 'dreammm',
 'dreams',
 'drinks',
 'driveby',
 'drivers',
 'drivin',
 'drools',
 'dropped',
 'drops',
 'drugs',
 'dryer',
 'dsklgjsdk',
 'du',
 'duh',
 'dumbass',
 'dummmm',
 'dunkin',
 'dunno',
 'dvd',
 'dya',
 'dyed',
 'dyslexic',
 'earplugs',
 'earrings',
 'ears',
 'eay',
 'ebay',
 'edgewood',
 'edmonton',
 'eeeeeeeeewwwwwwww',
 'eeeek',
 'eeek',
 'eeekk',
 'eeewww',
 'eeewwwwww',
 'eeww',
 'eggs',
 'ehh',
 'eitther',
 'elbows',
 'elected',
 'elections',
 'elev',
 'ello',
 'elo',
 'email',
 'eng',
 'england',
 'enjoys',
 'enters',
 'entitled',
 'enuf',
 'enuff',
 'ep',
 'erics',
 'erm',
 'eroticaust',
 'errrrr',
 'escaped',
 'est',
 'este',
 'estefan',
 'estra',
 'eticket',
 'euphamisms',
 'evah',
 'everbody',
 'everone',
 'evertonr',
 'everytime',
 'ewedding',
 'eww',
 'ewww',
 'ewwww',
 'ewwwww',
 'ewwwwww',
 'ewwwwwww',
 'exchanged',
 'exchanging',
 'excitin',
 'excuuuuuuse',
 'experimenting',
 'extensions',
 'extras',
 'exwife',
 'eyes',
 'facilitated',
 'fails',
 'fairbanks',
 'fallout',
 'falls',
 'fallz',
 'farms',
 'fart',
 'farting',
 'farts',
 'fav',
 'fawk',
 'fawked',
 'fawker',
 'fck',
 'febe',
 'feeds',
 'feelings',
 'feels',
 'feet',
 'fella',
 'fellows',
 'females',
 'femine',
 'fer',
 'fergalicious',
 'fergie',
 'fetterline',
 'fettish',
 'ff',
 'fiddles',
 'files',
 'fillin',
 'finds',
 'finers',
 'fingers',
 'firs',
 'fishercat',
 'fishercats',
 'fishers',
 'fishin',
 'fits',
 'fkajslf',
 'fl',
 'flames',
 'flashed',
 'flattered',
 'flatts',
 'flavors',
 'flippin',
 'flirts',
 'fliuds',
 'flops',
 'fluids',
 'fock',
 'foley',
 'folks',
 'fongul',
 'foothills',
 'footprints',
 'foreplay',
 'forgets',
 'forwads',
 'foxes',
 'foxwoods',
 'fractured',
 'fragged',
 'fraggle',
 'frags',
 'freaing',
 'freaked',
 'freakin',
 'freaking',
 'freaks',
 'freeeezinggg',
 'freeezinggggg',
 'freesbee',
 'freind',
 'frenchkiss',
 'friends',
 'fries',
 'frm',
 'froogle',
 'frst',
 'frustrating',
 'ft',
 'fuck',
 'fucker',
 'fuckin',
 'fucking',
 'fucks',
 'fuddahnut',
 'fuked',
 'fulfilling',
 'futurama',
 'fwd',
 'gaaaaaaay',
 'gagas',
 'gags',
 'gained',
 'gals',
 'gamefly',
 'games',
 'garciae',
 'gaspppp',
 'gawd',
 'gayoholic',
 'gays',
 'geeks',
 'gees',
 'geesh',
 'geeshh',
 'geeshhh',
 'geessh',
 'geeze',
 'gente',
 'gentlemen',
 'gets',
 'gettign',
 'gettin',
 'gettysburg',
 'gezzz',
 'gf',
 'gguyyyzzzz',
 'ghet',
 'giggles',
 'girlfriend',
 'girls',
 'giva',
 'gives',
 'givs',
 'glitches',
 'gm',
 'gn',
 'gng',
 'goddamn',
 'goin',
 'goneee',
 'gonna',
 'goodbye',
 'goodie',
 'goodnight',
 'goodnite',
 'google',
 'gooo',
 'goooooo',
 'gorda',
 'gotaa',
 'goths',
 'gotta',
 'gottsa',
 'grabs',
 'gracemont',
 'grea',
 'greetings',
 'gret',
 'grettings',
 'grilfriend',
 'grins',
 'gritt',
 'grlz',
 'groups',
 'grrl',
 'grrr',
 'grrrrrring',
 'grrrrrrrr',
 'grrrrrrrrr',
 'grrrrrrrrrrrrrrrrr',
 'gs',
 'gtg',
 'guns',
 'gurlie',
 'gurls',
 'gurrrrl',
 'guts',
 'guys',
 'guyz',
 'guyzz',
 'haaa',
 'hafta',
 'haha',
 'hahaaa',
 'hahaaaa',
 'hahah',
 'hahaha',
 'hahahaa',
 'hahahah',
 'hahahaha',
 'hahahahaaa',
 'hahahahahaha',
 'hahahahahahaha',
 'hahahahahahahahahahahahahahahaha',
 'hahahhahah',
 'hahhaa',
 'hahhahahaha',
 'hairs',
 'halfa',
 'hallo',
 'handheld',
 'handing',
 'hands',
 'handyman',
 'hang',
 'hangin',
 'hangs',
 'happend',
 'happened',
 'happens',
 'happpy',
 'harley',
 'hartford',
 'has',
 'haters',
 'hates',
 'hav',
 'havin',
 'having',
 'hawaii',
 'hawt',
 'hb',
 'headach',
 'headlights',
 'heads',
 'heard',
 'hearin',
 'hearthechatters',
 'heee',
 'heeee',
 'heeeey',
 'heeheeheeheeheehee',
 'heh',
 'heheh',
 'hehehe',
 'hehehee',
 'hehehehe',
 'hel',
 'helloooo',
 'hellos',
 'helped',
 'hendrix',
 'heroes',
 'hertory',
 'hes',
 'heya',
 'heyheyhey',
 'heys',
 'heyy',
 'heyyy',
 'heyyyy',
 'heyyyyy',
 'heyyyyyy',
 'heyyyyyyy',
 'heyyyyyyyy',
 'heyyyyyyyyy',
 'heyyyyyyyyyy',
 'heyyyyyyyyyyyyyy',
 'hfglhs',
 'hgey',
 'hgfhgfjgf',
 'hhaaaaatttee',
 'hheeyy',
 'hheeyyy',
 'hheeyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyy',
 'hhehe',
 'hiding',
 'hih',
 'hii',
 'hiii',
 'hiiii',
 'hio',
 'hiom',
 'hissy',
 'hits',
 'hitting',
 'hix',
 'hiy',
 'hiya',
 'hiyas',
 'hm',
 'hmm',
 'hmmm',
 'hmmmm',
 'hmmmmm',
 'hmmmmmmm',
 'hmmmmmmmm',
 'hmmmmmmmmmm',
 'hmph',
 'hogs',
 'hohohohhohhoo',
 'hola',
 'holdin',
 'holds',
 'holocaustyourmom',
 'hom',
 'homeade',
 'homeboys',
 'homes',
 'homies',
 'homophobic',
 'hoo',
 'hooo',
 'hooooo',
 'hopin',
 'hoping',
 'horace',
 'horriable',
 'horrified',
 'hots',
 'hott',
 'hottie',
 'hotties',
 'hours',
 'houses',
 'hows',
 'howz',
 'hpa',
 'hr',
 'hrs',
 'http',
 'hubbys',
 'hugggs',
 'huggs',
 'huggss',
 'hugs',
 'hugss',
 'hugsss',
 'hugsssss',
 'hugsssssssss',
 'hugzzzzzzz',
 'humm',
 'hummmm',
 'humple',
 'hunters',
 'huskers',
 'husteling',
 'huuuuuuuuuuuuuuuuuuuuuuuuuuuuuuuuuuugz',
 'hx',
 'hyy',
 'iamahotnip',
 'iamahotniplickme',
 'iamahotnipwithhotnippics',
 'iamahotnipwithpics',
 'icky',
 'ico',
 'idiots',
 'idnt',
 'ifr',
 'ignored',
 'ignoring',
 'ihavehotnips',
 'ii',
 'il',
 'illin',
 'im',
 'images',
 'imhave',
 'imma',
 'immersed',
 'impaired',
 'impared',
 'impressed',
 'indiantown',
 'infor',
 'inhales',
 'injureis',
 'innit',
 'innocense',
 'insuklting',
 'internet',
 'intersting',
 'invisiable',
 'involves',
 'irc',
 'irl',
 'isnt',
 'issues',
 'italians',
 'italy',
 'itches',
 'items',
 'itz',
 'ive',
 'iz',
 'ja',
 'jackie',
 'jacking',
 'jajajaaa',
 'jammed',
 'jammers',
 'jammies',
 'jayse',
 'jerimiah',
 'jerkettes',
 'jerketts',
 'jerks',
 'jeter',
 'jk',
 'johnson',
 'johny',
 'joined',
 'jokes',
 'joking',
 'jonesboro',
 'jordison',
 'joshy',
 'jr',
 'jrz',
 'jto',
 'jucilicious',
 'judges',
 'jujubees',
 'jumpin',
 'jumps',
 ...]

Corpus of stopwords (high-frequency words like the, to, also that we often want to filter out)


In [56]:
from nltk.corpus import stopwords
stopwords.words('english')


Out[56]:
['i',
 'me',
 'my',
 'myself',
 'we',
 'our',
 'ours',
 'ourselves',
 'you',
 'your',
 'yours',
 'yourself',
 'yourselves',
 'he',
 'him',
 'his',
 'himself',
 'she',
 'her',
 'hers',
 'herself',
 'it',
 'its',
 'itself',
 'they',
 'them',
 'their',
 'theirs',
 'themselves',
 'what',
 'which',
 'who',
 'whom',
 'this',
 'that',
 'these',
 'those',
 'am',
 'is',
 'are',
 'was',
 'were',
 'be',
 'been',
 'being',
 'have',
 'has',
 'had',
 'having',
 'do',
 'does',
 'did',
 'doing',
 'a',
 'an',
 'the',
 'and',
 'but',
 'if',
 'or',
 'because',
 'as',
 'until',
 'while',
 'of',
 'at',
 'by',
 'for',
 'with',
 'about',
 'against',
 'between',
 'into',
 'through',
 'during',
 'before',
 'after',
 'above',
 'below',
 'to',
 'from',
 'up',
 'down',
 'in',
 'out',
 'on',
 'off',
 'over',
 'under',
 'again',
 'further',
 'then',
 'once',
 'here',
 'there',
 'when',
 'where',
 'why',
 'how',
 'all',
 'any',
 'both',
 'each',
 'few',
 'more',
 'most',
 'other',
 'some',
 'such',
 'no',
 'nor',
 'not',
 'only',
 'own',
 'same',
 'so',
 'than',
 'too',
 'very',
 's',
 't',
 'can',
 'will',
 'just',
 'don',
 'should',
 'now']

What fraction of words in a text are not in the stopwords list?


In [57]:
def content_fraction(text):
    
    stopwords = nltk.corpus.stopwords.words('english')
    
    content = [w for w in text if w.lower() not in stopwords] # filtering out all stopwords
    
    return len(content)/len(text)

# this is in the example, but something is wrong with this corpus + Python 3.5.1...
# content_fraction(nltk.corpus.reuters.words())

content_fraction(nltk.corpus.gutenberg.words())


Out[57]:
0.5795676173409271

Wordlist and word puzzle solving -- How many words of 4 letters or more can you make from the characters [E, G, I, V, R, V, O, N, L]?

Rules:

  • No plurals ending in "s"
  • No foreign words
  • No proper names

use FreqDist to generate frequencies for each character in the "word" egivrvonl


In [58]:
puzzle_letters = nltk.FreqDist('egivrvonl')
puzzle_letters


Out[58]:
Counter({'e': 1, 'g': 1, 'i': 1, 'l': 1, 'n': 1, 'o': 1, 'r': 1, 'v': 2})

In [59]:
obligatory = 'r'
wordlist = nltk.corpus.words.words() # Words Corpus
[w for w in wordlist if len(w) >= 6
                     and obligatory in w
                     and nltk.FreqDist(w) <= puzzle_letters]


Out[59]:
['glover',
 'gorlin',
 'govern',
 'grovel',
 'ignore',
 'involver',
 'lienor',
 'linger',
 'longer',
 'lovering',
 'noiler',
 'overling',
 'region',
 'renvoi',
 'revolving',
 'ringle',
 'roving',
 'violer',
 'virole']

comparing FreqDist objects...

lets us check if the frequency of each letter in the given word is <= the frequency of the letter in the puzzle (puzzle_letters)


In [60]:
puzzle_letters = nltk.FreqDist('egivrvonl')

nltk.FreqDist('glover') < puzzle_letters


Out[60]:
True

In [61]:
nltk.FreqDist('egivrvon') <= puzzle_letters


Out[61]:
True

In [62]:
nltk.FreqDist('egivrvonl') <= puzzle_letters


Out[62]:
True

In [63]:
nltk.FreqDist('egivrvonll') <= puzzle_letters


Out[63]:
False

Names corpus -- contains 8,000 first names categorized by gender

(male / female names stored in separate files)

Find names which appear in both files


In [64]:
names = nltk.corpus.names
names.fileids()


Out[64]:
['female.txt', 'male.txt']

In [65]:
male_names = names.words('male.txt')
female_names = names.words('female.txt')
[w for w in male_names if w in female_names]


Out[65]:
['Abbey',
 'Abbie',
 'Abby',
 'Addie',
 'Adrian',
 'Adrien',
 'Ajay',
 'Alex',
 'Alexis',
 'Alfie',
 'Ali',
 'Alix',
 'Allie',
 'Allyn',
 'Andie',
 'Andrea',
 'Andy',
 'Angel',
 'Angie',
 'Ariel',
 'Ashley',
 'Aubrey',
 'Augustine',
 'Austin',
 'Averil',
 'Barrie',
 'Barry',
 'Beau',
 'Bennie',
 'Benny',
 'Bernie',
 'Bert',
 'Bertie',
 'Bill',
 'Billie',
 'Billy',
 'Blair',
 'Blake',
 'Bo',
 'Bobbie',
 'Bobby',
 'Brandy',
 'Brett',
 'Britt',
 'Brook',
 'Brooke',
 'Brooks',
 'Bryn',
 'Cal',
 'Cam',
 'Cammy',
 'Carey',
 'Carlie',
 'Carlin',
 'Carmine',
 'Carroll',
 'Cary',
 'Caryl',
 'Casey',
 'Cass',
 'Cat',
 'Cecil',
 'Chad',
 'Chris',
 'Chrissy',
 'Christian',
 'Christie',
 'Christy',
 'Clair',
 'Claire',
 'Clare',
 'Claude',
 'Clem',
 'Clemmie',
 'Cody',
 'Connie',
 'Constantine',
 'Corey',
 'Corrie',
 'Cory',
 'Courtney',
 'Cris',
 'Daffy',
 'Dale',
 'Dallas',
 'Dana',
 'Dani',
 'Daniel',
 'Dannie',
 'Danny',
 'Darby',
 'Darcy',
 'Darryl',
 'Daryl',
 'Deane',
 'Del',
 'Dell',
 'Demetris',
 'Dennie',
 'Denny',
 'Devin',
 'Devon',
 'Dion',
 'Dionis',
 'Dominique',
 'Donnie',
 'Donny',
 'Dorian',
 'Dory',
 'Drew',
 'Eddie',
 'Eddy',
 'Edie',
 'Elisha',
 'Emmy',
 'Erin',
 'Esme',
 'Evelyn',
 'Felice',
 'Fran',
 'Francis',
 'Frank',
 'Frankie',
 'Franky',
 'Fred',
 'Freddie',
 'Freddy',
 'Gabriel',
 'Gabriell',
 'Gail',
 'Gale',
 'Gay',
 'Gayle',
 'Gene',
 'George',
 'Georgia',
 'Georgie',
 'Geri',
 'Germaine',
 'Gerri',
 'Gerry',
 'Gill',
 'Ginger',
 'Glen',
 'Glenn',
 'Grace',
 'Gretchen',
 'Gus',
 'Haleigh',
 'Haley',
 'Hannibal',
 'Harley',
 'Hazel',
 'Heath',
 'Henrie',
 'Hilary',
 'Hillary',
 'Holly',
 'Ike',
 'Ikey',
 'Ira',
 'Isa',
 'Isador',
 'Isadore',
 'Jackie',
 'Jaime',
 'Jamie',
 'Jan',
 'Jean',
 'Jere',
 'Jermaine',
 'Jerrie',
 'Jerry',
 'Jess',
 'Jesse',
 'Jessie',
 'Jo',
 'Jodi',
 'Jodie',
 'Jody',
 'Joey',
 'Jordan',
 'Juanita',
 'Jude',
 'Judith',
 'Judy',
 'Julie',
 'Justin',
 'Karel',
 'Kellen',
 'Kelley',
 'Kelly',
 'Kelsey',
 'Kerry',
 'Kim',
 'Kip',
 'Kirby',
 'Kit',
 'Kris',
 'Kyle',
 'Lane',
 'Lanny',
 'Lauren',
 'Laurie',
 'Lee',
 'Leigh',
 'Leland',
 'Lesley',
 'Leslie',
 'Lin',
 'Lind',
 'Lindsay',
 'Lindsey',
 'Lindy',
 'Lonnie',
 'Loren',
 'Lorne',
 'Lorrie',
 'Lou',
 'Luce',
 'Lyn',
 'Lynn',
 'Maddie',
 'Maddy',
 'Marietta',
 'Marion',
 'Marlo',
 'Martie',
 'Marty',
 'Mattie',
 'Matty',
 'Maurise',
 'Max',
 'Maxie',
 'Mead',
 'Meade',
 'Mel',
 'Meredith',
 'Merle',
 'Merrill',
 'Merry',
 'Meryl',
 'Michal',
 'Michel',
 'Michele',
 'Mickie',
 'Micky',
 'Millicent',
 'Morgan',
 'Morlee',
 'Muffin',
 'Nat',
 'Nichole',
 'Nickie',
 'Nicky',
 'Niki',
 'Nikki',
 'Noel',
 'Ollie',
 'Page',
 'Paige',
 'Pat',
 'Patrice',
 'Patsy',
 'Pattie',
 'Patty',
 'Pen',
 'Pennie',
 'Penny',
 'Perry',
 'Phil',
 'Pooh',
 'Quentin',
 'Quinn',
 'Randi',
 'Randie',
 'Randy',
 'Ray',
 'Regan',
 'Reggie',
 'Rene',
 'Rey',
 'Ricki',
 'Rickie',
 'Ricky',
 'Rikki',
 'Robbie',
 'Robin',
 'Ronnie',
 'Ronny',
 'Rory',
 'Ruby',
 'Sal',
 'Sam',
 'Sammy',
 'Sandy',
 'Sascha',
 'Sasha',
 'Saundra',
 'Sayre',
 'Scotty',
 'Sean',
 'Shaine',
 'Shane',
 'Shannon',
 'Shaun',
 'Shawn',
 'Shay',
 'Shayne',
 'Shea',
 'Shelby',
 'Shell',
 'Shelley',
 'Sibyl',
 'Simone',
 'Sonnie',
 'Sonny',
 'Stacy',
 'Sunny',
 'Sydney',
 'Tabbie',
 'Tabby',
 'Tallie',
 'Tally',
 'Tammie',
 'Tammy',
 'Tate',
 'Ted',
 'Teddie',
 'Teddy',
 'Terri',
 'Terry',
 'Theo',
 'Tim',
 'Timmie',
 'Timmy',
 'Tobe',
 'Tobie',
 'Toby',
 'Tommie',
 'Tommy',
 'Tony',
 'Torey',
 'Trace',
 'Tracey',
 'Tracie',
 'Tracy',
 'Val',
 'Vale',
 'Valentine',
 'Van',
 'Vin',
 'Vinnie',
 'Vinny',
 'Virgie',
 'Wallie',
 'Wallis',
 'Wally',
 'Whitney',
 'Willi',
 'Willie',
 'Willy',
 'Winnie',
 'Winny',
 'Wynn']

names ending in 'a' are almost always female


In [66]:
cfd = nltk.ConditionalFreqDist((fileid, name[-1])
                               for fileid in names.fileids()
                               for name in names.words(fileid))
plot = plt.figure(figsize=(18,10))
cfd.plot()


A Pronouncing Dictionary

CMU Pronouncing Dictionary -- contains words followed by some properties of the word


In [67]:
entries = nltk.corpus.cmudict.entries()
len(entries)


Out[67]:
133737

In [68]:
for entry in entries[42371:42379]:
    print(entry)


('fir', ['F', 'ER1'])
('fire', ['F', 'AY1', 'ER0'])
('fire', ['F', 'AY1', 'R'])
('firearm', ['F', 'AY1', 'ER0', 'AA2', 'R', 'M'])
('firearm', ['F', 'AY1', 'R', 'AA2', 'R', 'M'])
('firearms', ['F', 'AY1', 'ER0', 'AA2', 'R', 'M', 'Z'])
('firearms', ['F', 'AY1', 'R', 'AA2', 'R', 'M', 'Z'])
('fireball', ['F', 'AY1', 'ER0', 'B', 'AO2', 'L'])

Lexicon above provides a list of phonetic codes (distinct labels for each contrastive sound -- phones) for each word.


Look for entries whose pronunciation consists of 3 phones, and print those with the first and third equal to 'P' and 'T', respectively


In [69]:
for word, pron in entries:
    if len(pron) == 3:
        ph1, ph2, ph3 = pron
        if ph1 == 'P' and ph3 == 'T':
            print(word, ph2, end=' ')


pait EY1 pat AE1 pate EY1 patt AE1 peart ER1 peat IY1 peet IY1 peete IY1 pert ER1 pet EH1 pete IY1 pett EH1 piet IY1 piette IY1 pit IH1 pitt IH1 pot AA1 pote OW1 pott AA1 pout AW1 puett UW1 purt ER1 put UH1 putt AH1 

Find all words whose pronunciation ends with a syllable sounding like nicks (similar for but in a list comprehension)


In [70]:
syllable = ['N', 'IH0', 'K', 'S']
[word for word, pron in entries if pron[-4:] == syllable]


Out[70]:
["atlantic's",
 'audiotronics',
 'avionics',
 'beatniks',
 'calisthenics',
 'centronics',
 'chamonix',
 'chetniks',
 "clinic's",
 'clinics',
 'conics',
 'conics',
 'cryogenics',
 'cynics',
 'diasonics',
 "dominic's",
 'ebonics',
 'electronics',
 "electronics'",
 "endotronics'",
 'endotronics',
 'enix',
 'environics',
 'ethnics',
 'eugenics',
 'fibronics',
 'flextronics',
 'harmonics',
 'hispanics',
 'histrionics',
 'identics',
 'ionics',
 'kibbutzniks',
 'lasersonics',
 'lumonics',
 'mannix',
 'mechanics',
 "mechanics'",
 'microelectronics',
 'minix',
 'minnix',
 'mnemonics',
 'mnemonics',
 'molonicks',
 'mullenix',
 'mullenix',
 'mullinix',
 'mulnix',
 "munich's",
 'nucleonics',
 'onyx',
 'organics',
 "panic's",
 'panics',
 'penix',
 'pennix',
 'personics',
 'phenix',
 "philharmonic's",
 'phoenix',
 'phonics',
 'photronics',
 'pinnix',
 'plantronics',
 'pyrotechnics',
 'refuseniks',
 "resnick's",
 'respironics',
 'sconnix',
 'siliconix',
 'skolniks',
 'sonics',
 'sputniks',
 'technics',
 'tectonics',
 'tektronix',
 'telectronics',
 'telephonics',
 'tonics',
 'unix',
 "vinick's",
 "vinnick's",
 'vitronics']

words ending in n but with last syllable pronounced like m


In [71]:
[w for w, pron in entries if pron[-1] == 'M' and w[-1] == 'n']


Out[71]:
['autumn', 'column', 'condemn', 'damn', 'goddamn', 'hymn', 'solemn']

nonredundant/sorted list of first two letters of words that have a first syllable pronounced like N but a first letter that is not n


In [72]:
sorted(set([w[:2] for w, pron in entries if pron[0] == 'N' and w[0] != 'n']))


Out[72]:
['gn', 'kn', 'mn', 'pn']

Phones contain digits to represent primary stress (1), secondary stress (2), and no stress (0).


In [73]:
def stress(pron):
    return [char 
            for phone in pron   
            for char in phone 
            if char.isdigit()]

[w for w, pron in entries if stress(pron) == ['0', '1', '0', '2', '0']]


Out[73]:
['abbreviated',
 'abbreviated',
 'abbreviating',
 'accelerated',
 'accelerating',
 'accelerator',
 'accelerators',
 'accentuated',
 'accentuating',
 'accommodated',
 'accommodating',
 'accommodative',
 'accumulated',
 'accumulating',
 'accumulative',
 'accumulator',
 'accumulators',
 'accusatory',
 'adenovirus',
 'adjudicated',
 'adjudicating',
 'administrating',
 'administrative',
 'administrator',
 "administrators'",
 "administrator's",
 'administrators',
 'adulterated',
 'adventurism',
 'adventurism',
 'affiliated',
 'affiliated',
 "affiliated's",
 'affiliating',
 'alleviated',
 'alleviated',
 'alleviating',
 'alliteration',
 'alliterative',
 'amalgamated',
 "amalgamated's",
 'amalgamating',
 'ameliorated',
 'ameridata',
 'amoxicillin',
 'anachronism',
 'anachronisms',
 'annihilated',
 'annihilating',
 'antagonism',
 'antagonisms',
 'antagonizing',
 'anticipated',
 'anticipated',
 'anticipating',
 'apologizes',
 'apologizing',
 'apothecary',
 'appreciated',
 'appreciating',
 'appreciative',
 'appropriated',
 'appropriating',
 'appropriator',
 'appropriators',
 'approximated',
 'approximating',
 'articulated',
 'articulating',
 'asphyxiated',
 'asphyxiating',
 'assassinated',
 'assassinating',
 'assemblywoman',
 'assimilated',
 'assimilating',
 'associated',
 'associated',
 'associating',
 'astigmatism',
 'attenuated',
 'authenticated',
 'authenticating',
 'authoritative',
 'bicentenary',
 'bilingualism',
 'biosciences',
 'buchananism',
 'cadiddlehopper',
 'capitulated',
 'catholicism',
 'celebratory',
 'coagulating',
 'cogenerator',
 'cogenerators',
 'collaborated',
 'collaborated',
 'collaborating',
 'collaborative',
 'collaborator',
 'collaborators',
 'collectivism',
 'commemorated',
 'commemorating',
 'commemorative',
 'commercialism',
 'commercializing',
 'communicated',
 'communicating',
 'communicator',
 'compensatory',
 'computerizing',
 'computervision',
 'concatenated',
 'concatenating',
 'concessionary',
 'conciliator',
 "conciliator's",
 'conciliatory',
 'confectionaries',
 'confectionary',
 'confectionery',
 'confirmatory',
 'confiscatory',
 'confucianism',
 'congratulated',
 'congratulating',
 'conservatism',
 'conservatories',
 'consolidated',
 "consolidated's",
 'consolidating',
 'consolidator',
 'consolidators',
 'constabulary',
 'construcciones',
 'consumerism',
 'contaminated',
 'contaminated',
 'contaminating',
 'contemporaries',
 'contemporary',
 'contributory',
 'cooperated',
 'cooperating',
 'cooperative',
 'coordinating',
 'coordinator',
 'coordinators',
 'corroborated',
 'corroborating',
 'creationism',
 'debilitated',
 'debilitating',
 'decaffeinated',
 'decaffeinating',
 'decapitated',
 'decelerated',
 'decelerating',
 'decentralizing',
 'decisionmaker',
 'decisionmaking',
 'declaratory',
 'deemphasizing',
 'defamatory',
 'defibrillator',
 'defibrillators',
 'deflationary',
 'degenerated',
 'degenerating',
 'dehumanizing',
 'deliberated',
 'deliberating',
 'deliberative',
 'delineated',
 'delineating',
 'demobilizes',
 'demobilizing',
 'democratizes',
 'democratizing',
 'demoralizing',
 'denominated',
 'denominator',
 'depilatory',
 'depositary',
 'depositary',
 'depository',
 'depreciated',
 'depreciating',
 'depressurizes',
 'depressurizing',
 'deregulating',
 'derogatory',
 'desegregated',
 'desensitizing',
 'destabilizing',
 'determinism',
 'devaluated',
 'diastrophism',
 'dilapidated',
 'discretionary',
 'discriminated',
 'discriminated',
 'discriminating',
 'disintegrated',
 'disintegrating',
 'disoriented',
 'disorienting',
 'disqualifying',
 'disseminated',
 'disseminating',
 'diversifying',
 'diversifying',
 'diversionary',
 'diversionary',
 'domesticated',
 'domesticating',
 'economizes',
 'economizes',
 'economizing',
 'economizing',
 'elaborating',
 'electrifying',
 'electrocuted',
 'electroplating',
 'eliminated',
 'eliminated',
 'eliminating',
 'elucidated',
 'elucidative',
 'emaciated',
 'emaciating',
 'emancipated',
 'emancipating',
 'emasculated',
 'empiricism',
 'emulsifier',
 'encapsulated',
 'encapsulating',
 'enthusiasm',
 'enthusiasms',
 'enumerated',
 'enunciated',
 'enunciating',
 'epistolary',
 'epitomizes',
 'equivocating',
 'eradicated',
 'eradicating',
 'eroticism',
 'evacuated',
 'evacuated',
 'evacuating',
 'evacuating',
 'evaluated',
 'evaluated',
 'evaluating',
 'evaluating',
 'evangelism',
 'evangelism',
 'evaporated',
 'evaporated',
 'evaporated',
 'evaporated',
 'evaporating',
 'evaporating',
 'evaporator',
 'evaporator',
 'eviscerated',
 'exacerbated',
 'exacerbated',
 'exacerbating',
 'exaggerated',
 'exaggerated',
 'exaggerating',
 'exasperated',
 'exasperating',
 'exclusionary',
 'excoriated',
 'excoriating',
 'excoriation',
 'excruciating',
 'exemplifying',
 'exhilarated',
 'exhilarating',
 'exonerated',
 'exonerating',
 'expansionary',
 'expansionary',
 'expansionism',
 'expansionism',
 'experimenter',
 'experimenters',
 'experimenting',
 'expiratory',
 'explanatory',
 'exploratory',
 'exploravision',
 'expressionism',
 'expropriated',
 'extenuating',
 'exterminated',
 'exterminating',
 'exterminator',
 'exterminators',
 'extraordinary',
 'extrapolated',
 'extrapolating',
 'facilitated',
 'facilitating',
 'facilitator',
 "facilitator's",
 'facilitators',
 'fanaticism',
 'fiduciares',
 'fiduciaries',
 'fiduciary',
 'ganatieuganauf',
 'geotropism',
 'hereditary',
 'humidifier',
 'humidifiers',
 'humiliated',
 'humiliating',
 'hydrogenated',
 'identifier',
 'identifier',
 'identifying',
 'identifying',
 'illuminated',
 'illuminating',
 'illuminator',
 'illusionary',
 'illusionism',
 'imaginary',
 'immeasurable',
 'immeasurably',
 'immobilizing',
 'impersonated',
 'impersonating',
 'impersonators',
 'impressionism',
 'inaccuracies',
 'inactivated',
 'inaugurated',
 'inaugurated',
 'inaugurating',
 'incantatory',
 'incarcerated',
 'incarcerating',
 'incinerated',
 'incinerating',
 'incineration',
 'incinerator',
 'incinerators',
 'inconsistencies',
 'incorporated',
 'incorporated',
 "incorporated's",
 'incorporating',
 'incriminating',
 'indisputably',
 'indoctrinated',
 'inebriated',
 'inebriating',
 'infatuated',
 'infatuating',
 'inflammatory',
 'inflationary',
 'infuriated',
 'infuriated',
 'infuriating',
 'ingratiating',
 'inhibitory',
 'initiated',
 'initiated',
 'initiating',
 'innoculated',
 'innoculating',
 'inoculated',
 'instantiated',
 'instantiating',
 'intensifying',
 'interrogated',
 'interrogating',
 'intimidated',
 'intimidating',
 'intoxicated',
 'intoxicated',
 'intoxicating',
 'invalidated',
 'invalidated',
 'invalidating',
 'investigated',
 'investigated',
 'investigating',
 'investigative',
 'investigator',
 "investigator's",
 'investigators',
 "investigators'",
 'invigorated',
 'invigorating',
 'involuntary',
 'irradiated',
 'itineraries',
 'itinerary',
 'judiciary',
 'legitimizes',
 'legitimizing',
 'manipulated',
 'manipulating',
 'manipulative',
 'manipulator',
 'manipulators',
 'mercantilism',
 'metabolism',
 'metabolisms',
 'misallocated',
 'misallocating',
 'miscalculated',
 'misrecognizes',
 'misrecognizing',
 'monasticism',
 'monopolizes',
 'monopolizing',
 'necessitated',
 'necessitating',
 'negotiated',
 'negotiated',
 'negotiating',
 'negotiator',
 'negotiator',
 "negotiators'",
 "negotiator's",
 'negotiators',
 'nonmilitary',
 'nonregulated',
 'obituaries',
 'obituary',
 'obligatory',
 'obliterated',
 'obliterating',
 'observatories',
 'observatory',
 "observatory's",
 'obstructionism',
 'officiated',
 'officiating',
 'opinionated',
 'originated',
 'originated',
 'originating',
 'originator',
 'originators',
 'participated',
 'participated',
 'participating',
 'paternalism',
 'pecuniary',
 'perfectionism',
 'perpetuated',
 'perpetuating',
 'personifying',
 'pituitary',
 'pituitary',
 'politicizing',
 'pontificated',
 'pontificater',
 'pontificaters',
 'pontificating',
 'precipitated',
 'precipitating',
 'predominated',
 'predominating',
 'prefabricated',
 'prekindergarten',
 'preliminaries',
 'preliminaries',
 'preliminary',
 'preliminary',
 'premeditated',
 'preparatory',
 'prioritizes',
 'prioritizing',
 'probationary',
 'procrastinated',
 'procrastinating',
 'procrastinator',
 'procrastinators',
 'prohibitory',
 'proliferated',
 'proliferating',
 'proprietaries',
 'proprietary',
 'protectionism',
 'protectionism',
 'provincialism',
 'reactionaries',
 'reactionary',
 'reallocating',
 'reanalyses',
 'reanalysing',
 'reauthorizing',
 'recalculated',
 'recalculating',
 'recessionary',
 'recidivism',
 'reciprocated',
 'reciprocating',
 'reclassifying',
 'reconstituted',
 'reconstituting',
 'recuperated',
 'recuperater',
 'recuperating',
 'recuperating',
 'redecorated',
 'redecorating',
 'reformatories',
 'reformatory',
 'reformulated',
 'refrigerated',
 'refrigerator',
 'refrigerator',
 'refrigerators',
 'regenerated',
 'regenerating',
 'reinstituting',
 'reintegrated',
 'reintegration',
 'reiterated',
 'reiterating',
 'rejuvenated',
 'rejuvenating',
 'renominated',
 'reorganizes',
 'reorganizing',
 'repatriated',
 'repatriating',
 'repositories',
 'repository',
 'repudiated',
 'repudiating',
 'resuscitated',
 'resuscitating',
 'retaliated',
 'retaliated',
 'retaliating',
 'retaliatory',
 'revelatory',
 'reverberated',
 'reverberated',
 'reverberated',
 'reverberating',
 'reverberating',
 'revisionism',
 'revitalizing',
 'romanticism',
 'romanticizing',
 'securitizing',
 'solidifying',
 'sophisticated',
 'sophisticated',
 'subordinated',
 'subordinating',
 'subsidiaries',
 "subsidiaries'",
 'subsidiary',
 "subsidiary's",
 'substantiated',
 'substantiated',
 'surrealism',
 "surrealism's",
 'surrealisms',
 'unallocated',
 'uncompensated',
 'uncomplicated',
 'uneducated',
 'unmitigated',
 'unnecessary',
 'unprecedented',
 'unregulated',
 'unsanitary',
 'unsatisfying',
 'unsaturated',
 'velociraptor',
 'vocabulary',
 'voluntarism']

In [74]:
[w for w, pron in entries if stress(pron) == ['0', '2', '0', '1', '0']]


Out[74]:
['abbreviation',
 'abbreviations',
 'abomination',
 'abortifacient',
 'abortifacients',
 'academicians',
 'accommodation',
 'accommodations',
 'accreditation',
 'accreditations',
 'accumulation',
 'accumulations',
 'acetylcholine',
 'acetylcholine',
 'adjudication',
 'administration',
 "administration's",
 'administrations',
 "administrations'",
 'aduliadae',
 'adulteration',
 'affiliation',
 'affiliations',
 'aficionados',
 'agglomeration',
 'ahasuerus',
 'ajinomoto',
 'alleviation',
 'amalgamation',
 'ambrosiano',
 'amelioration',
 'americana',
 "americana's",
 'americanas',
 'americano',
 'americanos',
 'anachronistic',
 'anencephalic',
 'annihilation',
 'antagonistic',
 'anticipation',
 'anticipations',
 'apocalyptic',
 'apologetic',
 'apotheosis',
 'appreciation',
 'appropriation',
 'appropriations',
 'approximation',
 'approximations',
 'aristocratic',
 'arunachalam',
 'assassination',
 'assassinations',
 'assimilation',
 'association',
 'association',
 'associations',
 "association's",
 'associations',
 'authentication',
 'authentications',
 'bancoklahoma',
 'biagioni',
 'boguslavskaya',
 'bollapragada',
 'bollettieri',
 'burkina-faso',
 "burkina-faso's",
 'canariensis',
 'capitulation',
 'cartusciello',
 'catacosinos',
 'cohabitation',
 'coincidental',
 'collaboration',
 'commemoration',
 'commemorations',
 'communication',
 'communications',
 "communications'",
 "communication's",
 'concatenation',
 'conciliation',
 'confabulation',
 'confederation',
 'configuration',
 'configurations',
 'conglomeration',
 'congratulation',
 'congratulations',
 'conigliaro',
 'consideration',
 'considerations',
 'consolidation',
 'consolidations',
 'contamination',
 'continuation',
 'controladora',
 'cooperation',
 'cooperations',
 'coordination',
 'corroboration',
 'decaffeination',
 'decapitation',
 'decapitations',
 'deceleration',
 'deforestation',
 'degeneration',
 'deliberation',
 'deliberations',
 'delineation',
 'denomination',
 "denomination's",
 'denominations',
 'denunciation',
 'denunciations',
 'depopulation',
 'depreciation',
 'depreciations',
 'deregulation',
 'desalination',
 'desegregation',
 'determination',
 "determination's",
 'determinations',
 'deterministic',
 'detoxication',
 'devaluation',
 'developmental',
 'developmental',
 'diguglielmo',
 'dilophosaurus',
 'discoloration',
 'discolorations',
 'discrimination',
 'disembarkation',
 'disinformation',
 'disintegration',
 'disintegration',
 'dissemination',
 'dissociation',
 'domeniconi',
 'domestication',
 'dominicana',
 'dominicana',
 'ecclesiastic',
 'econometric',
 'econometrics',
 'economico',
 'edizione',
 'ejaculation',
 'elaboration',
 'electioneering',
 'electrocution',
 'electrocutions',
 'electrolytic',
 'electromagnet',
 'electromagnets',
 'electromedics',
 'electrostatic',
 'elimination',
 'eliminations',
 'emancipation',
 'emancipations',
 'embarcadero',
 'encephalitis',
 'encyclopedic',
 'encyclopedic',
 'encyclopedist',
 'encyclopedist',
 'enthusiastic',
 'enumeration',
 'environmental',
 'environmental',
 'episcopalian',
 'episcopalians',
 'eradication',
 'erythropoietin',
 'evacuation',
 'evacuations',
 'evaluation',
 'evaluation',
 'evaluations',
 'evaluations',
 'evaporation',
 'evaporation',
 'exacerbation',
 'exacerbations',
 'exaggeration',
 'exaggerations',
 'examination',
 'examinations',
 'exfoliation',
 'exhilaration',
 'expatriation',
 'experiential',
 'experimental',
 'experimental',
 'expressionistic',
 'extermination',
 'extrapolation',
 'extravaganza',
 'extravaganzas',
 'facilitation',
 'famiglietti',
 'fedeccredito',
 'financiera',
 'financiero',
 'garagiola',
 'giovanniello',
 'hallucination',
 'hallucinations',
 'hermaphroditic',
 'humiliation',
 'iacobelli',
 'iacobellis',
 'iacovelli',
 'iannaccone',
 'iannacone',
 'iannamico',
 'ianniello',
 'iavarone',
 'idealistic',
 'illumination',
 'imagination',
 'imaginations',
 'imagineering',
 'impressionistic',
 'inactivation',
 'inauguration',
 'inaugurations',
 'incarceration',
 'inconsequential',
 'incorporation',
 "incorporation's",
 'incorporations',
 'incrimination',
 'indemnifying',
 'indoctrination',
 'industriali',
 'infatuation',
 'inhabitation',
 'initiation',
 'innoculation',
 'inoculation',
 'inoculations',
 'insemination',
 'insinuation',
 'insinuations',
 'instantiation',
 'interpretation',
 'interpretations',
 'interrogation',
 'interrogations',
 'intimidation',
 'intoxication',
 'invalidation',
 'investigation',
 'investigations',
 'investimento',
 'ionospheric',
 'irimajiri',
 'irradiation',
 'italiana',
 'kartasasmita',
 'kilimanjaro',
 'kolodziejski',
 'korzeniewski',
 'kumaratunga',
 'kumarisami',
 'lagomarsino',
 'louisiana',
 "louisiana's",
 'manipulation',
 'manipulations',
 'maquiladoras',
 'marielito',
 'marielitos',
 'mazowiecki',
 'mcalexander',
 'mediobanca',
 'mericantante',
 'mezhdumarodnom',
 'misallocation',
 'misapplication',
 'misapprehension',
 'misdiagnoses',
 'misdiagnoses',
 'misdiagnosis',
 'misrecognition',
 'misrepresenting',
 'misrepresenting',
 'modigliani',
 'modigliani',
 'modiliani',
 'monongahela',
 'monopolistic',
 'monticciolo',
 'napoleonic',
 'navratilova',
 "navratilova's",
 'negotiation',
 'negotiations',
 'negotiations',
 'nomenklatura',
 'nonacademic',
 'noncontroversial',
 'nongovernmental',
 'obliteration',
 'okoniewski',
 'origination',
 'originations',
 'participation',
 'participations',
 'paternalistic',
 'perpetuation',
 'pontification',
 'pontifications',
 'popieluszko',
 'precipitation',
 'premeditation',
 'preoccupation',
 'preoccupations',
 'prevarication',
 'procrastination',
 'prognostication',
 'prognostications',
 'pronunciation',
 'pronunciation',
 'pronunciations',
 'pronunciations',
 'prudentialbache',
 'psychosomatic',
 'reallocation',
 'recalculation',
 'recrimination',
 'recriminations',
 'recuperation',
 'rededication',
 'redefinition',
 'redeposition',
 'reengineering',
 'refrigeration',
 'reiteration',
 'rejuvenation',
 'religione',
 'remediation',
 'remuneration',
 'renunciation',
 'repatriation',
 'repudiation',
 'resuscitation',
 'retaliation',
 'reverberation',
 'reverberations',
 'ricigliano',
 'rovaniemi',
 'sebastiana',
 'sebastiani',
 'sebastianis',
 'solicitation',
 'solicitations',
 'sophistication',
 'subordination',
 'substantiation',
 'substantiation',
 'sundararajan',
 'surrealistic',
 'tabacalera',
 'ticonderoga',
 "ticonderoga's",
 'triangulation',
 'tuberculosis',
 'tuberculosis',
 'tuberculosis',
 'uenohara',
 'undiplomatic',
 'uneconomic',
 'unpatriotic',
 'unrepresented',
 'unscientific',
 'unsentimental',
 'unsympathetic',
 'wakabayashi',
 'yekaterinburg']

Finding minimally-contrasting sets of words -- find all the p-words consisting of three sounds, and group them according to their first and last sounds


In [75]:
p3 = [(pron[0] + '-' + pron[2], word)
      for (word, pron) in entries
      if pron[0] == 'P' and len(pron) == 3]

cfd = nltk.ConditionalFreqDist(p3)
cfd


Out[75]:
defaultdict(nltk.probability.FreqDist,
            {'P-AA1': Counter({'pla': 1}),
             'P-AH0': Counter({'pera': 1, 'pia': 1}),
             'P-AW1': Counter({'plough': 1,
                      'plow': 1,
                      'prough': 1,
                      'prow': 1}),
             'P-AY1': Counter({'ply': 1, 'pri': 1, 'pry': 1}),
             'P-B': Counter({'pub': 1}),
             'P-CH': Counter({'patch': 1,
                      'pautsch': 1,
                      'peach': 1,
                      'perch': 1,
                      'petsch': 1,
                      'petsche': 1,
                      'piche': 1,
                      'piech': 1,
                      'pietsch': 1,
                      'pitch': 1,
                      'pitsch': 1,
                      'poach': 1,
                      'poche': 1,
                      'pooch': 1,
                      'pouch': 1,
                      'puche': 1,
                      'putsch': 1}),
             'P-D': Counter({'pad': 1,
                      'paid': 1,
                      'pawed': 1,
                      'peed': 1,
                      'pied': 1,
                      'pod': 1,
                      'poohed': 1}),
             'P-ER0': Counter({'payer': 1,
                      'poer': 1,
                      'power': 1,
                      'poyer': 1,
                      'pyre': 1}),
             'P-ER1': Counter({'payeur': 1}),
             'P-EY1': Counter({'play': 1, 'pray': 1, 'prey': 1}),
             'P-F': Counter({'paff': 1, 'poff': 1, 'poof': 1, 'puff': 1}),
             'P-G': Counter({'peg': 1,
                      'pegg': 1,
                      'pig': 1,
                      'pigg': 1,
                      'pigue': 1,
                      'poag': 1,
                      'pog': 1,
                      'pogue': 1,
                      'pug': 1,
                      'puig': 1}),
             'P-IY0': Counter({'pai': 1}),
             'P-IY1': Counter({'plea': 1,
                      'pre': 1,
                      'pree': 1,
                      'pri': 1,
                      'prix': 1}),
             'P-JH': Counter({'page': 1, 'paige': 1, 'podge': 1, 'purge': 1}),
             'P-K': Counter({'pac': 1,
                      'pack': 1,
                      'paek': 1,
                      'paik': 1,
                      'pak': 1,
                      'pake': 1,
                      'paque': 1,
                      'peak': 1,
                      'peake': 1,
                      'pech': 1,
                      'peck': 1,
                      'peek': 1,
                      'perc': 1,
                      'perk': 1,
                      'pic': 1,
                      'pick': 1,
                      'pik': 1,
                      'pike': 1,
                      'pique': 1,
                      'poch': 1,
                      'pock': 1,
                      'poke': 1,
                      'polk': 1,
                      'puck': 1,
                      'purk': 1,
                      'pyke': 1}),
             'P-L': Counter({'pahl': 1,
                      'pail': 1,
                      'paille': 1,
                      'pal': 1,
                      'pale': 1,
                      'pall': 2,
                      'paul': 1,
                      'paule': 1,
                      'paull': 1,
                      'peal': 1,
                      'peale': 1,
                      'pearl': 1,
                      'pearle': 1,
                      'peel': 1,
                      'peele': 1,
                      'pehl': 1,
                      'peil': 1,
                      'pell': 1,
                      'pelle': 1,
                      'perl': 1,
                      'perle': 1,
                      'piehl': 1,
                      'piel': 1,
                      'pihl': 1,
                      'pil': 1,
                      'pile': 1,
                      'pill': 1,
                      'pille': 1,
                      'poehl': 1,
                      'pohl': 1,
                      'pol': 1,
                      'pole': 1,
                      'poll': 1,
                      'pool': 1,
                      'poole': 1,
                      'poul': 1,
                      'puhl': 1,
                      'pull': 1,
                      'pyle': 1}),
             'P-M': Counter({'palm': 1,
                      'palme': 1,
                      'pam': 1,
                      'pimm': 1,
                      'pom': 1,
                      'pymm': 1}),
             'P-N': Counter({'paign': 1,
                      'pain': 1,
                      'paine': 1,
                      'pan': 1,
                      'pane': 1,
                      'pawn': 1,
                      'payne': 1,
                      'peine': 1,
                      'pen': 1,
                      'penh': 1,
                      'penn': 1,
                      'pin': 1,
                      'pine': 1,
                      'pinn': 1,
                      'pon': 1,
                      'poon': 1,
                      'pun': 1,
                      'pyne': 1}),
             'P-NG': Counter({'pang': 1,
                      'peng': 1,
                      'ping': 1,
                      'pong': 1,
                      'pung': 1}),
             'P-OW0': Counter({'pero': 1, 'pio': 1}),
             'P-OW1': Counter({'perot': 1, 'plough': 1, 'pro': 1}),
             'P-OY1': Counter({'ploy': 1}),
             'P-P': Counter({'paap': 1,
                      'paape': 1,
                      'pap': 1,
                      'pape': 1,
                      'papp': 1,
                      'paup': 1,
                      'peep': 1,
                      'pep': 1,
                      'pip': 1,
                      'pipe': 1,
                      'pipp': 1,
                      'poop': 1,
                      'pop': 1,
                      'pope': 1,
                      'popp': 1,
                      'poppe': 1,
                      'pup': 1}),
             'P-R': Counter({'paar': 1,
                      'pair': 1,
                      'par': 1,
                      'pare': 1,
                      'parr': 1,
                      'pear': 1,
                      'peer': 1,
                      'pier': 1,
                      'poor': 1,
                      'poore': 1,
                      'por': 1,
                      'pore': 1,
                      'porr': 1,
                      'pour': 1}),
             'P-S': Counter({'pace': 1,
                      'pass': 1,
                      'pasts': 1,
                      'peace': 1,
                      'pearse': 1,
                      'pease': 1,
                      'perce': 1,
                      'pers': 1,
                      'perse': 1,
                      'pesce': 1,
                      'piece': 1,
                      'piss': 1,
                      'pos': 1,
                      'poss': 1,
                      'posts': 1,
                      'purse': 1,
                      'pus': 1,
                      'puss': 2}),
             'P-SH': Counter({'paasch': 1,
                      'pash': 1,
                      'pesch': 1,
                      'pesh': 1,
                      'posch': 1,
                      'posh': 1,
                      'pusch': 1,
                      'push': 1}),
             'P-T': Counter({'pait': 1,
                      'pat': 1,
                      'pate': 1,
                      'patt': 1,
                      'peart': 1,
                      'peat': 1,
                      'peet': 1,
                      'peete': 1,
                      'pert': 1,
                      'pet': 1,
                      'pete': 1,
                      'pett': 1,
                      'piet': 1,
                      'piette': 1,
                      'pit': 1,
                      'pitt': 1,
                      'pot': 1,
                      'pote': 1,
                      'pott': 1,
                      'pout': 1,
                      'puett': 1,
                      'purt': 1,
                      'put': 1,
                      'putt': 1}),
             'P-TH': Counter({'paeth': 1,
                      'path': 1,
                      'pathe': 1,
                      'perth': 1,
                      'peth': 1,
                      'pith': 1,
                      'poth': 1,
                      'puth': 1}),
             'P-UW1': Counter({'peru': 1,
                      'peugh': 1,
                      'pew': 1,
                      'plew': 1,
                      'plue': 1,
                      'prew': 1,
                      'pru': 1,
                      'prue': 1,
                      'prugh': 1,
                      'pshew': 1,
                      'pugh': 1}),
             'P-V': Counter({'pave': 1, 'peeve': 1}),
             'P-Z': Counter({"p's": 1,
                      "p.'s": 1,
                      'p.s': 1,
                      'pais': 1,
                      'paiz': 1,
                      "pao's": 1,
                      'pas': 1,
                      'pause': 1,
                      'paws': 1,
                      'pays': 1,
                      'paz': 1,
                      'peas': 1,
                      'pease': 1,
                      "pei's": 1,
                      'perz': 1,
                      'pez': 1,
                      'pies': 1,
                      "poe's": 1,
                      'poise': 1,
                      'pose': 1,
                      'pows': 1,
                      'purrs': 1})})

In [76]:
cfd.conditions()


Out[76]:
['P-R',
 'P-K',
 'P-AW1',
 'P-CH',
 'P-Z',
 'P-UW1',
 'P-NG',
 'P-ER1',
 'P-G',
 'P-OW0',
 'P-N',
 'P-AY1',
 'P-SH',
 'P-IY0',
 'P-IY1',
 'P-P',
 'P-V',
 'P-EY1',
 'P-JH',
 'P-S',
 'P-M',
 'P-T',
 'P-D',
 'P-AH0',
 'P-ER0',
 'P-TH',
 'P-F',
 'P-L',
 'P-B',
 'P-OW1',
 'P-OY1',
 'P-AA1']

In [77]:
for template in sorted(cfd.conditions()):
    if len(cfd[template]) > 10:
        words = sorted(cfd[template])
        wordstring = ' '.join(words)
        print(template, wordstring[:70] + '...')


P-CH patch pautsch peach perch petsch petsche piche piech pietsch pitch pit...
P-K pac pack paek paik pak pake paque peak peake pech peck peek perc perk ...
P-L pahl pail paille pal pale pall paul paule paull peal peale pearl pearl...
P-N paign pain paine pan pane pawn payne peine pen penh penn pin pine pinn...
P-P paap paape pap pape papp paup peep pep pip pipe pipp poop pop pope pop...
P-R paar pair par pare parr pear peer pier poor poore por pore porr pour...
P-S pace pass pasts peace pearse pease perce pers perse pesce piece piss p...
P-T pait pat pate patt peart peat peet peete pert pet pete pett piet piett...
P-UW1 peru peugh pew plew plue prew pru prue prugh pshew pugh...
P-Z p's p.'s p.s pais paiz pao's pas pause paws pays paz peas pease pei's ...

We can also look up particular words using dictionary syntax...


In [78]:
prondict = nltk.corpus.cmudict.dict()
prondict['fire']


Out[78]:
[['F', 'AY1', 'ER0'], ['F', 'AY1', 'R']]

In [79]:
# prondict['blog'] # KeyError
prondict['blog'] = [['B', 'L', 'AA1', 'G']]
prondict['blog']


Out[79]:
[['B', 'L', 'AA1', 'G']]

text-to-speech to map every word of input text


In [80]:
text = ['natural', 'language', 'processing']
[ph for w in text 
    for ph in prondict[w][0]]


Out[80]:
['N',
 'AE1',
 'CH',
 'ER0',
 'AH0',
 'L',
 'L',
 'AE1',
 'NG',
 'G',
 'W',
 'AH0',
 'JH',
 'P',
 'R',
 'AA1',
 'S',
 'EH0',
 'S',
 'IH0',
 'NG']

Comparative Wordlists

Swadesh wordlists - lists of ~200 common words in several languages (specified by 2-letter codes)


In [81]:
from nltk.corpus import swadesh
swadesh.fileids()


Out[81]:
['be',
 'bg',
 'bs',
 'ca',
 'cs',
 'cu',
 'de',
 'en',
 'es',
 'fr',
 'hr',
 'it',
 'la',
 'mk',
 'nl',
 'pl',
 'pt',
 'ro',
 'ru',
 'sk',
 'sl',
 'sr',
 'sw',
 'uk']

In [82]:
swadesh.words('en')


Out[82]:
['I',
 'you (singular), thou',
 'he',
 'we',
 'you (plural)',
 'they',
 'this',
 'that',
 'here',
 'there',
 'who',
 'what',
 'where',
 'when',
 'how',
 'not',
 'all',
 'many',
 'some',
 'few',
 'other',
 'one',
 'two',
 'three',
 'four',
 'five',
 'big',
 'long',
 'wide',
 'thick',
 'heavy',
 'small',
 'short',
 'narrow',
 'thin',
 'woman',
 'man (adult male)',
 'man (human being)',
 'child',
 'wife',
 'husband',
 'mother',
 'father',
 'animal',
 'fish',
 'bird',
 'dog',
 'louse',
 'snake',
 'worm',
 'tree',
 'forest',
 'stick',
 'fruit',
 'seed',
 'leaf',
 'root',
 'bark (from tree)',
 'flower',
 'grass',
 'rope',
 'skin',
 'meat',
 'blood',
 'bone',
 'fat (noun)',
 'egg',
 'horn',
 'tail',
 'feather',
 'hair',
 'head',
 'ear',
 'eye',
 'nose',
 'mouth',
 'tooth',
 'tongue',
 'fingernail',
 'foot',
 'leg',
 'knee',
 'hand',
 'wing',
 'belly',
 'guts',
 'neck',
 'back',
 'breast',
 'heart',
 'liver',
 'drink',
 'eat',
 'bite',
 'suck',
 'spit',
 'vomit',
 'blow',
 'breathe',
 'laugh',
 'see',
 'hear',
 'know (a fact)',
 'think',
 'smell',
 'fear',
 'sleep',
 'live',
 'die',
 'kill',
 'fight',
 'hunt',
 'hit',
 'cut',
 'split',
 'stab',
 'scratch',
 'dig',
 'swim',
 'fly (verb)',
 'walk',
 'come',
 'lie',
 'sit',
 'stand',
 'turn',
 'fall',
 'give',
 'hold',
 'squeeze',
 'rub',
 'wash',
 'wipe',
 'pull',
 'push',
 'throw',
 'tie',
 'sew',
 'count',
 'say',
 'sing',
 'play',
 'float',
 'flow',
 'freeze',
 'swell',
 'sun',
 'moon',
 'star',
 'water',
 'rain',
 'river',
 'lake',
 'sea',
 'salt',
 'stone',
 'sand',
 'dust',
 'earth',
 'cloud',
 'fog',
 'sky',
 'wind',
 'snow',
 'ice',
 'smoke',
 'fire',
 'ashes',
 'burn',
 'road',
 'mountain',
 'red',
 'green',
 'yellow',
 'white',
 'black',
 'night',
 'day',
 'year',
 'warm',
 'cold',
 'full',
 'new',
 'old',
 'good',
 'bad',
 'rotten',
 'dirty',
 'straight',
 'round',
 'sharp',
 'dull',
 'smooth',
 'wet',
 'dry',
 'correct',
 'near',
 'far',
 'right',
 'left',
 'at',
 'in',
 'with',
 'and',
 'if',
 'because',
 'name']

access cognate words from multiple languages using entries()


In [83]:
fr2en = swadesh.entries(['fr', 'en'])
fr2en


Out[83]:
[('je', 'I'),
 ('tu, vous', 'you (singular), thou'),
 ('il', 'he'),
 ('nous', 'we'),
 ('vous', 'you (plural)'),
 ('ils, elles', 'they'),
 ('ceci', 'this'),
 ('cela', 'that'),
 ('ici', 'here'),
 ('là', 'there'),
 ('qui', 'who'),
 ('quoi', 'what'),
 ('où', 'where'),
 ('quand', 'when'),
 ('comment', 'how'),
 ('ne...pas', 'not'),
 ('tout', 'all'),
 ('plusieurs', 'many'),
 ('quelques', 'some'),
 ('peu', 'few'),
 ('autre', 'other'),
 ('un', 'one'),
 ('deux', 'two'),
 ('trois', 'three'),
 ('quatre', 'four'),
 ('cinq', 'five'),
 ('grand', 'big'),
 ('long', 'long'),
 ('large', 'wide'),
 ('épais', 'thick'),
 ('lourd', 'heavy'),
 ('petit', 'small'),
 ('court', 'short'),
 ('étroit', 'narrow'),
 ('mince', 'thin'),
 ('femme', 'woman'),
 ('homme', 'man (adult male)'),
 ('homme', 'man (human being)'),
 ('enfant', 'child'),
 ('femme, épouse', 'wife'),
 ('mari, époux', 'husband'),
 ('mère', 'mother'),
 ('père', 'father'),
 ('animal', 'animal'),
 ('poisson', 'fish'),
 ('oiseau', 'bird'),
 ('chien', 'dog'),
 ('pou', 'louse'),
 ('serpent', 'snake'),
 ('ver', 'worm'),
 ('arbre', 'tree'),
 ('forêt', 'forest'),
 ('bâton', 'stick'),
 ('fruit', 'fruit'),
 ('graine', 'seed'),
 ('feuille', 'leaf'),
 ('racine', 'root'),
 ('écorce', 'bark (from tree)'),
 ('fleur', 'flower'),
 ('herbe', 'grass'),
 ('corde', 'rope'),
 ('peau', 'skin'),
 ('viande', 'meat'),
 ('sang', 'blood'),
 ('os', 'bone'),
 ('graisse', 'fat (noun)'),
 ('œuf', 'egg'),
 ('corne', 'horn'),
 ('queue', 'tail'),
 ('plume', 'feather'),
 ('cheveu', 'hair'),
 ('tête', 'head'),
 ('oreille', 'ear'),
 ('œil', 'eye'),
 ('nez', 'nose'),
 ('bouche', 'mouth'),
 ('dent', 'tooth'),
 ('langue', 'tongue'),
 ('ongle', 'fingernail'),
 ('pied', 'foot'),
 ('jambe', 'leg'),
 ('genou', 'knee'),
 ('main', 'hand'),
 ('aile', 'wing'),
 ('ventre', 'belly'),
 ('entrailles', 'guts'),
 ('cou', 'neck'),
 ('dos', 'back'),
 ('sein, poitrine', 'breast'),
 ('cœur', 'heart'),
 ('foie', 'liver'),
 ('boire', 'drink'),
 ('manger', 'eat'),
 ('mordre', 'bite'),
 ('sucer', 'suck'),
 ('cracher', 'spit'),
 ('vomir', 'vomit'),
 ('souffler', 'blow'),
 ('respirer', 'breathe'),
 ('rire', 'laugh'),
 ('voir', 'see'),
 ('entendre', 'hear'),
 ('savoir', 'know (a fact)'),
 ('penser', 'think'),
 ('sentir', 'smell'),
 ('craindre, avoir peur', 'fear'),
 ('dormir', 'sleep'),
 ('vivre', 'live'),
 ('mourir', 'die'),
 ('tuer', 'kill'),
 ('se battre', 'fight'),
 ('chasser', 'hunt'),
 ('frapper', 'hit'),
 ('couper', 'cut'),
 ('fendre', 'split'),
 ('poignarder', 'stab'),
 ('gratter', 'scratch'),
 ('creuser', 'dig'),
 ('nager', 'swim'),
 ('voler', 'fly (verb)'),
 ('marcher', 'walk'),
 ('venir', 'come'),
 ("s'étendre", 'lie'),
 ("s'asseoir", 'sit'),
 ('se lever', 'stand'),
 ('tourner', 'turn'),
 ('tomber', 'fall'),
 ('donner', 'give'),
 ('tenir', 'hold'),
 ('serrer', 'squeeze'),
 ('frotter', 'rub'),
 ('laver', 'wash'),
 ('essuyer', 'wipe'),
 ('tirer', 'pull'),
 ('pousser', 'push'),
 ('jeter', 'throw'),
 ('lier', 'tie'),
 ('coudre', 'sew'),
 ('compter', 'count'),
 ('dire', 'say'),
 ('chanter', 'sing'),
 ('jouer', 'play'),
 ('flotter', 'float'),
 ('couler', 'flow'),
 ('geler', 'freeze'),
 ('gonfler', 'swell'),
 ('soleil', 'sun'),
 ('lune', 'moon'),
 ('étoile', 'star'),
 ('eau', 'water'),
 ('pluie', 'rain'),
 ('rivière', 'river'),
 ('lac', 'lake'),
 ('mer', 'sea'),
 ('sel', 'salt'),
 ('pierre', 'stone'),
 ('sable', 'sand'),
 ('poussière', 'dust'),
 ('terre', 'earth'),
 ('nuage', 'cloud'),
 ('brouillard', 'fog'),
 ('ciel', 'sky'),
 ('vent', 'wind'),
 ('neige', 'snow'),
 ('glace', 'ice'),
 ('fumée', 'smoke'),
 ('feu', 'fire'),
 ('cendres', 'ashes'),
 ('brûler', 'burn'),
 ('route', 'road'),
 ('montagne', 'mountain'),
 ('rouge', 'red'),
 ('vert', 'green'),
 ('jaune', 'yellow'),
 ('blanc', 'white'),
 ('noir', 'black'),
 ('nuit', 'night'),
 ('jour', 'day'),
 ('an, année', 'year'),
 ('chaud', 'warm'),
 ('froid', 'cold'),
 ('plein', 'full'),
 ('nouveau', 'new'),
 ('vieux', 'old'),
 ('bon', 'good'),
 ('mauvais', 'bad'),
 ('pourri', 'rotten'),
 ('sale', 'dirty'),
 ('droit', 'straight'),
 ('rond', 'round'),
 ('tranchant, pointu, aigu', 'sharp'),
 ('émoussé', 'dull'),
 ('lisse', 'smooth'),
 ('mouillé', 'wet'),
 ('sec', 'dry'),
 ('juste, correct', 'correct'),
 ('proche', 'near'),
 ('loin', 'far'),
 ('à droite', 'right'),
 ('à gauche', 'left'),
 ('à', 'at'),
 ('dans', 'in'),
 ('avec', 'with'),
 ('et', 'and'),
 ('si', 'if'),
 ('parce que', 'because'),
 ('nom', 'name')]

In [84]:
translate = dict(fr2en)
translate['chien']


Out[84]:
'dog'

In [85]:
translate['jeter']


Out[85]:
'throw'

add other source languages


In [86]:
de2en = swadesh.entries(['de', 'en'])
es2en = swadesh.entries(['es', 'en'])
translate.update(dict(de2en))
translate.update(dict(es2en))

translate['Hund']


Out[86]:
'dog'

In [87]:
translate['perro']


Out[87]:
'dog'

Compare words in various Germanic and Romance languages:


In [88]:
languages = ['en', 'de', 'nl', 'es', 'fr', 'pt', 'la']
for i in [139, 140, 141, 142]:
    print(swadesh.entries(languages)[i])


('say', 'sagen', 'zeggen', 'decir', 'dire', 'dizer', 'dicere')
('sing', 'singen', 'zingen', 'cantar', 'chanter', 'cantar', 'canere')
('play', 'spielen', 'spelen', 'jugar', 'jouer', 'jogar, brincar', 'ludere')
('float', 'schweben', 'zweven', 'flotar', 'flotter', 'flutuar, boiar', 'fluctuare')

Shoebox and Toolbox Lexicons

Toolbox file consists of a collection of entries, where each entry is made up of one+ fields

Dictionary for the Rotokas language

(first entry: kaa means to gag)


In [89]:
from nltk.corpus import toolbox
toolbox.entries('rotokas.dic')


Out[89]:
[('kaa',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'gag'),
   ('tkp', 'nek i pas'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Apoka ira kaaroi aioa-ia reoreopaoro.'),
   ('xp', 'Kaikai i pas long nek bilong Apoka bikos em i kaikai na toktok.'),
   ('xe', 'Apoka is gagging from food while talking.')]),
 ('kaa',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'strangle'),
   ('tkp', 'pasim nek'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '07/Oct/2006'),
   ('ex', 'Rera rauroro rera kaarevoi.'),
   ('xp', 'Em i holim pas em na nekim em.'),
   ('xe', 'He is holding him and strangling him.'),
   ('ex', 'Iroiro-ia oirato okoearo kaaivoi uvare rirovira kaureoparoveira.'),
   ('xp', 'Ol i pasim nek bilong man long rop bikos em i save bikhet tumas.'),
   ('xe',
    "They strangled the man's neck with rope because he was very stubborn and arrogant."),
   ('ex',
    'Oirato okoearo kaaivoi iroiro-ia. Uva viapau uvuiparoi ra vovouparo uva kopiiroi.'),
   ('xp',
    'Ol i pasim nek bilong man long rop. Olsem na em i no pulim win olsem na em i dai.'),
   ('xe',
    "They strangled the man's neck with a rope. And he couldn't breathe and he died.")]),
 ('kaa',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('cl', 'isi'),
   ('ge', 'cooking banana'),
   ('tkp', 'banana bilong kukim'),
   ('pt', 'itoo'),
   ('sf', 'FLORA'),
   ('dt', '12/Aug/2005'),
   ('ex', 'Taeavi iria kaa isi kovopaueva kaparapasia.'),
   ('xp', 'Taeavi i bin planim gaden banana bilong kukim tasol long paia.'),
   ('xe', 'Taeavi planted banana in order to cook it.')]),
 ('kaakaaro',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'mixture'),
   ('tkp', '???'),
   ('eng', 'mixtures'),
   ('eng', 'charm used to keep married men and women youthful and attractive'),
   ('cmt',
    'Check vowel length. Is it kaakaaro or kaakaro? Does lexeme have suffix, -aro or -ro?'),
   ('dt', '20/Nov/2006'),
   ('ex',
    'Kaakaroto ira purapaiveira aue iava opita, voeao-pa airepa oraouirara, ra va aiopaive.'),
   ('xp',
    'Kokonas ol i save wokim long ol kain samting bilong ol nupela marit, bai ol i ken kaikai.'),
   ('xe', 'Mixtures are made from coconut for newlyweds, who eat them.')]),
 ('kaakaaviko',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'type of beetle'),
   ('tkp', '???'),
   ('nt', 'round beetle like Mexican bean beetle'),
   ('dt', '10/Feb/2005'),
   ('sf', 'FAUNA.INSECT'),
   ('ex', 'Kaakaaviko kare oea binara touaveira vara tapo piupaiveira.'),
   ('xp',
    'Kaakaaviko em i wanpela kain insect em i save istap long ol bin or na long kain lip.'),
   ('xe', '???'),
   ('ex', 'Kaakaaviko kare oea raviriro kouro piupaiveira.'),
   ('xp', 'Em i wanpela kain weevil i save bagarapim ol bin.'),
   ('xe', '??? damages up beans.')]),
 ('kaakaavo',
  [('rt', 'kaavo'),
   ('ps', '???'),
   ('rdp', 'partial'),
   ('ge', 'white'),
   ('tkp', 'wait'),
   ('sc', '???'),
   ('cmt', "What's the part of speech?"),
   ('dt', '29/Oct/2005'),
   ('ex',
    'Kaakaaro oa purapaiveira varauraro tokipasia aue iava opita ora vegoara iava oirara iava ora riakova kaakaaro.'),
   ('xp',
    'Ol i save wokim out long kokonas coconut na ol lip na skin blong ol diwai.'),
   ('xe', '???'),
   ('ex', 'Varoa kaakaavopa popotepa ragai varo.'),
   ('xp', 'Em white lap lap blong mi.'),
   ('xe', "That's my white laplap."),
   ('ex', 'Vaoia evaova kaakaavopaova.'),
   ('xp', 'Dispela diwai em i waitpela.'),
   ('xe', 'This tree is white.'),
   ('ex',
    'Rarasoria kaakaavoto ira Amerika iava urioroera vo kovosia rupairara voaro.'),
   ('xp',
    'Rarason em i wait man em i bin kam long Amerika na kam wok long hap bilong ol bilak man.'),
   ('xe', 'Rarason is a white man who came from America ???.')]),
 ('kaakaoko',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'type of beetle'),
   ('tkp', 'binatang'),
   ('sf', 'FAUNA.INSECT'),
   ('cmt', 'Is it kaakaoko or kaakauko?'),
   ('dt', '08/Feb/2005'),
   ('ex', 'Kaakaoko vuri gesito./Kaakauko vurisi gesiva.'),
   ('xp', '???'),
   ('xe', 'Kaakauko em i wanpela binatang.')]),
 ('kaakasi',
  [('rt', '???'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'hot'),
   ('tkp', 'hot'),
   ('vx', '1'),
   ('sc', '???'),
   ('cmt',
    "Vowel length can't possibly be right. Or is the vowel of kaasi long?"),
   ('dt', '29/Oct/2005'),
   ('ex', 'Upiriko pitoka kaakasipai.'),
   ('xp', 'Sospen kaukau em i hot tru.'),
   ('xe', 'The saucepan of sweet potatos is really hot.'),
   ('ex',
    'Kaukau pitoka rirovira rutu kaakasipai uvare riro kasia tuitui kasi oripiro.'),
   ('xp', 'Sospen kaukau em i hot tru bikos em i tan long bikpela paia.'),
   ('xe', '???')]),
 ('kaakau',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'dog'),
   ('tkp', 'dok'),
   ('dt', '17/Jul/2005'),
   ('ex', 'Kaakau voresiurava toupa aue kokoto ora kokopi.'),
   ('xp', 'Dog i gat fopela lek bilong em na em i teleblonge.'),
   ('xe', 'Dogs are four-footed ???.'),
   ('ex', 'Revisa riro kaakau raguito.'),
   ('xp', 'Revisa em i man bilong lukautim dok.'),
   ('xe', 'Revisa is a big dog lover.'),
   ('ex', 'Rake ora Jon kaakau kare ousia avasie.'),
   ('xp', 'Rake wantaim Jon ol i go kisim ol wail dok.'),
   ('xe', 'Rake and John went to get wild dogs.')]),
 ('kaakauko',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'gray weevil'),
   ('tkp', 'wanpela kain binatang'),
   ('sf', 'FAUNA.INSECT'),
   ('nt', 'pictured on PNG postage stamp'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Kaakauko ira toupareveira aue-ia niugini stemp.'),
   ('xp', 'Kaakauko em insect em i istap long niugini.'),
   ('xe', 'The gray weevil is found on the PNG stamp.'),
   ('ex', 'Kaakauko iria toupaeveira niugini stamia.'),
   ('xp', 'Weevil i stap long niguini stamp.'),
   ('xe', 'The gray weevil is on the New Guinea stamp.'),
   ('ex',
    'Kaakauko korekare iava oira iria iava varaua vurivurivira ora kaapovira toupaiveira.'),
   ('xp',
    'Kaakavuko em i wanpela kain binatang skin bilong em i braun na wait.'),
   ('xe', 'Kaakavuko is an insect whose body is brown and white.')]),
 ('kaakito',
  [('rt', 'kaaki'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'person blind with cataracts'),
   ('tkp', 'man i gat wanpela ei'),
   ('nt', 'nickname when used to describe one-eyed person'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Rarasirea kakito eisiva rera Tavusiva uruiia.'),
   ('xp', 'Rarasirea em i wan ai man bilong ples Tavusiova.'),
   ('xe', 'Rarasirea is a one-eyed man from Tavusiova village.'),
   ('ex', 'Kaakito kataitoa iava osireito vurapare.'),
   ('xp', 'Man i gat wanpela ei na i lukluk.'),
   ('xe', 'A one-eyed man looks out of one eye.')]),
 ('kaakuupato',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'spring of hot mineral water near Togarao.'),
   ('tkp', '???'),
   ('nt',
    'It is located in gulley above the shorter waterfall and is most likely ???.'),
   ('dt', '08/Feb/2005'),
   ('ex',
    'Kasiraopato kaakuupato uicoto ira vusivusipareveira vova rasito vo toupare togarao-ia sisiupaveira vosa upiapave ora ruvapasa.'),
   ('xp',
    'Kaakuupato em i spirins hot water em i stap long Togavao taim husat i sik bai ol waswas long bai sick br pinis .'),
   ('xe', '???'),
   ('ex',
    'Kaakuupato kasiraopato ukoto ira toupare eisi Rureva Togaraoia ruvaraia.'),
   ('xp', 'Hot wara kaakuupato i stap long Rureva klostu long Togarao.'),
   ('xe', 'The hot spring Kaakuupato is in Rureva near Togarao.')]),
 ('kaaova',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'aunt'),
   ('tkp', '???'),
   ('nt', 'FaSi'),
   ('sf', 'KIN'),
   ('dt', '19/Jul/2004')]),
 ('kaapa',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'copper metal'),
   ('tkp', 'retpela ain'),
   ('dt', '12/Feb/2005'),
   ('cmt', 'What is paupara doing in the second example?'),
   ('ex', 'Kaapa vao oa-ia kepa paupaviei.'),
   ('xp', 'Kaapa em i roof yumi save wokim haus long em.'),
   ('xe', 'Copper we make houses from.'),
   ('ex', 'Kaapara kepa paupara oara purapaiveira eisi Astararia.'),
   ('xp', 'Kapa bilong wokim haus ol i save wokim long Australia.'),
   ('xe', 'Copper rooves, they make them in Australia.')]),
 ('kaapea',
  [('ps', '???'),
   ('ge', 'weak'),
   ('ge', 'loose'),
   ('ge', 'easy'),
   ('tkp', '???'),
   ('cmt', 'Check spelling. Is it kaapea or kapea?'),
   ('dt', '03/Jun/2005'),
   ('ex', 'Kaapeta virago vao paupa.'),
   ('xp', 'Dispela chair i no strong bilong sindaun.'),
   ('xe', '???')]),
 ('kaapie',
  [('rt', 'kaa'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'hook'),
   ('ge', 'fishhook'),
   ('tkp', 'huk'),
   ('dt', '15/Feb/2004')]),
 ('kaapie',
  [('rt', 'kaa'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'hook'),
   ('eng', 'choke'),
   ('eng', 'snag'),
   ('eng', 'hook'),
   ('ge', 'capture'),
   ('tkp', 'hukim'),
   ('tkp', 'pasim long huk'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '15/Nov/2005'),
   ('cmt',
    "Double-check vowel length of kaa. First example doesn't make sense. Is it two sentences?"),
   ('ex', 'Aiopaoro karoi kakaeto kaapierivoi aioa-ia.'),
   ('xp', 'Kaikai pas long nek bilong mi kaikai i pas long pikinini.'),
   ('xe', '???'),
   ('ex',
    'Koie kaapierevo Ririre ovare oira gisipoaro iare karuveraisi vikirevo.'),
   ('xp', 'Ririre i tromoem singapo insait long maus bilong pik na i pas.'),
   ('xe', 'Ririre ???.'),
   ('ex', 'Aakova kakaeto kapieevoi aioa-ia.'),
   ('xp',
    'Mama i givim kaikai long pikinini na hap kaikai i pas long nek bilong em.'),
   ('xe', 'Mother made the boy choke with some food.'),
   ('ex',
    'Aakova kakaeto kaapievoi aioa-ia uvare viapau vearovira va orievo.'),
   ('xp', 'Mama em i mekim pas kaikai long pikinini bikos em i no kukim gut.'),
   ('xe',
    "Mother made the boy choke from the food because she didn't cook it well."),
   ('ex', 'Avuka kakaeto aiopiepaoro rera kaapieevo.'),
   ('xp', 'Lapun wok long givim kaikai long bebe na kaikai i pas long nek.'),
   ('xe', 'The old person fed the boy and made it choke.')]),
 ('kaapiepato',
  [('rt', 'kaapie'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'fisher'),
   ('tkp', 'man bilong hukim pis'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Aveatoa atari kapiepato vokiara rutu.'),
   ('xp', 'Aveato em i man bilong hukim pis olgeta de.'),
   ('xe', 'Aveato works as a fisherman every day.')]),
 ('kaapisi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'pinch together'),
   ('ge', 'grip with pincers'),
   ('tkp', 'holim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex',
    'Kaapisi ava eva ra avekeara kasiraopa ra kaekaepiea. Ra varao vera oara kasiraopai.'),
   ('xp',
    'Yu mas kam wantam sisis pinvh bar mi ya rausim ol dispela pela stow ol bai mi rausim ol dispela i hot.'),
   ('xe', '???'),
   ('ex', 'Avekeara kaapisi evara kasiraopara.'),
   ('xp', 'Yu rausim ol ton i hot long pansa.'),
   ('xe', '???')]),
 ('kaapisivira',
  [('rt', 'kaapisi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'linked'),
   ('ge', 'pinched'),
   ('tkp', '???'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Auea eva oa kaapisivira toupaivoi.'),
   ('xp', 'Samting i stap olsem pansa.'),
   ('xe', '???'),
   ('ex', 'Pariearei tapokovira toupai uva kaapisivira kekepapiroi.'),
   ('xp', 'Hap mambu i pas wantaim na i luk olsem sises.'),
   ('xe', '???')]),
 ('kaapo',
  [('ps', '???'),
   ('ge', 'white'),
   ('tkp', 'wait'),
   ('cmt', 'Not a sentence, I think.'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Kaapoa varoa ragai vaaro.'),
   ('xp', 'Waitpela laplap bilong mi.'),
   ('xe', 'My white clothes.'),
   ('ex',
    'Naomi aakova tavievo oisio kaakau ragai vaaro ivaraia vokaevo varoa kaapopa uva vuri keke rutu.'),
   ('xp',
    'Naomi tokim mama bilong em olsem, dok i wokabaut antap long waitpela laplap bilong mi na i luk nogut tru.'),
   ('xe', '???')]),
 ('kaapopato',
  [('rt', 'kaapo'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'white man'),
   ('tkp', 'waitskin'),
   ('cmt', 'One example is kaapoto, not kaapopato.'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Rarasoria kaapoto oirato.'),
   ('xp', 'Rarason em i waitman.'),
   ('xe', 'Rarason is a white man.'),
   ('ex', 'Rarasoria kaapopato ira rupairara vo reoaro poreparevo.'),
   ('xp',
    'Rarason em i waitpela man em i wok long tanim tok ples bilong ol bilak man.'),
   ('xe',
    'Robinson is a white man who is translating the language of black men.')]),
 ('kaara',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'car'),
   ('tkp', 'ka'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Kaara avaoe eisi-re Araua.'),
   ('xp', 'Ka i bin go long Arawa.'),
   ('xe', 'The car went to Arawa.'),
   ('ex', 'Moto kaara eira ragai oiraaro garevari.'),
   ('xp', 'Moto ka em bilong mi em liklik.'),
   ('xe', 'My car is small.')]),
 ('kaare',
  [('ps', '???'),
   ('ge', 'white'),
   ('tkp', 'wait'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Eira kaare riakova koie.'),
   ('xp', 'Em waitpela pik meri.'),
   ('xe', "That's a white female pig."),
   ('ex', 'Kaare reoreoa raga oaravu reoara vearo kekepiepasia.'),
   ('xp', '???'),
   ('xe', '???')]),
 ('kaareko',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'scour'),
   ('tkp', 'skrapim'),
   ('eng', 'scour'),
   ('eng', 'clean by scraping'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dcsv', 'true'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Avukava iria opokuio kaarekoevo va oriorioro.'),
   ('xp', 'Lapun meri i skirapim taro wait tru.'),
   ('xe', 'The old woman cleaned the taro by scouring it.')]),
 ('kaarekopie',
  [('rt', 'kaareko'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'scrub'),
   ('eng', 'scrape'),
   ('eng', 'scour'),
   ('eng', 'scrub'),
   ('eng', 'clean by scraping or scrubbing'),
   ('tkp', 'skirapim gut'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '15/Nov/2005'),
   ('ex', 'Vao opoa kaarekopierivoi rutu va oriorioro.'),
   ('xp', 'Yo sikirapim gut tru dispela taro na em wait dseta.'),
   ('xe', 'You scraped clean the taro by scouring it.'),
   ('ex',
    'Sera kaarekopievira opoara oriorievoi uva viapau katokatoara ivaraia.'),
   ('xp',
    'Sera em i sikirapim gut tru ol taro na ol taro i no gat bilakpela hap long em.'),
   ('xe',
    'Sera cooked the taro, cleaning it by scraping it, and there are no spots on it.'),
   ('ex', 'Sera pitokava kaarokopievo eisi uukovi.'),
   ('xp', 'Sera em i klinim gut sospen long wara.'),
   ('xe', 'Sera scrubbed the sauce pan in the water.')]),
 ('kaareto',
  [('rt', 'kaare'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'white male object'),
   ('tkp', '???'),
   ('cmt',
    "Is the gloss right? What exactly does this mean? Example isn't sentences."),
   ('nt', 'Used to describe a white male pig.'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Ragai reraaro kaareto koieto.'),
   ('xp', 'Waitpela pik man bilong mi.'),
   ('xe', 'My white pig.'),
   ('ex',
    'Oirato kaareto varuere kare iava ora korekare oisio osia koieto o kookopuoto.'),
   ('xp', 'Waitpela man animal ol insek olsem pik man o bataplai man.'),
   ('xe', '???')]),
 ('Kaareva',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'Mapiaro'),
   ('tkp', 'ples nem'),
   ('nt', 'altenate name for this village'),
   ('sf', 'TOP'),
   ('dt', '08/Feb/2005'),
   ('ex', 'Uruia vaisiaro vao Kaareva.'),
   ('xp', 'Em name blong wanpela ples Kaareva.'),
   ('xe', 'Kaareva is the name of a village.')]),
 ('kaava',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'feint with bow and arrow'),
   ('tkp', '???'),
   ('vx', '1'),
   ('dt', '03/Jun/2005'),
   ('cmt', 'Is there a better translation. Double check.'),
   ('ex', 'Kaavapauei viapau uvuipaupei iasia.'),
   ('xp', 'Skin blong yu i no strong tumas long sut.'),
   ('xe', "You feint but you can't shoot."),
   ('ex', 'Kaavapauei viapau uvuipau iasia eira-ia koetava.'),
   ('xp', 'Yu no i nap sut long dispela bonara.'),
   ('xe', "You feint but you can't shoot with a bow.")]),
 ('kaavaaua',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'plant'),
   ('tkp', '???'),
   ('sf', 'FLORA'),
   ('nt', 'salt flavor is derived by burning it to ash, it grows ???'),
   ('cmt', 'Is it kaavaaua or kaavaua?'),
   ('dt', '27/Feb/2005'),
   ('ex',
    'Kaavaua votoupai uuko vagaparo-ia ora vo uuko gaero sivova. Va kasiupa auero sait.'),
   ('xp',
    'Kaavaua em i save i stap long ol watertool na long arere bilong ol wara. Ol i save kukim bilong sait.'),
   ('xe', '???'),
   ('ex', 'Kaavaua okouroa purapapiroveira vo vagapapara kovuaro-ia.'),
   ('xp', 'Kaavaua i save i stap namel long waterfall.'),
   ('xe', 'The kaavaaaua plant shows up in the middle of waterfalls.')]),
 ('kaaveaka',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'clear up'),
   ('ge', 'make available'),
   ('tkp', '???'),
   ('arg', '???'),
   ('vx', '2'),
   ('dcsv', 'true'),
   ('dt', '08/Jun/2005'),
   ('cmt', 'The syntax of this is interesting and should be investigated.'),
   ('ex', 'Kaaveaka vao-ia uvuipai va purasia.'),
   ('xp', 'Em isi long mi long mekim.'),
   ('xe', "That's easy for me to do.")]),
 ('kaaveakapie',
  [('rt', 'kaaveaka'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', '???'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Vearovira uvare va kaveakapieri ragai-pa.'),
   ('xp', 'Tenkyu bikos yu mekim isi long mi.'),
   ('xe', 'Thank you because you make it easy for me.')]),
 ('kaaveakapievira',
  [('rt', 'kaaveakapie'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', '???'),
   ('tkp', 'i no putim gut samting'),
   ('dt', '15/Nov/2005'),
   ('ex', 'Kaveakapievira aveke tovoivo uva koveoe.'),
   ('xp', 'Ol i no putim gut ston bihain em i pundaun.'),
   ('xe', 'They placed the stone insecurely and it fell down.')]),
 ('kaaveakavira',
  [('rt', 'kaaveaka'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'easy'),
   ('tkp', 'isi'),
   ('eng', 'revealed'),
   ('eng', 'out in the open'),
   ('eng', 'freely'),
   ('eng', 'available'),
   ('eng', 'easy'),
   ('eng', 'clearly'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Kaaveakavira toupaivo ra va purape. Viapau riro gorupai.'),
   ('xp', 'Em i isi long mekim i no hat tumas long yumi long mekmekim.'),
   ('xe',
    "It is easy and it can't be done. It's not hard (literally: strong)."),
   ('ex', 'Kaveakavira rutu toupai veigei-pa ra va purape.'),
   ('xp', 'Em i isi tru long yumi wokim.'),
   ('xe', 'It is easy for us to do it.')]),
 ('kae',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'carry'),
   ('tkp', 'karim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dcsv', 'true'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Vao kae paupava tovosia vo.'),
   ('xp', 'Yu karim dispela chair na kam putim long hia.'),
   ('xe', 'Carrying this chia and put it here.'),
   ('ex', 'Viruata atari keesi kaerevoi eisi-re atoia.'),
   ('xp', 'Viruata i karim katen pis i go long ples.'),
   ('xe', 'Viruata is carrying a case of fish to the village.')]),
 ('kae',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'blow'),
   ('tkp', '???'),
   ('vx', '1'),
   ('cmt', 'double-check gloss'),
   ('dt', '06/Dec/2006')]),
 ('kaekae',
  [('rt', 'kae'),
   ('ps', '???'),
   ('ge', 'long'),
   ('tkp', 'long'),
   ('am', 'true'),
   ('dcsv', '???'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Tovasi vokepaaro-ia riro kaekaea toupaivoi paupaa.'),
   ('xp', 'Long haus bilong Tomas i go longpela bet bilong sindaun.'),
   ('xe', "In Thomas' house there is a long bed."),
   ('ex',
    'Buka-ia riro kaekae tapi rutu viapau uvuipai ra katai voki raga-ia voa-re avau.'),
   ('xp', 'Buka i long we tru, yu no inap i go long hap wanpela dei tasol.'),
   ('xe', "Buka is a faraway place, you can't go there in just one day.")]),
 ('kaekae',
  [('rt', 'kae'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'tempt'),
   ('tkp', 'traim'),
   ('arg', '???'),
   ('vx', '???'),
   ('dt', '08/Dec/2005'),
   ('ex',
    'Ragai kaekaeparevo reoreopaoro oisio ra kasipura ra ragai uporeve.'),
   ('xp',
    'Em i wok long traim mi long toktok olsem na bai mi koros na bai em i paitim mi.'),
   ('xe', '???')]),
 ('kaekaearo',
  [('rt', 'kae'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'length of'),
   ('tkp', 'long bilong en'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Va kaekaearo vao-ia oiso uvuipai.'),
   ('xp', 'Longpela em em i nap olsem.'),
   ('xe', 'Its length is enough.'),
   ('ex', 'E vo kepa kaekaearo rutua vao-ia.'),
   ('xp', 'Em longpela bilong dispela haus.'),
   ('xe', "That's the length of the house."),
   ('ex', 'E vo kepa kaekaearo ourivo.'),
   ('xp', 'Yu kisim longpela bilong haus.'),
   ('xe', 'Get the length of the house.'),
   ('ex',
    'Buka raiva kaekaearo riro kaekaea rutu Arawa raiva-pa oa tutupievira toupaivoi.'),
   ('xp',
    'Buka rot em i longpela rot moa yet long rot bilong Arawa em i stap klostu tasol.'),
   ('xe', '???')]),
 ('kaekaeo',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'type of fern'),
   ('tkp', '???'),
   ('nt', 'grows on the ground and has leaves used in medicine and in coded'),
   ('dt', '11/Feb/2005'),
   ('sf', 'FLORA'),
   ('ex',
    'Kaekaeo okouro riropai vo vegoaro vara gururaroia ruvaropaveira aue upiaara ora okovorovu toupaivoi.'),
   ('xp',
    'Kaekaeo em i save grom lans buch ol, save usim lit blons em loag nakim manas bloms sick.'),
   ('xe', '???'),
   ('ex', 'Kaekaeo kouro oara kovapapeira vo uva kovopaiveira.'),
   ('xp', 'Kaekaeo i save kamap long ol hap ol i save wokim gaden.'),
   ('xe', 'This type of fern usually appears wherever they garden.')]),
 ('kaekaesoto',
  [('rt', 'kaekae'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'tall person'),
   ('tkp', '???'),
   ('dt', '11/Sep/2004'),
   ('ex', 'Mak iro kaekaesoto.'),
   ('xp', 'Mak em i longpela man.'),
   ('xe', 'Mark is tall.'),
   ('ex', 'Itasito kaekaesoto eisiva rera Ivu.'),
   ('xp', 'Itasito em i longpela man bilong Ibu.'),
   ('xe', 'Itasito is a tall man from Ibu.')]),
 ('kaekaevira',
  [('alt', 'kaekaepavira'),
   ('rt', 'kaekae'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'long'),
   ('eng', 'long awy'),
   ('eng', 'far away'),
   ('tkp', 'long we'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Rarasori riro kaekaevira rutu-va urioroera vigei vo tokoaro iare.'),
   ('xp', 'Rarason i bin kam long we tru na kam long ailan bilong yumi.'),
   ('xe', 'Robinson came from far away to our island.')]),
 ('kaekeru',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'clavicle'),
   ('tkp', 'bun bilong sol'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Kaekeru keru upiaei ragai-re voea vao kaepaavoi.'),
   ('xp', 'Klavikol bun em i pain taim mi kalim dispela.'),
   ('xe', 'The collar bone hurts me ???.'),
   ('ex', 'Kaekeru oaia ogata tuupaveira riakora ora oirara evao kaepaveira.'),
   ('xp',
    'Klavikol bun ol meri i save karim woksak na tu ol man i save karim diwai.'),
   ('xe', 'On their shoulders women carry bags and men carry trees.')]),
 ('kaepaa',
  [('rt', 'kae'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'wheelbarrow'),
   ('ge', 'basket'),
   ('tkp', '???'),
   ('nt', 'anything used for carrying things'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kiaie kouro kaesia kaepa va-ia.'),
   ('xp', 'Yu go karim ol pipia long wilbaro.'),
   ('xe', 'Carry the trash in the wheelbarrow.')]),
 ('kaepie',
  [('rt', 'kae'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('sn', '1'),
   ('ge', 'hoist'),
   ('ge', 'lift'),
   ('ge', 'raise'),
   ('tkp', 'apim'),
   ('tkp', 'antapim'),
   ('tkp', 'putim antap'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '15/Nov/2005'),
   ('ex', 'Pita, sigoa kaepieoro uriou eva evoa rasiuaro.'),
   ('xp', 'Pita, yu kisim naip i kam em i stap long graun.'),
   ('xe', "Pita, come pick up the knife that's on the ground."),
   ('ex', 'Aiteto rirova aveke kaepierevo aruvea.'),
   ('xp', 'Papa i hapim traipela ston asde.'),
   ('xe', 'Dad lifted a large stone yesterday.')]),
 ('kaepie',
  [('rt', 'kae'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('sn', '2'),
   ('ge', '???'),
   ('tkp', 'kisim bik nem'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '11/Feb/2007'),
   ('ex',
    'Rarasori vaisiaro kaepieivoi uvare vearo kovo rutu purare voea Rotokasi taere-ia'),
   ('xp',
    'Ol i putim nem bilong Rarason i go hantap bikos em i wokim gutpela wok long Rotokas.'),
   ('xe',
    "They've put Robinson's name up there because he is doing good work among the Rotokas."),
   ('ex', 'Rarasori riro vaisi ourevoi rera varo-ia vearo kovo.'),
   ('xp', 'Rarason em i kisim bik nem long gutpela work bilong em.'),
   ('xe', 'Robinson is getting a big name for his work.')]),
 ('kaepievira',
  [('rt', 'kaepie'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'high'),
   ('tkp', 'antap hai'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Vaoepa kaepievira toupai viraia rutu.'),
   ('xp', 'Dispela em i stap antap tru.'),
   ('xe', 'This is really high up.'),
   ('ex', 'Tutue pukui kaepievira toupaivoi.'),
   ('xp', 'Maunten Balbi em i stap antap tru.'),
   ('xe', 'Mount Balbi is up high.')]),
 ('kaereasi',
  [('ps', '???'),
   ('ge', 'type of weave that is slanted (X-like)'),
   ('tkp', 'wankain samap'),
   ('cmt', 'Second ex. sentence needs to be edited'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Touro ira kaereasi take raga turupareveira.'),
   ('xp', 'Touru i save wokim tasol kaereasi wol.'),
   ('xe', 'Touro only knows how to make slant-weave walls.'),
   ('ex',
    'Viapau uvuipai ra oiratoavai o kaereve takei iria vaisipaiveira oisio kaereasi uvare kovekove paoveira osia oira kaepaive.'),
   ('xp',
    'Nogat wanpela man karim dispela wuol ol i kolim olsem kaereasi bikos em i save pundaon nating taim ol i karim.'),
   ('xe', 'No man can ???.')]),
 ('kaereasivira',
  [('rt', 'kaereasi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'slanted'),
   ('ge', 'crooked'),
   ('tkp', 'i no stret i go'),
   ('dt', '11/Feb/2005'),
   ('cmt', 'What does regorepa mean?'),
   ('ex', 'Kaereasivira vao toupai regorepanisivi.'),
   ('xp', 'Dispela em i stap krut liklik.'),
   ('xe', 'This is slightly crooked.'),
   ('ex', 'Vao evaoa kaereasivira toupai regorepa.'),
   ('xp', 'Dispela diwai i no gutpela long karim bikos em i krunkut.'),
   ('xe', 'This tree is crooked and ???.')]),
 ('kaetu',
  [('ps', '???'),
   ('ge', 'tight'),
   ('ge', 'stiff'),
   ('ge', 'taut'),
   ('tkp', 'tait'),
   ('dcsv', 'true'),
   ('am', 'false'),
   ('cmt', 'Is it kaitu or kaetu?'),
   ('dt', '07/Jun/2005'),
   ('ex', 'Iroiro vao kaitu vara toupaivoi.'),
   ('xp', 'Dispela rop i tait tru.'),
   ('xe', 'This rope is tight.')]),
 ('kaetupie',
  [('rt', 'kaetu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'tighten'),
   ('tkp', 'taitim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cmt', 'It is kaetupie or kaitupie?'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Iroiro kaetupieta, ra vara ivupe.'),
   ('xp', 'Yu taitim dispela rop na XXX pulim.'),
   ('xe', "Tighten the rope and we'll pull them."),
   ('ex', 'Iroiro vao kaetupieri rutu ra kaetupe rutu.'),
   ('xp', 'Yu taitim tru dispela rop. Bai em i tait tru.'),
   ('xe', 'Tighten this rope good and it will be tight.'),
   ('ex', 'Iroiro vao kaitupieri ra kaitupe rutu.'),
   ('xp', 'Yu taitim strong rop bai tai tru.'),
   ('xe', 'Tighten the rope and it will be tight.')]),
 ('kaetuvira',
  [('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'tight'),
   ('ge', 'stiff'),
   ('ge', 'taut'),
   ('tkp', 'tait'),
   ('cmt', 'Is it kaitu or kaetu?'),
   ('dt', '03/Jun/2005'),
   ('ex', 'Iroiro vao kaetuvira toupai.'),
   ('xp', 'Dispela rop em tait tumas.'),
   ('xe', 'This rope is too tight.')]),
 ('kaeviro',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'lift off'),
   ('ge', 'take off'),
   ('tkp', 'go antap'),
   ('sc', 'MOTION'),
   ('vx', '1'),
   ('nt', 'used to describe action of plane'),
   ('dt', '03/Jun/2005'),
   ('ex', 'Pita kaeviroroe kepa kekesia oa vuripierevo kiuvu.'),
   ('xp', 'Pita i go antap na lukim haus win i bagarapim.'),
   ('xe', 'Peter went to look at the house that the wind destroyed.')]),
 ('kagave',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'forehead'),
   ('tkp', 'poret pes'),
   ('sf', 'BP'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Kukue-ia toupaevoi kakave.'),
   ('xp', 'Peis em i stap long het.'),
   ('xe', 'The forehead is on the head.'),
   ('ex',
    'Riakova resipaivo oiratoa iare vuri kagaveto uva rera-pa reasipaoro torievo ouruivu iare.'),
   ('xp',
    'Ol i mekim meri long wanpela pes nogut man na meri i les long em na em i ranawe i go long narapela ples.'),
   ('xe',
    'They made the woman go to the man with a bad forehead and, not wanting him, she ran away to another village.')]),
 ('kaie',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'make trash'),
   ('ge', 'create a mess'),
   ('tkp', 'pipia'),
   ('vx', '1'),
   ('cmt', 'What does panat mean?'),
   ('sc', '???'),
   ('dt', '03/Jun/2005'),
   ('ex', 'Kaiepauei.'),
   ('xp', 'Yu mekim panat pipia.'),
   ('xe', 'You are creating a mess with the rubbish.'),
   ('ex', 'Riroara rutu kaiepaue voia.'),
   ('xp', 'Yu mekim panat pipia tumas.'),
   ('xe', "You're making a lot of garbage.")]),
 ('kaiea',
  [('rt', 'kaie'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'rubbish'),
   ('ge', 'trash'),
   ('ge', 'garbage'),
   ('tkp', '???'),
   ('dt', '12/Feb/2005'),
   ('cmt', 'Is the second sentence a command or a sentence fragment?'),
   ('ex', 'Kaiea vao voia kepa-ia.'),
   ('xp', 'Pipia em istap long house.'),
   ('xe', 'The trash is in the house.'),
   ('ex', 'Kaieara vikisia eisi rikui.'),
   ('xp', 'Go tromoe ol pipia long hol.'),
   ('xe', 'throw trash in the hole.')]),
 ('kaikaio',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'type of vine'),
   ('tkp', '???'),
   ('nt', "Sometimes mistaken for the black vine called `varuru iro'."),
   ('dt', '12/Feb/2005'),
   ('ex',
    'Kaikaio vo kovapapeira vo uva kovoive rovovovira va-ia ruvarupavei vosia ira upiaparo.'),
   ('xp',
    'Kaikaio em i save grow long ples o hap ol isave wokim grondew long em.'),
   ('xe', 'The kaikaio vine grows here and ???.'),
   ('ex', 'Kaikaio iria vo karekepaoveira uva kovopaiveira.'),
   ('xp', 'Kaikaio i save kamap long hap ol i save wokim gaden.'),
   ('xe', 'This type of vine appears where they garden.')]),
 ('Kaio',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('eng', 'name'),
   ('ig', 'yes'),
   ('dt', '17/May/2004')]),
 ('kaipori',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'alert'),
   ('tkp', 'alert'),
   ('eng', 'perky'),
   ('eng', 'alert'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Kaiporiparoi.'),
   ('xp', '???'),
   ('xe', 'He is very alert.'),
   ('ex', 'Kaekaeto kaiporivira vuravurarevo osia aakova rera kavauevo.'),
   ('xp', 'Pikinini em i lukluk orait taim mama em i karim em.'),
   ('xe',
    'The boy looked around alertly when his mother gave birth to him.')]),
 ('kaiporipie',
  [('rt', 'kaipori'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'arouse'),
   ('eng', 'arouse'),
   ('eng', 'perk up'),
   ('tkp', 'kirapim'),
   ('tkp', 'mekim orait'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '22/Nov/2005'),
   ('ex',
    'Perapera kaiporipierevo Aata uvare rera-va avaroe ruvarupa kepa iare.'),
   ('xp',
    'Aata em i mekim orait Perapera taim em i kisim em i go long haus sik.'),
   ('xe',
    'Aata made Perapera perk up because he went to the medical station with him.')]),
 ('kaiporivira',
  [('rt', 'kaipori'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'alertly'),
   ('tkp', '???'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Kaiporivira vuravurarevoi.'),
   ('xp', '???'),
   ('xe', 'He is looking around in an alert way.'),
   ('ex', 'Luke kaiporivira toreroi uusi tapi-va.'),
   ('xp', 'Luke em i kirap gut long slip.'),
   ('xe', 'Luke got up alert from bed.'),
   ('ex', 'Timoti kaiporivira toupare viapau ruraparevoi.'),
   ('xp', 'Timoti i no ai slip em i stap gut.'),
   ('xe', "Timothy is alert, he's not ???."),
   ('ex', 'Pita kaiporivira vokaparevo uvare upia opesie.'),
   ('xp', 'Pita em i wokabaut olsem em i orait bikos sik bilong i pinis.'),
   ('xe', 'Peter walked about alert because his sickness finished.')]),
 ('kairi',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'spear'),
   ('ge', 'arrow'),
   ('tkp', '???'),
   ('dt', '28/Jan/2005'),
   ('ex', 'Kairi isi-va uriou Pita koie togasia.'),
   ('xp', 'Pita kam sutim pik long spia.'),
   ('xe', 'Pita came to spear the pig with an arrow.')]),
 ('kairiro',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'bushes'),
   ('ge', 'small trees'),
   ('tkp', 'ol liklik diwai'),
   ('nt', 'generic?'),
   ('cmt',
    'Is the <ro> part of the stem or is it bound morphology? Can you get <kairi>, <kairirei>, and <kairiro>?'),
   ('dt', '29/Nov/2006'),
   ('ex', 'Evao kairiro-va uriou.'),
   ('xp', 'Yu kam wantaim ol liklik diwai.'),
   ('xe', 'You come with the little trees.')]),
 ('kairo',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'neck'),
   ('tkp', 'nek'),
   ('sf', 'BP'),
   ('cmt', 'Is it toupa vokiia?'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kairo okairo vo uva aioara varapapeira kopapaperia.'),
   ('xp', 'Kaikai i save ga down long neck bilong yumi.'),
   ('xe', '??? and food ???.'),
   ('ex', 'Koike ira kairo-ia asiroi toupa voki-ia.'),
   ('xp', 'Koike em i putim nektae long sande.'),
   ('xe', 'Koike ???.')]),
 ('kaita',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'white beads'),
   ('tkp', 'waitpela bis'),
   ('cmt', 'Is it kaita or kaaita?'),
   ('dt', '28/Jan/2005'),
   ('ex', 'Eresivie kaaitauaia aasipaoi.'),
   ('xp', 'Eresivie i putim waitpela bis.'),
   ('xe', 'Eresivie is puttong on white beads.')]),
 ('kaitutu',
  [('alt', 'kaitu'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'resolute'),
   ('ge', 'steadfast'),
   ('ge', 'tight'),
   ('tkp', 'tait'),
   ('cmt', 'Are kaitutu and kaitu really just alternates? Any difference?'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Kaitutuparoepa uriri asavira.'),
   ('xp', '???'),
   ('xe', 'He was fearlessly steadfast.'),
   ('ex', 'Pita, kaitupai eva iroiro oa iava tokopiroi varo tavava.'),
   ('xp', 'Pita, rop ia em i tait olsem na i bruk wantaim ol laplap.'),
   ('xe',
    'Peter, that rope is tight and therefore it broke with the clothes.')]),
 ('kaitutupie',
  [('alt', 'kaitupie'),
   ('rt', 'kaitutu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'smooth out'),
   ('ge', 'tighten'),
   ('tkp', '???'),
   ('cmt', 'Again, check on the kaitutu and kaitu difference...'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cmt', 'Figure out semantics of this...'),
   ('nt',
    'Refers to smoothing out cloth during process of pressing it with hot ???'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Iroiro kaitutupieri.'),
   ('xp', 'Yu taitim strong rop.'),
   ('xe', 'You tighten the rope.'),
   ('ex',
    'Vutevi viapau iroiro kaitupievoi oa iava varo tava rasitoa-ia rirue uva vuriei.'),
   ('xp',
    'Vutevi em i no taitim rop olsem na ol laplap long lain i tasim graon na i bagarap.'),
   ('xe',
    "Vutevi didn't tighten the rope and therefore the clothes on the ground fell down and got messed up.")]),
 ('kaitutuvira',
  [('rt', 'kaitutu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'resolutely'),
   ('ge', 'steadfast'),
   ('ge', 'tightly'),
   ('tkp', '???'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Vivisi ira oisioa kaitutuvira toreparo.'),
   ('xp', 'Vivisi em i save sanap strong.'),
   ('xe', 'Vivisi always stand resolute.')]),
 ('kakae',
  [('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'child'),
   ('tkp', 'pikinini'),
   ('nt', 'plural is irregular, <kakaevure>, and NOT <kakaeirara>'),
   ('cmt', 'Check vowel length. Is it avae or avai?'),
   ('dt', '04/Sep/2005'),
   ('ex', 'Oira kakae kavauevo Sipirie.'),
   ('xp', 'Sipirie i bonim pikinini man.'),
   ('xe', 'Sipirie gave birth to a boy.'),
   ('ex', 'Kakaevure avae skurusia.'),
   ('xp', 'Ol pikinini i bin go long skul.'),
   ('xe', 'The children are going to school.')]),
 ('kakae',
  [('ps', '???'),
   ('ge', 'small'),
   ('ge', 'little'),
   ('tkp', 'liklik'),
   ('cmt', 'Need ex of this used as verb'),
   ('dt', '29/Oct/2005'),
   ('ex',
    'Pita, vii kaekae rugovira toupari, viapau uvuipauei ra oiraopavira rugopau.'),
   ('xp',
    'Pita, tingting bilong yu olsem pikinini, yu no inap i gat trupela tingting.'),
   ('xe', "Peter, you are short-sighted, you can't think correctly.")]),
 ('kakae',
  [('ps', 'CLASS'),
   ('ge', 'child'),
   ('tkp', 'pikinini'),
   ('dt', '04/Sep/2005'),
   ('ex', 'Sigo kakae kovorevo Patriki uva rera upoevo aakova.'),
   ('xp', 'Patrik em i pundaunim liklik naip na mama i paitim em.'),
   ('xe', 'Patrick dropped the little knife and his mother smacked him.')]),
 ('kakaevira',
  [('rt', 'kakae'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'small-like'),
   ('tkp', 'liklik'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kakaevira touparivoi viapau uvuipau va riakovavai ouri.'),
   ('xp', 'Yu liklik meri tumas yu no i nap kisim wanpela man.'),
   ('xe', "You're too small; you can't get a woman."),
   ('ex', 'Kakaevira toupaoro ro opitato pauriva.'),
   ('xp', 'Yu bin i stap liklik na yu bin planim dispela kokonas.'),
   ('xe', 'When you were little, you planted this coconut tree.')]),
 ('kakapikoa',
  [('rt', 'kakapiko'),
   ('ps', '???'),
   ('ge', 'small'),
   ('ge', 'little'),
   ('tkp', 'liklik'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Rarasori kakapikoa aioa aioparevoi uva rera-pa siraoparoe Pita.'),
   ('xp', 'Rarasori em i kaikaim liklik hap kaikai na Pita i sori long em.'),
   ('xe', 'Robinson is eating little food and Peter feels sorry for him.')]),
 ('kakapikoto',
  [('rt', 'kakapiko'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'newborn baby'),
   ('tkp', '???'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Garetoavi kakapikoto kakaeto ira kavau aakova vovokio.'),
   ('xp', 'Liklik pikinini tru mama i karim em tude tasol.'),
   ('xe', '???'),
   ('ex', 'Kakaeto kakapikoto ira kavaue aakova vovokio raga.'),
   ('xp', 'Liklik pikinini, mama i karim tude.'),
   ('xe', '???')]),
 ('kakapu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'place in sling for purpose of carrying'),
   ('tkp', 'slingim'),
   ('tkp', 'pasim laplap'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Aakova kakaeto kakapupae.'),
   ('xp', '???'),
   ('xe', 'The mother places the child in the sling.'),
   ('ex', 'Aakova kakaeto kakapupae varoa-ia rera kaepaoro.'),
   ('xp', 'Mama i karim liklik pikinini blong long laplap.'),
   ('xe', 'The mother is carrying the child ???.')]),
 ('kakapua',
  [('rt', 'kakapu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'sling for lifting'),
   ('tkp', 'sling'),
   ('dt', '12/Feb/2005'),
   ('cmt', "Don't understand syntax of third example."),
   ('ex', 'Kakapuava ena ra ro kakaeto kakapua rera kaeoro rera Tutuoro.'),
   ('xp', 'Yu kalim dispela pikinini long dispela slins bek.'),
   ('xe', '???'),
   ('ex', 'Kakapuava uriou ra kakaeto kaea.'),
   ('xp', 'Yu kam wantaim sling bai mi karim pikinini.'),
   ('xe', "Come with the sling and I'll carry the child."),
   ('ex', 'Uvuoa-ia kakapua toupaivoi kotokotoara verapasia rasiuaro-re.'),
   ('xp', 'Long sip i gat sling bilong rausim ol kago i go long graun.'),
   ('xe', 'On a ship there is a crane ??? to leave cargo on the ground.')]),
 ('kakara',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'arm band'),
   ('ge', 'bracelet'),
   ('tkp', 'paspas'),
   ('cmt',
    "Check gloss. Is translation of example right? What about 'kakara pako' vs. simply 'pakoua'?"),
   ('dt', '31/Oct/2005'),
   ('ex', 'Auoro, uriou asi kakara-va.'),
   ('xp', 'Yu kam wantaim hap rop buai.'),
   ('xe', 'Hey, come with a betel nut rope.'),
   ('ex', 'Tesi vearo kekepaoi kakara pako tovoevoi ora oira vavaearo-ia.'),
   ('xp', 'Tesi em i luk gut bikos em i putim paspas long han bilong em yet.'),
   ('xe', 'Tesi looks good, she put a bracelet on her wrist.')]),
 ('Kakarapaia',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'village name'),
   ('tkp', 'ples nem'),
   ('dx', 'Pipipaia'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kakarapai urui - uruia vaisiavo aue kakarabaia.'),
   ('xp', 'Name blong peles em long kakarapaia.'),
   ('xe', '???'),
   ('ex', 'Kakarapaia sevendepairara vo uruiaro.'),
   ('xp', 'Kakarapaia ples bilong ol sevende.'),
   ('xe', 'Kakarapaia is a village of Seventh Day Adventists.')]),
 ('kakarau',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'frog'),
   ('ge', 'stingray'),
   ('tkp', 'rokrok'),
   ('tkp', 'prok'),
   ('tkp', 'epa korvo'),
   ('sf', 'FAUNA'),
   ('nt', 'this type of frog lives in the bush'),
   ('dt', '28/Feb/2005'),
   ('ex', 'Kakarau vo toupava vegoaro ora vovuko gaevo sirova.'),
   ('xp',
    'Kakarau tros em i save tstap long buck na long arere long ol wara.'),
   ('xe', 'The ??? frog lived in the bush and ???.'),
   ('ex', 'Kakarau iria vo toupaeveira uuko gaero sirova aiopava.'),
   ('xp', 'Rokrok em i save i stap arere long wara na em i bilong kaikai.'),
   ('xe', 'The kakarau frog lives on the ??? and is food.')]),
 ('Kakarera',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Kakarera-ia uva opitara paureva Raupeto.'),
   ('xp', 'Raupeto i bin palanim ol kokonas long Kakarera.'),
   ('xe', 'Raupeto is planting coconuts in Kakarera.')]),
 ('kakata',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'cockatoo'),
   ('tkp', 'koki'),
   ('dt', '31/Aug/2005'),
   ('ex', 'Kakata popoteva kokiova riro vavioko aiova.'),
   ('xp', 'Koki em i waitpela pisin na i save kaikai popo.'),
   ('xe', 'The kakata bird is a white bird and a big eater of pawpaw.')]),
 ('kakate',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bamboo.tube'),
   ('tkp', 'mambu bilong karim wara'),
   ('eng', 'bamboo tube for water'),
   ('dt', '27/Jul/2005'),
   ('ex', 'Kakate aue kaepava uko kakate purapai aue rava veta uuko kaepas.'),
   ('xp', 'Kakate bamboo ol i save wokim long bamboo karim water.'),
   ('xe', '???'),
   ('ex', 'Raru kakate ukoro avaroe eisi-re kovoa.'),
   ('xp', 'Raru i pulamapim wara long mambu na go long wok.'),
   ('xe', 'Raru fills up the bamboo water tub and goes to work.')]),
 ('kakatuara',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of lizard'),
   ('tkp', '???'),
   ('nt', 'dark body, tan scales, eats snakes, skin not used for drum'),
   ('sf', 'FAUNA'),
   ('dt', '12/Feb/2005'),
   ('ex',
    'Kakatuara kanora oiso rera osia kariana ira gagopaiveira aue-re kututoroia-re.'),
   ('xp',
    'Kakatuara lizard em i wanpela lizard ol save kisim skin blong blong putim long kustu drum.'),
   ('xe', '???'),
   ('ex', 'Kakatuara riro torito ikaupavira.'),
   ('xp', 'Palai em i save ranawe kwik.'),
   ('xe', 'The kakatuara is a quick runner.')]),
 ('kakau',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'cocoa'),
   ('tkp', 'kakao'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '28/Jan/2005'),
   ('ex', 'Kakau kovoro purapaivoi oirara moni oupasia.'),
   ('xp', 'Gaden kakau ol man i save wokim bilong kisim moni.'),
   ('xe', 'Men made cocoa gardens in order to get money.')]),
 ('kakauoa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'skin'),
   ('eng', 'skin'),
   ('eng', 'shell'),
   ('eng', 'husk'),
   ('tkp', 'skin'),
   ('cmt', 'Can it be a shell? Need to check semantics.'),
   ('dt', '10/Nov/2005'),
   ('ex', 'Kakauoa aue iava evaova vararo-ia tegoro.'),
   ('xp', 'Skin blong diwai ir wile banana.'),
   ('xe', '???'),
   ('ex', 'Tetevutoa iava kakauoa verarevoi aue Reivasia.'),
   ('xp', 'Reivasia i rausim skin bilong saksak.'),
   ('xe', 'Reivasia is removing the husk from the sago.')]),
 ('kakavea',
  [('ps', 'N'),
   ('pt', '???'),
   ('cl', 'isi'),
   ('ge', 'fern tree'),
   ('tkp', 'wankain diwai i no strong'),
   ('nt', 'generic term'),
   ('sf', 'FLORA'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kakavea oiso va osa evaova rao ra toupai ora kupei.'),
   ('xp', 'Kakave em wanpela kain diwai em gro long bus.'),
   ('xe', '???'),
   ('ex', 'Kakaveasia oo vaisi oiso rasaita osa aue kakavea vaisi toopai.'),
   ('xp', 'Kakaveasi em I wan kain nem olsem tasol kakavea.'),
   ('xe', '???'),
   ('ex', 'Kakavea oa vegoaro toupaiveira.'),
   ('xp', 'Pen i save i stap long bus.'),
   ('xe', 'Ferns live in the bush.'),
   ('ex', 'Kakavea isi aisia toepaiveira kepa purapasia vo vukoro-ia.'),
   ('xp', 'Pen ol i save katim bilong wokim haus.'),
   ('xe', '???')]),
 ('kakavoro',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'Malay apple tree'),
   ('tkp', 'laulau'),
   ('dt', '28/Jan/2005'),
   ('ex', 'Tevire kakavoro kue-ia tuparevoi.'),
   ('xp', 'Tevire i tambuim laulau.'),
   ('xe', 'Tevire put a taboo on Malay apple trees.')]),
 ('kakavu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'scoop up with the hands'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Avo, varoo rutu kakavuri piratiara ora vii varaearo-ia.'),
   ('xp', 'Meri yu mas kisim olgeta pinat long han bilong yu.'),
   ('xe', 'Woman, you pick up all of the coconuts in your hands.'),
   ('ex', 'Varoo rutu pirati kakavuavoi.'),
   ('xp', 'Y mi kisim olgeta pinats.'),
   ('xe', 'I am scooping up all of the peanuts.'),
   ('ex', 'Vareapi pinatiara rutu kakavure.'),
   ('xp', 'Vareapi i kisim olgeta pinat.'),
   ('xe', 'Vareapi scooped up peanuts.'),
   ('ex', 'Kuriara kakavuri.'),
   ('xp', '???'),
   ('xe', 'Scoop up the scrapings with your hands.')]),
 ('kakeoto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'inside'),
   ('eng', 'inside'),
   ('eng', 'insides'),
   ('eng', 'internals'),
   ('eng', 'innards'),
   ('tkp', 'insait'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Sera karuveraisi rakaraiaro vikievoi uva sovarai apa raga aioevoi kakeoa.'),
   ('xp',
    'Sera em i tromem skin bilong singapo na kaikaim insait bilong em tasol.'),
   ('xe', 'Sera threw away ??? and ???.')]),
 ('kaki',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'open'),
   ('eng', 'cracked open'),
   ('eng', 'split open'),
   ('tkp', 'bruk'),
   ('cmt', 'Need clear example illustrating subject agreement'),
   ('vx', '1'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kakioviro rivuko.'),
   ('xp', '???'),
   ('xe', 'The mosquito is burst open.'),
   ('ex', 'Opita isi kaki va isi toeoro.'),
   ('xp', 'Yu katim kokonas bai bruk.'),
   ('xe', 'Split up a coconut by cutting it.'),
   ('ex',
    'Opita isi kaki va isi toeovo vii kukueno kakipanoi tooro vii ragiovo.'),
   ('xp', 'Yu brukim kokonas taim yu katim bai mi brukim het bilong yu.'),
   ('xe', '???')]),
 ('kaki',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'open'),
   ('eng', 'crack open'),
   ('eng', 'split open'),
   ('tkp', 'brukim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '14/Nov/2005'),
   ('ex', 'Opita isi kaki va isi toeoro.'),
   ('xp', 'Yu katim kokonas bai bruk.'),
   ('xe', 'Split up a coconut by cutting it.'),
   ('ex',
    'Opita isi kaki va isi toeoro vii kukuero kakipaavoi tooro vii ragiovo.'),
   ('xp', 'Yu brukim kokonas taim yu katim bai mi brukim het bilong yu.'),
   ('xe', '???'),
   ('ex',
    'Tapipava-ia kakia toupaivo gareavi ovusia oira-ia uko rovu kokoavo kasiraopa rovu uva pegeovoioviro.'),
   ('xp',
    'Mi kapsaitim hot wara long kap i gat liklik mak olsem i bruk na kap i bruk olgeta.'),
   ('xe', '???')]),
 ('kakiaki',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'crack open'),
   ('ge', 'fracture'),
   ('tkp', 'bruk'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Vetatou kakiakirivoi va varipiero.'),
   ('xp', 'Yu brukim het bilong yu.'),
   ('xe', 'You cracked your head.'),
   ('ex', 'Vi kukuearo kakiakipaavoi viuparo.'),
   ('xp', 'Bai mi brukim het blong yu.'),
   ('xe', "I'm going to break your head open."),
   ('ex', 'Teva iru kakiakipaoi Tasia kukuearoia.'),
   ('xp', 'Teva i paitim laus long het bilong Tasia.'),
   ('xe', "Teva cracks open lice on Tasia's head.")]),
 ('kakiri',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'type of insect'),
   ('tkp', '???'),
   ('nt', 'resembles sandfly, lives near beach area'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Kakiri ira-ia riro kou toupai aue oru.'),
   ('xp', 'Binatang i gat plenti gras long skin.'),
   ('xe', 'As for the ???, it has lots of ???.')]),
 ('kakiua',
  [('rt', 'kaki'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'cave'),
   ('ge', 'cavern'),
   ('tkp', 'hol ston'),
   ('dt', '11/Feb/2007'),
   ('cmt', 'Check spelling on originals'),
   ('ex',
    'Ave kakikakiva vo toupai aveke keparo-ia kakiua vo uva uvuipai usipasa vo vegoaro.'),
   ('xp', 'Kakiua cave ol mom i kan sleep long em .'),
   ('xe', '???'),
   ('ex', 'Kakiua vo uva uvuipai akauko kakau karu waipave.'),
   ('xp', 'Na tu ol animal i kan slip long em.'),
   ('xe', '???'),
   ('ex', 'Kakiua oua siovaraia rakoru kare toupaiveira riropa kare.'),
   ('xp', 'Insait long hol ol snek i save i stap long em.'),
   ('xe', 'Inside of a cave, there are many snakes.')]),
 ('kaku',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'split open'),
   ('tkp', 'op bruk'),
   ('arg', 'O'),
   ('vx', '2'),
   ('nt', 'using instrument (axe or knife)'),
   ('dcsv', 'true'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Kepa pausia oira kakurivere.'),
   ('xp', '???'),
   ('xe', 'Split it open in order to build the house.'),
   ('ex', 'Pita opita isi kakurevoi va isi toeoro.'),
   ('xp', 'Pita i katim kokonas na i bruk.'),
   ('xe', 'Peter cut the coconut, breaking it.')]),
 ('kakua',
  [('rt', 'kaku'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'axe'),
   ('ge', 'knife'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Kareviri kakua-ia torara paure.'),
   ('xp', 'Kareviri i putim tamiok long andoro.'),
   ('xe', 'Kareviri ???.'),
   ('ex',
    'Ata torara iva kaku rao gasirevoi uva viapau uvuipa ra evaovavai toreve oira-ia oa-ia oavu raga tovorevoi torara-ia.'),
   ('xp',
    'Ata em i brukim andol bilong tamiok olsem na em i no inap katim wanpela diwai olsem na em putim narapela andol gen.'),
   ('xe', '???')]),
 ('kakuaku',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'break'),
   ('tkp', '???'),
   ('eng', 'break into pieces with instrument'),
   ('arg', 'O'),
   ('vx', '2'),
   ('nt', 'typically the instrument is a knife'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Kaukau kakuakuri.'),
   ('xp', '???'),
   ('xe', 'Break open the sweet potato with some instrument.'),
   ('ex', 'Iovore upiriko-isi kakuakuevoi pitokava iare garepa visivi.'),
   ('xp', 'Iovore i katkatim kaukau i go long sospen.'),
   ('xe', 'Iovore broke the sweet potato into pieces in the saucepan.')]),
 ('kakupaa',
  [('rt', 'kaku'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'landslide'),
   ('ge', 'mudslide'),
   ('tkp', 'graun i bruk'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Vosia kokeva tavira kokeo ra rakakupape 13 days.'),
   ('xp',
    'Taim rain i pundaun bai araun i bruk inap 4 days or taim guria earthquake I kam stong bai groun I bruk.'),
   ('xe', 'When the rain ???.'),
   ('ex',
    'Vosa vovava voki roia kove kokeva ra kakupape ravurike tape vusireve rakakupape.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Rirova-ia kokeva kakupa varae vavo.'),
   ('xp', 'Long traipela rein graun i bruk long hap.'),
   ('xe', 'From ??? a landslide ???.'),
   ('ex',
    'Rirovira kokeva koveoe uva pukuia iava kakupa varae uva Pita vo kovoaro upiriko kovo rakuvo.'),
   ('xp',
    'Ren i pundaun bikpela na graun i bruk kam daun long maunten na karampim gaden kaukau bilong Pita.'),
   ('xe',
    "The rain fell down heavily and a landslide fell from the mountains and it covered up Pita's sweet potato garden.")]),
 ('kakuparei',
  [('rt', 'kaku'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'two halves'),
   ('tkp', 'tuhap'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Sio vo kakuparei purarevoi evaova-ia.'),
   ('xp', 'So i katim diwai long tupela hap.'),
   ('xe', 'Sio made two halves from the tree.')]),
 ('kakupato',
  [('rt', 'kaku'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('dx', 'Central'),
   ('ge', '???'),
   ('tkp', 'samting bilong brukim diwai'),
   ('avm', 'pa_required'),
   ('dt', '02/Sep/2005')]),
 ('kakupie',
  [('rt', 'kaku'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'shout'),
   ('ge', 'yodel'),
   ('tkp', 'singaut'),
   ('tkp', 'bikmausim'),
   ('vx', '1'),
   ('nt', '???'),
   ('dt', '15/Nov/2005'),
   ('ex', 'Kakupierevorao roruvira.'),
   ('xp', '???'),
   ('xe', 'He was shouting merrily.'),
   ('ex', 'Oirato kakupieparevo uvavuva.'),
   ('xp', 'Manpela man i singaut long places.'),
   ('xe', 'A man shouts ???.'),
   ('ex', 'Virepa kakupiepare vavoisio pukui-ia.'),
   ('xp', 'Virepa i saot long maunten.'),
   ('xe', 'Virepa is shouting on the mountain.'),
   ('ex',
    'Virepa rirovira kakupierevo ovusia evaova kove uvare va toerevo koratoa-va.'),
   ('xp',
    'Virepa em i singautim bik maus taim diwai i pundaun bikos em i katim wantaim kapul.'),
   ('xe',
    'Virepa shouted loudly when the tree fell because he cut it with a kapul (on it).'),
   ('ex',
    'Vioviokoa o ipavuia toupaoro kakupiepaivo ovusia vvoea urupiepavo.'),
   ('xp',
    'Ol yangpela man wok long singaut long narapela hap taim mi harim ol.'),
   ('xe', '???')]),
 ('kakupute',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'fern tree'),
   ('tkp', '???'),
   ('nt', 'grows at high elevations'),
   ('sa', 'kakavea'),
   ('sf', 'FLORA'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kakupute kouro oara toupaiveira eisi Tutue pukui vituaro.'),
   ('xp', 'Dispela kain diwai pen i save istap long maunten Balbi tasol.'),
   ('xe', 'The fern tree kakapute lives on Mt. Balbi.')]),
 ('kakutauo',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'flower sheath of palms'),
   ('tkp', 'lim'),
   ('sf', 'FLORA'),
   ('dt', '29/Jan/2005'),
   ('ex', 'Kakutauo oa iava opita kuero purapapiroveira.'),
   ('xp', 'Parao bilong kokonas i save kamap.'),
   ('xe', '???')]),
 ('kakuto',
  [('rt', 'kaku'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'fight club'),
   ('tkp', 'stik bilong pait'),
   ('avm', 'pa_barred'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kakuto ira-ia oisioa oraupopave.'),
   ('xp', 'Pakoo ol i save yusim long pait.'),
   ('xe', 'We always fight with a club.'),
   ('ex',
    'Kakuto ira purapaiveira oeavu iava sirivuko sara iava uva eera kakuto ira-ia upo purapaveira.'),
   ('xp',
    'Stik bilong pait ol i save wokim long wanpela kain limbum na ol i save yusim dispela stik long pait.'),
   ('xe', '???')]),
 ('kakutuiato',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of tree'),
   ('tkp', '???'),
   ('nt', 'leaves have rough edges, used for sandpaper'),
   ('sf', 'FLORA'),
   ('dt', '12/Feb/2005'),
   ('ex',
    'Kakutuiato ira guruvaroja kakuava tuiapa vara gausisi piepaovo. Oaravu kovoara toupa kakutuato gurava roia.'),
   ('xp',
    'Plisave kisim lefi blong em napim ol kain timber bai kam sumt tru em use long plant kan work.'),
   ('xe', '???'),
   ('ex', 'Kakutuiato ira guruvaro oupaiveira kakuraoro gausisipiepasia.'),
   ('xp',
    'Diwai ol i save kisim lip bilong mekim smut ol andoro bilong tamiok.'),
   ('xe', 'The kakutuiato tree, they get its leaves in order to ???.')]),
 ('kakuva',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'frog'),
   ('tkp', '???'),
   ('nt',
    'brown body, cry sounds like the striking of an axe on wood, lives ???'),
   ('dt', '11/Feb/2005'),
   ('ex',
    'Kakuva riropovira gaupaeveira vuri vuri varava vo toupaevei ra viavaia vasia.'),
   ('xp',
    'Kakuva em i save karai bikpela tru em brown body em I save istap tasol long antap long oldiwai.'),
   ('xe', 'The frog was crying loudly ???.'),
   ('ex', 'Kakuva iria gaupaeveira ovaiarovi ora avitoava.'),
   ('xp', 'Rokrok i save krai long avinun na long moning.'),
   ('xe', 'The frog cries in the afternoon and at night.')]),
 ('kakuvira',
  [('rt', 'kaku'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'split-like'),
   ('tkp', 'olsem bruk'),
   ('dt', '11/Feb/2005'),
   ('ex', 'Kakuvira va toeri opita vao kaukuvira va toe.'),
   ('xp', 'Katim hap yu katim kokomas long twopela hap.'),
   ('xe', '???'),
   ('ex', 'Kakuvira opita isi toeri.'),
   ('xp', 'Yu katim kokonas long tupela hap.'),
   ('xe', 'You cut the coconut, splitting it in two.')]),
 ('kameoro',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'camel'),
   ('tkp', 'kamel'),
   ('ig', 'yes'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '12/Feb/2005'),
   ('cmt', "Check original for second example. What's up with tapirs?"),
   ('ex', 'Kameoro vo toupaevo raka tapi, viapau vo toupai.'),
   ('xp', 'Kameoro em i stap long ol dert place i no i stap long bus ia.'),
   ('xe', "The camel lives in a dray place, they don't exist here."),
   ('ex', 'O kameovo iria-ia vokapaivei vo raka tapirs.'),
   ('xp', 'Kameoro ol i usim long mekim sampela kain work.'),
   ('xe', 'The camel, they walk around on it in dry places.'),
   ('ex',
    'Kameoro iria oisioa vokapaive isippairara oira avuaro kotokoto kaepaoro.'),
   ('xp',
    'Kamel em ol i sip ol i save yusim long wokabaut na long karim kago.'),
   ('xe', '???')]),
 ('kandora',
  [('alt', 'ketoroa'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'candle'),
   ('tkp', 'kandol'),
   ('ig', 'yes'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '10/May/2005')]),
 ('kaokao',
  [('rt', 'kao'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'frog'),
   ('tkp', 'rokrok'),
   ('nt', 'mate of epioto, cry sounds like its name'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kaokao vo toupaiveira uuko gaero sirova.'),
   ('xp', 'Dispela rokrok i save i stap tasol long ol wara.'),
   ('xe', '???'),
   ('ex', 'Kaokao oisio raga ita oira osia kakarau.'),
   ('xp',
    'Em wanpela lain rokrok bilong kaikai na ol i save i stap arere long wara.'),
   ('xe', '???')]),
 ('kaokaoara',
  [('rt', 'kaokao'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'invitations'),
   ('tkp', '???'),
   ('dt', '12/Feb/2005'),
   ('ex',
    'Kaokoara vavosio-va uriopai Sisivi-ia. Kaokaoara sipok vigei-re ra avavio aiosia.'),
   ('xp',
    'Ol singaut i kam long sisivi. Ol salim il singaut long yumi bai yumi go kaikai.'),
   ('xe', 'Invitations come from Sisivi. ???.')]),
 ('kaokaoto',
  [('rt', 'kaokao'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'messenger'),
   ('tkp', 'man bilong karim tok'),
   ('dt', '11/Feb/2005'),
   ('ex',
    'Kaokaoto urioro rupika eisi Ibu-ia. Kaokaoto uriovoi eisi-va Kereaka (ibu)(toisikoaro).'),
   ('xp',
    'Man bilong karim tok i kam long Kereaka. Ibu tosikoa wanpela man blong karim tok blong singsing kau long Ibu ikam.'),
   ('xe', 'The messenger came to Ibu. ???.'),
   ('ex', 'Kaekaoto avaroe eisi-re Wakunai.'),
   ('xp', 'Man bilong toksave i go pinis long Wakunai.'),
   ('xe', 'The messenger went to Wakunai.')]),
 ('kapa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'food'),
   ('eng', 'food cooked to break a period of fasting'),
   ('tkp', 'kaikai'),
   ('nt',
    'The food like koora kapa or koie kapa (opossum or pig) was [MISSING]'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Vovokio-ia Uruate ora Toupie koora kapa vatepaere Kokora ovitoaro-pa, ra koora aioparo.'),
   ('xp',
    'Tude bai Uruate na Toupie i givim kapul long pikinini bilong Kokora, bai em i save kaikai kapul.'),
   ('xe',
    "Today Uruate and Toupie is giving possum meat to Kokora's child, and eats possum."),
   ('ex', 'see notebook'),
   ('xp', 'see notebook'),
   ('xe', 'see notebook')]),
 ('kapa',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'break.fast'),
   ('tkp', 'kaikai'),
   ('eng', 'eat after fasting'),
   ('vx', '2'),
   ('arg', 'O'),
   ('cmt', 'Need examples'),
   ('dt', '17/Oct/2005'),
   ('ex',
    'Raviata ira kokai kapare erikapa kavua tokosia oa purarevora erika-ia kokaiarapa rera kavupieoro oa iava rera-pa vara kaparevoi..'),
   ('xp',
    'Aviata i kukim kakaruk long Elika bilong brukim tambu em i bin wokim long tambuim Elika long kakaruk. Olsem na em i givim ol dispela samting long em na em i kaikaim.'),
   ('xe', '???')]),
 ('kapaava',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'teeth'),
   ('tkp', 'bikpela tit'),
   ('nt', 'specifically, molars'),
   ('cmt', "Check vowel length: kapaava or kapava? What's plural?"),
   ('dt', '15/Apr/2006'),
   ('ex', 'Kapaava reuriva iria upiapaoe.'),
   ('xp', 'Nambawan tit bilong mi i pen.'),
   ('xe', 'My molar hurts.')]),
 ('kapai',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'wild pitpit'),
   ('tkp', 'liklik pitpit'),
   ('nt', 'Used for shelves/platforms.'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kapai aoopauparveira urupurapaso riro goru arute.'),
   ('xp', 'Kapai pitpit ol save plain blong wokim bed em stang pela.'),
   ('xe', '???'),
   ('ex', 'Kapaiara oupaiveira varaia uru purapasia.'),
   ('xp', 'Wail pitpit ol i save kisim bilong wokim bet.'),
   ('xe', 'Wild pitpit is used to make beds.')]),
 ('kapara',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'roast without pan or container'),
   ('tkp', 'kukim long paia'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Isivairi korato kapararevoi etokasi raga-ia.'),
   ('xp', 'Isivairi i kukim kapul long paia tasol.'),
   ('xe', 'Isivairi is cooking possum in the fire.'),
   ('ex', 'Orekerovu aue keruara oara oisoa kaparapaive oearovu.'),
   ('xp', '???'),
   ('xe', 'Other people would always roast the bones.')]),
 ('kaparu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'short of'),
   ('ge', 'missing'),
   ('tkp', 'wanpela i lus'),
   ('dt', '08/Jun/2005'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cmt', 'Check whether uporo is really okay...'),
   ('ex', 'Pita vaio ora Sera kaparuivoi Sera raga uporo.'),
   ('xp', 'Ol man i misim Pita na ol i paitim tasol Sera.'),
   ('xe', 'They are missing Pita and Sera and hitting only Sera.'),
   ('ex', 'Kareroepa gaupaoro vore Rotokasi-ia vo uvare vo aaorei kaparuiva.'),
   ('xp', '???'),
   ('xe',
    'He returned to the Rotokas area crying because he missed the father and.'),
   ('ex', 'Pita ora Sera aiterea kaparu vaitere uporo vegei kaparuvoi vavo.'),
   ('xp', 'Pita na Sera kols tru ol kitim tupela na ol misim tupela.'),
   ('xe', 'Peter and Sera ???')]),
 ('kaparuvira',
  [('rt', 'kaparu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'one missing'),
   ('tkp', 'olsem i sot'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Atari vavae vao kaparuvira vateri ragaipi.'),
   ('xp', 'Yu gonim avepela pis hap tasol.'),
   ('xe', '???'),
   ('ex', 'Vovavae rutu toupaivo uva kaparuvira toupai.'),
   ('xp', 'I bin gat paipela i stap tasol wanpela i lus.'),
   ('xe', 'Five are here and one is missing.'),
   ('ex',
    'Sopia vaio ora Kepii Pita vasiaro aioerevo upiriko isi oa iava vairei kaparuvira vovouroe uva sopia raga uporevoi ovusia kepii torievo vegoaro-re.'),
   ('xp',
    'Sopia wantaim Kepii tupela kaikaim kaukau bilong Pita na Pita em i ???.'),
   ('xe', '???')]),
 ('kapatau',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'add to'),
   ('tkp', '???'),
   ('eng', 'augment'),
   ('eng', 'add to'),
   ('eng', 'cap up'),
   ('eng', 'supplement'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Uuko rovu kapatauevere oai tovooro.'),
   ('xp', '???'),
   ('xe', 'She will add to the water putting some other (water in).'),
   ('ex',
    'Rupieri iria reoa kapatauevoi Siuripiri tapo oiso ra toupaeve ita vo Kuritaturi.'),
   ('xp',
    'Rupieri i wokim niupela tok orait gen wantaim Siuripiri bai em i ken i stap long Kuritaturi.'),
   ('xe',
    'Rupieri added words with Siuripiri ??? that he can still be in Kuritaturi.')]),
 ('kapatoro',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'leprous'),
   ('tkp', 'liea'),
   ('dt', '03/Jun/2005'),
   ('ex', 'Kapatoro ro irai kapuora toupai vo kokotoa-ia sivi kapuari.'),
   ('xp', 'Man i got ol soa nogut i stap long em.'),
   ('xe', 'A leper is one who ???.')]),
 ('kapatoroto',
  [('rt', 'kapatoro'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'leperous person'),
   ('tkp', 'lepaman'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Viapau uvaipa va koetapape masin ouragaparovere osa viapau.'),
   ('xp', 'Em kisim manas bilong dispela sore tasol bai i no gat tru.'),
   ('xe', '???')]),
 ('kape',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'unable to meet'),
   ('tkp', 'i no nap bung'),
   ('arg', '???'),
   ('vx', '???'),
   ('cmt',
    'Try to find clearer example to establish transitivity, Class A or B.'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Kapepaivoi siguva.'),
   ('xp', '???'),
   ('xe', 'The joint does not meet together.'),
   ('ex', 'Kapepaavoi, viia kovaeto.'),
   ('xp', 'Han bilong mi i sot bikos yu bikbel.'),
   ('xe', "I can't reach because you're fat."),
   ('ex',
    'Kape vo vavaearei oiratoa iava rikova-ia pituarapa uvare riro varava.'),
   ('xp',
    'Han bilong man i sot long holim bodi bilong meri bikos em i pat tumas.'),
   ('xe', "The man's hands ???."),
   ('ex',
    'Oirato riakova raure uva rera vavaearo kovutoa-ia kapere oisio riakova torievoi uvare rera iava vavaearei. Viapau uvuipae oarea etekupae riakova-ia pitu goruarapa.'),
   ('xp',
    'Man i holim meri na han bilong em i sot long holim bel bilong meri. Taim meri i ranawe bikos tupela han bilong em i sot long holim pas meri.'),
   ('xe',
    'The man is hugging the woman ??? the woman has run away because ???')]),
 ('kapeaa',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'flimsy'),
   ('eng', 'insubstantial'),
   ('eng', 'flimsy'),
   ('eng', 'unstable'),
   ('tkp', 'no strong tumas'),
   ('tkp', 'i no pas tumas'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Vo kovo kapeapape.'),
   ('xp', '???'),
   ('xe', 'The work is unstable.'),
   ('ex',
    'Oirato kapea vovouvira touparevoi oa iava rasii varu-ia vori ouroe uva viapau rera-pa rasi avaiei.'),
   ('xp',
    'Man i no gat strongpela tingting olsem na em i salim hap graun na em i nogat graun nau.'),
   ('xe', 'The man was weak-minded and therefore he sold and ???.')]),
 ('kapeaa',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'flimsy'),
   ('tkp', 'no strong tumas'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cmt', 'Think about a better gloss'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Rataoa kapearevoi viapau va gorurevo.'),
   ('xp', 'Em i no bin pasim strong doa.'),
   ('xe', "He didn't ??? the door, ???."),
   ('ex', 'Aiteto viapau oisio kakaevure kapeapareveira voea-re reopaoro.'),
   ('xp', 'Papa i no save isi long ol pikinini taim em i toktok long ol.'),
   ('xe', 'Dad ??? talking to them.'),
   ('ex', 'Ae, rera kapeaatavo uva torirevo.'),
   ('xp', 'Hei, yupela isi long man ia na em i ranawe.'),
   ('xe', 'hey, you ??? him and he ran away.'),
   ('ex', 'Oira-ia tupareve aiteto oira kapeapareveira.'),
   ('xp', '???'),
   ('xe', 'He always is paying in advance to secure her with her father.')]),
 ('kapeaavira',
  [('alt', 'kapeaapavira'),
   ('rt', 'kaapea'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'flimsy'),
   ('ge', 'loosely'),
   ('tkp', 'no strong tumas'),
   ('tkp', 'i no pas tumas'),
   ('cmt', 'Check vowel length'),
   ('dt', '29/Oct/2005'),
   ('ex', 'Opeita kaaeavira pitupari- opeita kaapeavira torepau.'),
   ('xp', 'Yumas holim pasi stong - yu mas sanap stong.'),
   ('xe', '???'),
   ('ex', 'Kaapeavira toupaivoi vao kepa.'),
   ('xp', 'Dispela haus i no strong.'),
   ('xe', 'This house is ???.'),
   ('ex',
    'Kapeavira raga Iteipirie sigoa-ia pitupaevo uva kove oira togasia.'),
   ('xp',
    'Iteipirie em i no holim pas gut naip. Na naip i pundaon na sutim em.'),
   ('xe', 'Iteipirie held the knife loosely and it fell down, cutting her.'),
   ('ex',
    'Teiavi kaapeavira aio pitoka ivitaevo uva epusi ivita veraevo aioara aiosia.'),
   ('xp',
    'Teiavi em i no pasim gut sospen kaikai, na pusi i opim sospen na kaikaim ol kaikai.'),
   ('xe',
    "Teiavi didn't shut the saucepan tightly and the cat ??? in order to eat all of the food.")]),
 ('kapekape',
  [('rt', 'kape'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'embrace'),
   ('ge', 'grip with arms not meeting'),
   ('tkp', 'i laik pasim han waintim'),
   ('cm', '-ia'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('cmt', 'Can verb appear without oblique argument?'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Opita ipasia kapekapepasivoi.'),
   ('xp', '???'),
   ('xe', 'The two men are embracing the coconnut tree to climb up.'),
   ('ex', 'Kapekapeiraoavoi opita govuko-ia.'),
   ('xp', 'Han bilong mi i no inap long traipela kokonas.'),
   ('xe', 'I tried to get my hands around a ???.')]),
 ('kapekapevira',
  [('alt', 'kapekapepavira'),
   ('rt', 'kapekape'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'with arms barely encircling'),
   ('tkp', '???'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Oirato kapekapevira iiparoi opita atosia.'),
   ('xp', '???'),
   ('xe',
    'The man climbs the coconut tree, nearly embracing the trunk with [MISSING].'),
   ('ex', 'Kapekapevira rutu iiparai opitatoa-ia.'),
   ('xp', 'Mi go antap long kokonas na tupela han bilong mi i sot.'),
   ('xe', 'I am going climbing the coconut without my hands meetings.')]),
 ('kapere',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'swim'),
   ('tkp', 'suim'),
   ('eng', 'swim with part of the body out of the water'),
   ('vx', '1'),
   ('cmt', "There's something weird about the 2nd ex sentence!"),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kaperepareva avakava-ia.'),
   ('xp', '???'),
   ('xe', 'He was swimming on the surface of the ocean.'),
   ('ex', 'Mak kaperepare kavua-ia.'),
   ('xp', 'Mak i suim long raun wara.'),
   ('xe', 'Mark swims in the ???.'),
   ('ex',
    'Oiratoa-va petoviropa opuruva avakava ivaraia uva kaperesiva rogara iare.'),
   ('xp',
    'Man i bin kapsait long kanu hantap long solwara na tupela suim i kam long nambis.'),
   ('xe',
    'The canoe capsized with the man on the ocean and the two of them swam to the shore.')]),
 ('kaperepie',
  [('rt', 'kapere'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'make.swim'),
   ('tkp', 'mekim subim'),
   ('eng', 'make swim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Suke kaperepieivo rovua-ia.'),
   ('xp', 'Ol i mekim Suke subim wara.'),
   ('xe', 'They made Suke swim in the water.')]),
 ('kapi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'beak'),
   ('ge', 'bill'),
   ('tkp', 'maus bilong pisin'),
   ('dt', '11/Feb/2005'),
   ('ex',
    'Riro kaekae kapito kokioto. Vo toupare avakava sirova vuoto riro kaekae kapito aue aioparovei okoe va kaviri.'),
   ('xp',
    'Longpela neck pinic em Isane istap long Solwana wanpela Pisin vuoto em I long pela neck isana kaikaim kuka crab na or kitam.'),
   ('xe', '???'),
   ('ex', 'Riro kaekae kapito ovuveo.'),
   ('xp', 'Kokomo i gat long nek.'),
   ('xe', 'The ??? is long-necked.')]),
 ('kapiaa',
  [('ps', '???'),
   ('dx', 'Central'),
   ('ge', '???'),
   ('tkp', 'bek bilong pisin'),
   ('eng', '???'),
   ('dt', '13/Nov/2005'),
   ('ex',
    'Tako ovuveo kapiaa ro ritarevoi porokoa-ia uva koveroi vo-re rasito uva rera orisia kareroi vo atoia.'),
   ('xp',
    'Tako em i sutim bek bilong kokomo long ponook na kokomo em i pundaun i go daun long graun na em i kisim em na go kukim long peles.'),
   ('xe', '???')]),
 ('kapiaavira',
  [('rt', 'kapiaa'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'available'),
   ('ge', 'easy'),
   ('ge', 'clearly'),
   ('tkp', '???'),
   ('cmt',
    'Is it kapeaa or kapiaa? And is the last vowel really long? Not recognized by informants. Needs to be double-checked.'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Eake-a ra va iava kapiaavira taviro?'),
   ('xp', '???'),
   ('xe', 'What is it that he can explain it clearly?'),
   ('ex',
    'Pita viapau uvuipa ra taviro, uvare viapau oisio kapeavira toupaivoi.'),
   ('xp', 'Pita i no inap tru long tokaut.'),
   ('xe', "Peter can't talk, because he isn't available."),
   ('ex',
    'Veeta tou gasivoi vavaea raga-ia uvare aireviraga riroe oa iava kapeavira vo tou gasirivoi.'),
   ('xp',
    'Mambu yu brukim tasol long han bikos i no olsem i strong bikos i no long taim em i gro.'),
   ('xe', '???')]),
 ('kapikapi',
  [('rt', 'kapi'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'bow of ship'),
   ('ge', 'peak of house'),
   ('tkp', 'poret bilong sip'),
   ('tkp', 'antap long haus'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Uvuoa iava kapikai ora tuguruei raka-ia uvare oira vokapiepato viapau vearovira uraparevo.'),
   ('xp',
    'Poret bilong sip em i bam long rip bikos man i save stiaim in no lukluk gut.'),
   ('xe', "??? because the helmsman didn't ??? good.")]),
 ('kapiro',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'clamp'),
   ('ge', 'hold in place'),
   ('tkp', 'holim pas'),
   ('arg', '???'),
   ('vx', '???'),
   ('cmt', '???'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Pita, uriou ra avekeara kasiiraopara kapiro upirikoara ivarai vara tovotovo evaia kapiroa. Oa oisio kekevira toupai osa sisisi uvare votapopierei toupaiveira.'),
   ('xp',
    'Pita, yu kam na holim ol hotpela ston wantaim clamp na bai yu putim hantap long ol kaukau. Dispela clamp em i sae luk olsem sises bikos tupela hap i pas wantaim.'),
   ('xe', '???')]),
 ('kapiroa',
  [('rt', 'kapiro'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'clamp'),
   ('eng', 'clamp'),
   ('eng', 'vise'),
   ('tkp', 'holim pas'),
   ('dt', '30/Apr/2006'),
   ('ex',
    'Tevire kapiraa-ia evaoarei kapiroparevoi airepa kepa sovaraia oisio ra takuvuvira tarevokovira toave osa varei ririreve goruvira.'),
   ('xp',
    'Tevire em i klamim tupela hap timba insait long niupela haus olsem bai tupela hap tim.'),
   ('xe',
    'Tevire has clamped the two branches with a clamp inside of the new house so that ???.')]),
 ('kapiroko',
  [('rt', 'kapiro'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'clamp'),
   ('ge', 'boundary'),
   ('tkp', 'swinge'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kapirokoa-ia avekeara verapaiveira kasiraopara.'),
   ('xp', 'Ol i save kalamim ol ston i hot.'),
   ('xe', '???'),
   ('ex',
    'Vosia avekeara kapirokopaiveira riakora auero-ia vetapariero oara upievira toupaiveira oa iava kapiroko pariero rataupa varataro-ia pitupaiveira avearai ava vovoarataro.'),
   ('xp',
    'Taim ol meri i save holim ol ston wantaim ol hap mambu i save bum wantaim olsem na tupela hap mambu i save holim ol hap autsait bilong ol ston.'),
   ('xe', '???')]),
 ('kapisi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'tobacco'),
   ('tkp', 'brus'),
   ('tkp', 'tabak'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Kapisi ragai-pa avai ras sivuka. Kapisi vosia reropire kapisi tabacco.'),
   ('xp',
    'Yu gave tabacco bai me smok. Kapisi tabacco taim em I dry na I ready long smokim.'),
   ('xe', '???'),
   ('ex', 'Popoteirara oisioa topeka purapave aue iava kapisi guruvaro.'),
   ('xp', 'Ol waitman ol i save wokim tapak long lip bilong burus.'),
   ('xe', 'White men make tobacco from the leaf of the tobacco leaf.'),
   ('ex',
    'Kapisia oavivu rutu oapa oirara kopipaveira vosia viapau sivukapave.'),
   ('xp',
    'Brus i wanpela samting tru ol man i save dai long sapos ol i no simuk.'),
   ('xe', '???'),
   ('ex',
    'Sivukapairara kapisi paupaveira uvare oavivu rutua oapa upiavira ora ravapaveira oirara ora riakora voeao pea sivukapaveira.'),
   ('xp',
    'Ol man i save smok ol i save planim brus bikos em wanpela samting ol man meri ol i save pilim olsem ol i sik ol manmeri ol i save simuk.')]),
 ('kapisito',
  [('rt', 'kapisi'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'tobacco'),
   ('ge', 'plant with long wide leaves used for smoking'),
   ('tkp', '???'),
   ('sf', 'FLORA'),
   ('nt', 'A bundle of tobacco leaves kapisi is neuter gender.'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Pikaito ira kapisito paurevora riropa guruvato.'),
   ('xp', 'Pikaito i bin palanim burus i gat ol bikpela lip.'),
   ('xe', 'Pikaito planted tobacco plant with big leaves.')]),
 ('kapiu',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'twigs'),
   ('ge', 'kindling'),
   ('tkp', 'liklik haphap stik'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kapiuarava uriou ra tuitui kasi pitupiea.'),
   ('xp', 'Yu kam wantaim ol liklik stik na bai mi statim pie.'),
   ('xe', "Come here with the firewood and I'll build a fire."),
   ('ex',
    'Kapiuara oara evao rao ro-ia toupaiveira gareparaoro oara uvuipai ra ikauvira rutu erakopape uvare garepa raorovia oara riropa raoro vara roia toupaiveira.'),
   ('xp',
    'Ol liklik han bilong diwai i save stap long han bilong ol diwai ol dispela liklik han i save drai kuik bikos em ol liklik han tumas i save stap long skin bilong ol bikpela han diwai.'),
   ('xe', '???')]),
 ('kapiua',
  [('rt', 'kapi'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'bill of bird'),
   ('ge', 'beak'),
   ('tkp', '???'),
   ('cmt', "Is 'ua' part of the word or not?"),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Ovuveo ira-ia riro kaekaeua toupaiveira kapiua ouaiava uvuiparoi ratauavisivi aio ouparo uvare riro kaekaeua kapiua. Ari okare rovu iava kokio kare okarea iava ouarovu kapiuaro porokovira toupaiveira. Oiso osia roia keravo ira iava porokovira toupaiveira kapiua.'),
   ('xp',
    'Longpela maus i save stap long kokomo em i nap kisim kaikai long wai liklik bikos i gat longpela maus. Tasol ol narapela pisin i gat maus i krungut olsem dispela tarangu.'),
   ('xe', '???')]),
 ('kapo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'fasten.cover.strip'),
   ('tkp', 'rabis'),
   ('tkp', 'pasim banis long haus'),
   ('eng', 'join together'),
   ('eng', 'clamp together'),
   ('eng', 'fasten on coverstrips'),
   ('eng', 'put cover strips on house or wall'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Leo kepa kapoparerevoi.'),
   ('xp', 'Leo i wok long pasim wol long ol sitirips.'),
   ('xe', 'Leo is covering the house with cover strips.'),
   ('ex', 'Take kapoave ra opesipe.'),
   ('xp', '???'),
   ('xe', "I'll clamp on the bamboo wall material and it will be finished."),
   ('ex', 'Kaposi tovo kepaio vavao kaposi tovori evoa takei.'),
   ('xp', 'Putim vap timber long wall.'),
   ('xe', '???'),
   ('ex',
    'Leo ira evaova iava kapoara tekarevoi oisio ravara-ia takei takireve vara tapo oira ririoro.'),
   ('xp', 'Leo em i sapim ol diwai bilong pasim wol na nilim wantaim.'),
   ('xe', '???')]),
 ('kapoa',
  [('rt', 'kapo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'cover.strip'),
   ('tkp', 'banis long haus'),
   ('eng', 'cover strips on house or wall'),
   ('cmt', 'Check vowel length--kapo vs. kapoo!'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kapooara tekapaivoi ragai vokeparo takear?? kapoosia.'),
   ('xp',
    'Ol i sapim ol lik strip diwai bilong pasim wol long haus bilong mi.'),
   ('xe', 'They are ??? cover strips'),
   ('ex',
    'Ragai urioparai kata-ia ragava kapoa uvare kepa rutu opesiera osa vakapoapavora.'),
   ('xp',
    'Mi bai mi kam wantaim wanpela hap diwai tasol bikos haus i bin pinis olgeta taim mi pasim wantaim ol hap diwai taim mi bin wok long pasim wantaim ol strip.'),
   ('xe', '???')]),
 ('kapokao',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'kapok tree'),
   ('tkp', '???'),
   ('dt', '29/Jan/2005')]),
 ('kapokapo',
  [('rt', 'kapo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'fasten.cover.strips'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Takei kapokapoa ari kepa opesiei.'),
   ('xp', 'Mi nailim wol bikos haus i pinis.'),
   ('xe', '???'),
   ('ex', 'Kepa iava takei kapokapo.'),
   ('xp', 'Yu putim sitirip long olgeta wol bilong haus.'),
   ('xe', '???'),
   ('ex',
    'Kepa iava takei kapokapoa teapi koveovere rara kiuvu urioro oira peosia rakoveo rasitoa iare.'),
   ('xp',
    'Wol bilong haus mi papas wantaim ol hap diwai nogut i pundaun sapos win i kam bai pusim i go daon long graun.'),
   ('xe', '???')]),
 ('kapokapoa',
  [('rt', 'kapokapo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'cover.strips'),
   ('tkp', 'diwai i pasim banis bilong haus'),
   ('eng', 'cover strips of wall or house'),
   ('dt', '30/Oct/2005')]),
 ('kapokapora',
  [('rt', 'kapora'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', "carry between two people's shoulders"),
   ('tkp', 'karim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Ararai kapokaporosivoi Visa vaio ora Apoka.'),
   ('xp', 'Visa wantaim Apoka, tupela i karim Arari i go.'),
   ('xe', 'Visa and Apoka are carrying Arari between their shoulders.'),
   ('ex',
    'Vuri vakato oirato kopoka pora tvaiterei vi kapokapora pasi osa vaokavi. Ragai kopokapora si.'),
   ('xp', 'Yut bai mi woik bout namel long tuopela bikos lags blong em ino.'),
   ('xe', '???'),
   ('ex',
    'Siropiri kapokaporopaoro avase ausiki iare vutuouarei iava uvare koveroe.'),
   ('xp',
    'Tupela man i karim Siropiri long tupela soldia i go long haus bikos em i pundaun.'),
   ('xe', '???'),
   ('ex', 'Rera kapokaporoereva voa raivaaro.'),
   ('xp', '???'),
   ('xe',
    'The two (girls) carried him (supporting him) between their shoulders.'),
   ('ex', 'Rera kapokaporooro avasi ari viapau uvuipa vokasia.'),
   ('xp', 'Yutupela karim em i go, emi ni inao wokabaut.'),
   ('xe', "You two carry him between your shoulders, because he can't walk."),
   ('ex',
    'Ata vaio ora Vavaviri aiterea Perapera kapokaporopaoro avasie eisi-re ruvarupa tapi. Vaiterei vutuoaro-ia vokaparevo uvare vurivira upiaparoe rutu.'),
   ('xp',
    'Ata wantaim Vaviri tupela i go wantaim Perapera long haus sik. Em i wokabaut namel long solda bilong tupela bikos em i sik nogut tru.'),
   ('xe',
    'Ata and Vavaviri, the two of them went carrying Perapera on their shoulders to the medical station. He walked (leaning) on their shoulders because he was really sick.')]),
 ('kapokaporo',
  [('rt', 'kaporo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'grip'),
   ('eng', 'grip'),
   ('eng', 'hold'),
   ('tkp', 'holim'),
   ('dt', '12/Jul/2006')]),
 ('kapokari',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'type of vine'),
   ('tkp', '???'),
   ('nt', 'red, used for weaving'),
   ('dt', '14/Sep/2004'),
   ('ex', 'Kapokari ova oa iava oaravu ruvaruara purapaiveira.'),
   ('xp', 'Long diwai kapokari ol i save wokim ol marasin.'),
   ('xe', '???')]),
 ('kapokarito',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'tree roots'),
   ('tkp', '???'),
   ('nt', 'used for make red/purple-colored stain'),
   ('dt', '17/Jul/2006'),
   ('ex', 'Kapokarito vavurupato ora vu toupai.'),
   ('xp', 'Kapokarito roots igat kovoara sampela kain work long ai.'),
   ('xe', '???'),
   ('ex', 'Kapokarito ira epao vavo Rarova, ira iava vavurupaara ouavorao.'),
   ('xp', 'Diwai kapokarito i stap long Rarova, mi bin kisim roots long em.'),
   ('xe', 'The tree that is in Rarova, I got roots from it.')]),
 ('kapokoa',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'kapok tree'),
   ('tkp', 'kapok'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kopokoa evaova evaoa kuepapuva vutara-ia.'),
   ('xp',
    'Kapok em I wampela diurai em I save karnu flau ou fruit lovy seasome blong em.'),
   ('xe', '???'),
   ('ex', 'Kapokoo aue purapaive vaio orave uusipasia.'),
   ('xp', 'Ol i save usim bilong mekim pilo.'),
   ('xe', 'They make use of kapok for pillows to sleep on.'),
   ('ex', 'Kapoko pupupuro iava oraveara purapaiveira voriako.'),
   ('xp', 'Ol meri i save wokim pilo long wul bilong kapok.'),
   ('xe', 'From kapok cotton the women make pillows.'),
   ('ex',
    'Kapokoa evaova oa kuepapeira rava iava kueisiro oupaive oravepara purapasia.'),
   ('xp',
    'Kapok em wanpela diwai i save karim ol prut. Ol dispela prut ol save kisim bilong mekim pilo long dispela samting.'),
   ('xe', '???')]),
 ('kapoo',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'poor'),
   ('ge', 'destitute'),
   ('tkp', 'lus long moni'),
   ('sa', 'apota'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kapooparoi aue-pa moni.'),
   ('xp', '???'),
   ('xe', 'He is poor, lacking money.'),
   ('ex',
    'Ragaivi kapooparai uvare viapau ragai kakau kovovai raoa-ia vori oupara.'),
   ('xp',
    'Mi tarangu mi lus long moni bikos mi nogat gaden kakau bilong mi bai mi inap kisim mani long em.'),
   ('xe',
    "Little me is poor because I don't earn money from cocoa gardens.")]),
 ('kapooto',
  [('rt', 'kapoo'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'poor person'),
   ('tkp', 'man i no gat moni'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kapooto vi aue-pa moni apotato auepa moni.'),
   ('xp',
    'Poor man long maney wanpela word abotato kapogt - lusia?? Man long samting.'),
   ('xe', '???'),
   ('ex',
    'Sevei oaravu rutu-pa kapooto uvare vegoaro toupareveira. Viapau moni oupa kovorovai oa iava kapootoa rutu arera.'),
   ('xp',
    'Sevei em i lus man tru long olgeta samting bikos em i save stap long bus. Em i nogat ol wok bilong kisim mani olsem na em i stap lusman.'),
   ('xe',
    'Sevei is poor with respect to everything because he is usually in the jungle. ???.')]),
 ('kapoovira',
  [('alt', 'kapoopavira'),
   ('rt', 'kapoo'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'poor'),
   ('ge', 'without much'),
   ('tkp', 'olsem i no gat tru'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kapoovira rutu toupaavoi vovokio-ia.'),
   ('xp', 'Mi i stap lus tru long ol dispela taim.'),
   ('xe', 'I was very poor during this time.'),
   ('ex', 'Kapovira toupari kapovira touba voi ravi vatea aue mon.'),
   ('xp', 'Mi lusiman minogat samting bai mi gavim long yu.'),
   ('xe', '???'),
   ('ex', 'Kapovira toupaavoi vo vuutao-ia aue-pa moni.'),
   ('xp', 'Mi lus tru long mani dispela taim.'),
   ('xe', 'I was poor during this time as far as money is concerned.'),
   ('ex',
    'Ragaivi kapoovira rutu toupaavoi uvare kavirupairara moniara rutu kaviruivorao oa iava kapoovira toupaavoi.'),
   ('xp',
    'Mi trangu nogat moni tru bikos ol stilman ol i stilim olgeta moni bilong mi olsem na mi stap poaman tru.'),
   ('xe',
    "Little me is poor because thieves stole lots of money and that's why I am poor.")]),
 ('kapopaa',
  [('rt', 'kapo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'wrench'),
   ('ge', 'spanner'),
   ('tkp', 'spana'),
   ('dt', '14/Sep/2004'),
   ('ex', 'Kapopa aue-pa varo morn sigo alo eakepa.'),
   ('xp', 'Em sot long laplap maney knife kaikai angthing.'),
   ('xe', '???'),
   ('ex', 'Kapopaa-va uriou.'),
   ('xp', 'Yu kam wantaim sipana.'),
   ('xe', 'You come with the spanner.')]),
 ('kaporo',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'passage'),
   ('tkp', 'pasis apes'),
   ('eng', 'space between objects'),
   ('eng', 'mountain passage'),
   ('eng', 'passage'),
   ('dt', '29/Jan/2005'),
   ('ex', 'Kaporoa vao oa toupai vopa Togaraoia ora vo Kikisova.'),
   ('xp', 'I gat baret namel long Togarao na Kikisova.'),
   ('xe', '???')]),
 ('kaporo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'grip'),
   ('eng', 'grip'),
   ('eng', 'hold'),
   ('tkp', 'holim'),
   ('dt', '12/Jul/2006')]),
 ('kaporopa',
  [('rt', 'kaporo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'clamp'),
   ('tkp', 'samting bilong holim ol samting'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Ragai aveke kaporopa puraavoi oa-ia avekeara kaporopaavere vosia upiriko vagi purasia auepara.'),
   ('xp',
    'Mi wokim samting bilong olim ol ston taim mi laik wokim mumu kaukau,'),
   ('xe',
    "I'm making a stone clamp with which I will turn the stones when I am preparing sweet potato ???.")]),
 ('kaporoto',
  [('rt', 'kaporo'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'scissors'),
   ('tkp', '???'),
   ('dt', '14/Sep/2004'),
   ('ex', 'Kaporotoava uriou ra varoa tokoa.'),
   ('xp', 'Yu kam wantaim sisisi, bai mi katim laplap.'),
   ('xe', '???')]),
 ('kapoto',
  [('rt', 'kapo'),
   ('ps', '???'),
   ('ge', 'poor man'),
   ('tkp', 'rabisman'),
   ('dt', '17/Sep/2004'),
   ('ex', 'Kapoto vi auepa orekerovu mani auepa sigo.'),
   ('xp', 'Lusiman long ol samting mani na long knife.'),
   ('xe', '???'),
   ('ex', 'Oraeviri kapoto oreke rovu rutupa.'),
   ('xp', 'Oraeviri em i lusman tru long ol narapela samting.'),
   ('xe', '???')]),
 ('kapu',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'cup'),
   ('eng', 'cup'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '29/Jan/2005'),
   ('ex', 'Gisipo kapupierevoi kopiioro.'),
   ('xp', 'Em i pasim maus na dai.'),
   ('xe', 'He closed his mouth while dying.')]),
 ('kapua',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'sore'),
   ('tkp', 'sua'),
   ('eng', 'sore'),
   ('eng', 'infection'),
   ('eng', 'wound'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kapua eva vii kokotoaro-ia toupai.'),
   ('xp', 'Soa i stap long lek bilong yu.'),
   ('xe', 'That sore is on your leg.'),
   ('ex',
    'Kapua kakaetoa-ia toupai uvare aakova viapau rera sisiupaeveira oa iava eva kapua rera kokotoaro vuripievo.'),
   ('xp',
    'Soa i stap long pikinini bikos mama i no save wasim em olsem na soa bagarapim lek bilong em.'),
   ('xe',
    "The sore was on the child because his mother didn't wash him and therefore that sore damaged his leg.")]),
 ('kapua',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'have sores'),
   ('tkp', '???'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('dt', '01/Apr/2006'),
   ('ex', 'Riakova kapuapaoi uvare vatuato oira uporevoi vurivira rutu.'),
   ('xp', 'Meri bai gat sua bikos man bilong em i paitim meri nogut tru.'),
   ('xe', '???'),
   ('ex', 'Gisipo ragai iava kapuaoi uvare tavute isi aioavoi kopupa isi.'),
   ('xp', 'Maus bilolng mi i sua bikos mi kaikaim mango i no red.'),
   ('xe', 'My mouth is sore because I ate a red mango.')]),
 ('kapuapato',
  [('rt', 'kapua'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'fruit damaged by insect or disease with mark (sore) on it'),
   ('tkp', '???'),
   ('dt', '14/Sep/2004'),
   ('ex',
    'Kapuapoto oveuto tavuteto kapuapato vi vigoa vuriue rutu. Kapuapato vigoa gesipau.'),
   ('xp',
    'Kapka em I gat sore mago em I sat sore yu gat sore yu smell long ol sore istap long yu dog igat sore sore man.'),
   ('xe', '???'),
   ('ex', 'Kapuapato era oveuto.'),
   ('xp', 'Kapiak i gat soa.'),
   ('xe', 'The breadfruit is damaged.')]),
 ('kapuapie',
  [('rt', 'kapua'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'wound'),
   ('tkp', 'mekim sua'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Rera kapuapierivora.'),
   ('xp', '???'),
   ('xe', 'You wounded him so that he had sores on his body.'),
   ('ex', 'Auro, era kapuapieri rera ragipaoro.'),
   ('xp', 'Hei, yu stikim em na em i gat soa.'),
   ('xe', 'Hey, you gave him sores by beating him.'),
   ('ex',
    'Oirato kapuapieiva rera upooro uvare kepaa tokooro koataroe torara kavirusia.'),
   ('xp',
    'Man ol i mekim sua taim ol i paitim em bikos em i brukim haus na em i go insait na stilim tamiok.'),
   ('xe',
    'They injured the man by hitting him because he went and broke into a house to steal an axe.')]),
 ('kapuasisi',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'type of vine'),
   ('tkp', '???'),
   ('nt', 'covers everything'),
   ('dt', '10/Feb/2005'),
   ('ex', 'Kapuasisi oa oiso toupai osa aue iro rirovira toupai vo rotokasi.'),
   ('xp', 'Kapuasisi em wampela rop am planin long hio long rotokas.'),
   ('xe', '???'),
   ('ex', 'Kapuasisi kouro kakauara erakopiepaiveira vara-ia iipapaoro.'),
   ('xp',
    'Kapuasisi em i save go antap long ol kakao na ol kakao i save drai.'),
   ('xe', 'The kauasisi vine dries out the cocoa by climbing on top of it.')]),
 ('kapupie',
  [('rt', 'kapu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'close tight'),
   ('tkp', 'taitim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Jun/2005'),
   ('cmt',
    "Syntax is weird and Rotokas doesn't seem to match Tok Pisin. What's up? Where's the tense on the second example?"),
   ('ex', 'Ivita kapupieeva goruvira rakapa tou iava.'),
   ('xp', 'Yu pasim strong lit bilong dram.'),
   ('xe', '???'),
   ('ex', 'Tupa kapupiea goruviva rutu ra viapau irai va karureve.'),
   ('xp', 'Mi pasim stong door mom wanpela mav ino nap opim.'),
   ('xe', 'I close the door very strongly and nobody can open it.')]),
 ('kapupiea',
  [('rt', 'kapu'),
   ('ps', '???'),
   ('ge', 'lock'),
   ('tkp', 'slos'),
   ('cmt',
    'Check vowel length; second sentence seems to be missing a verb (special construction?)'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Taupirie, uriou kapupiea kekesia kepa-ia oa puraivo vegei-re teapi voa uusive.'),
   ('xp',
    'Taupirie, yu kam na lukim dua i pas ol i mekim long mitupela olsem bai yumi no inap insait na slip.'),
   ('xe', '???'),
   ('ex',
    'Ragai raiva-ia vokapaavo uva kepa-ia kapupiea kekeava oa iava vorevira vorerae uvare viapau uvuiparae rakoatara.'),
   ('xp',
    'Mi wokabaut i go long rot na mi lukim haus i pas olsem na mi go bek bikos mi no inap go insait.'),
   ('xe',
    "I walked along the road and I saw the locked house and therefore I returned because I couldn't the door.")]),
 ('kapupiepaa',
  [('rt', 'kapupie'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'pincers'),
   ('tkp', 'pinsis'),
   ('cmt', 'Check vowel length'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Vii kapupiepa vi toro vi gisipoara.'),
   ('xp', 'Bai mi lockim masi blong yu.'),
   ('xe', '???'),
   ('ex',
    'Vao kapuepa va-ia goruviara pituoro. Vokeruo kapupiepa parais keru.'),
   ('xp', 'Pinccers em blong holim samting.'),
   ('xe', '???'),
   ('ex',
    'Kapupiepa puraivo Teiavi tavaripieoro uvare Sipirievi kepiivo kikipaoro.'),
   ('xp',
    'Olo putim Teiavi long kisim ples bilong Sipirievi bikos ol i brukim lek bilong em tain ol i pila soka.'),
   ('xe', '???')]),
 ('kapuu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'dumb'),
   ('ge', 'not speaking'),
   ('tkp', '???'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('dt', '16/Nov/2005'),
   ('ex', 'Roku kapurevo, viapau uvuipa reoreosia.'),
   ('xp', 'Roki em i pasim maus pinis, em i no inap toktok.'),
   ('xe', "Roki is dumb, he can't speak."),
   ('ex', 'Kapuuevoi.'),
   ('xp', '???'),
   ('xe', 'She (=the mouth) is dumb.'),
   ('ex',
    'Gisipo kapuuvira raga toupaevo Sera iava uvare varoa kaviruevo uva kapuuvira raga toupaevo. Viapau uvuipaoe ra reoreopao uvare aripaoe evo pitupitu-ia oa puraevo.'),
   ('xp',
    'Maus bilong Sera i stap pas tasol bikos em i stilim laplap na em i stap maus pas tasol bikos em i sem long pasin em i mekim.'),
   ('xe',
    "Sera's mouth was shut because she stole clothes and she kept her mouth shut. She couldn't talk because she was ashamed of the deeds she had done.")]),
 ('kapuupie',
  [('rt', 'kapuu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'close'),
   ('tkp', '???'),
   ('eng', 'close (mouth or jaws)'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '16/Nov/2005'),
   ('ex', 'Spana kapuupieri.'),
   ('xp', '???'),
   ('xe', 'Close up the jaws of the wrench.'),
   ('ex', 'Ragai tupa kapuupiea.'),
   ('xp', 'Mi lokim doa.'),
   ('xe', "I'm closing the door."),
   ('ex', 'Orakapuupieroe kopiioro.'),
   ('xp', 'Taim man i dai em i pasim maus bilong em.'),
   ('xe', '???')]),
 ('kapuupiepa',
  [('rt', 'kapuu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'pincers'),
   ('tkp', '???'),
   ('dt', '15/Feb/2004')]),
 ('Kara',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('eng', 'name'),
   ('ig', 'yes'),
   ('dt', '26/Jan/2005')]),
 ('kara',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'leaf'),
   ('tkp', 'lip bilong wokim mat'),
   ('nt', 'used for mat weaving'),
   ('dt', '10/Feb/2005'),
   ('ex', 'Kara tria guruvaroia, mat uspara turupai.'),
   ('xp', 'Kara karuaka usin long mekim sleep mat.'),
   ('xe', '???'),
   ('ex', 'Kara kouro oara kaarukaa purapaveira riakora.'),
   ('xp', 'Long lip bilong karaa ol meri i save wokim mat.'),
   ('xe', 'The kara leaf, women make leaves from it.')]),
 ('karaava',
  [('rt', 'karaa'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'large Chinese bamboo'),
   ('tkp', '???'),
   ('nt', 'cultivated'),
   ('cmt', 'Is this related to kara?'),
   ('dt', '29/Jan/2005'),
   ('ex',
    'Karaava oiso toupaiveiro osa aue veta (siaira vetata) votoupai siaira veta.'),
   ('xp', 'Karaava bamboo em i blong chinase.'),
   ('xe', '???'),
   ('ex', 'Auea vao etepeua oua vaisipaiveira toisikopairara oisio karaava.'),
   ('xp', 'Ol Toisiko ol i save kolim het bilong suka ken olsem karaava.'),
   ('xe', '???')]),
 ('karakarao',
  [('alt', 'karakaro'),
   ('rt', 'karao'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'partial'),
   ('ge', 'take without permission'),
   ('tkp', '???'),
   ('cmt', 'Double check that karakaro is in fact a possible alt'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Karakaraovira rutu ro itova ourevo ake asiavira.'),
   ('xp', 'Man ia kisim banana na em i no askim pastaim.'),
   ('xe',
    'Without asking persmission at all he took the banana without asking.'),
   ('ex', 'Karakaraoiraouei oirato kukuakue atoro.'),
   ('xp', 'Man yu no askim pastaim na yu daunim garip bilong mi.'),
   ('xe', '???'),
   ('ex',
    'Vavaviri karakaraoroe tuupato ouoro ata reraro viapau vearo vaisi va iava rera ourevoi.'),
   ('xp',
    'Vavaviri em i kisim nating traus bilong Ata nogat tok orait na em i kisim.'),
   ('xe', '???')]),
 ('karakaraoa',
  [('rt', 'karakarao'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'trough for carrying liquids'),
   ('tkp', 'diwai bilong karim wara'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Karakaraoa-ia uukoa uriopieri.'),
   ('xp', 'Yu putim wara i kam long hos paip.'),
   ('xe', 'Bring the water for the trough.'),
   ('ex',
    'Ragai uukoa karakaraoa tetevu kuriare uva tetevu gagaukoa-ia uriopae ukoa tetevu kuri iare oa iava aioa orapurae.'),
   ('xp',
    'Mi kisim wara long pangal bilong saksak olsem na wara i kam long meme bilong saksak, olsem na kaikai bilong saksak i kamap.'),
   ('xe', '???')]),
 ('karakaraoto',
  [('rt', 'karakarao'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'boldly possessive person'),
   ('tkp', '???'),
   ('avm', 'pa_barred'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Karakaraoto akuku oaravu outo oerovu vararo oirara.'),
   ('xp', 'Man bilong kisim nating samting bilong narapela man.'),
   ('xe', '???')]),
 ('karakaraovira',
  [('alt', 'karakaraopavira'),
   ('rt', 'karakarao'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'take without permission'),
   ('tkp', '???'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Karakaraovira rutu ro itova ourevo ake asiavira.'),
   ('xp', 'Man ia kisim banana na em i no askim pastaim.'),
   ('xe',
    'Without asking persmission at all he took the banana without asking.')]),
 ('karakaroto',
  [('rt', 'karakaro'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'boldly possessive man'),
   ('tkp', 'man bilong kisim man'),
   ('cmt',
    'Is this related to karakarao? Can it be men or women? Is the POS N.HUM or simply N.MASC?'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Roia oirato karakaraotoa rutu akukupapa pitutoa rutu.'),
   ('xp', 'Man ia em i man bilong kisim nating samting bilong narapela man.'),
   ('xe', 'This possessive man is a grabber of ???.'),
   ('ex', 'Apeisi Kare karakaroragaroe asi kue pakosia.'),
   ('xp', 'Kare em i mekim pasin nogut long daunim nating rop buai.'),
   ('xe', '???')]),
 ('karakuku',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'coconut sheath'),
   ('tkp', 'gai imbong'),
   ('dt', '10/Feb/2005'),
   ('ex',
    'Karakuku vo uva opita isoro toupaivura ira ia kotoviva toupaivei aue opita kororo.'),
   ('xp', 'Sheath coconuts i save agamp long em.'),
   ('xe', '???'),
   ('ex', 'Karakuku ira-ia opita isiro toupaiveira.'),
   ('xp', 'Long sut ol kokonas i save hangamap.'),
   ('xe', 'Coconut shells hang on their sheathes.')]),
 ('karaova',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'betel nut'),
   ('tkp', 'kambibi'),
   ('tkp', 'wail buai'),
   ('eng', 'Highland Betel Nut Palm (Areca macrocalyx)'),
   ('cmt', 'But the form <karaoto> also seems to exist...'),
   ('dt', '15/Apr/2006'),
   ('ex', 'Karaova iria vo toupaeveira tuvuara-ia.'),
   ('xp', 'Kavivi i save i stap long tais wara.'),
   ('xe', 'Highland Betel Nut Palms are found in the swamp.'),
   ('ex',
    'Karaova rarepavavi asiva iria vegoaro toupaeveira iria aiopaiveira iso kare vegoarapairara.'),
   ('xp',
    'Wail buai i liklik kain buai i save stap long bus. Em ol tambaran bilong bus i save kaikai.'),
   ('xe', '???')]),
 ('karapi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'sing high pitched'),
   ('tkp', 'sing antap'),
   ('sc', 'EMISSION.SOUND'),
   ('vx', '1'),
   ('cmt', "Can you specify what you're singing (cognate object)?"),
   ('dt', '30/Oct/2005'),
   ('ex', 'Irouto-a karapiroi.'),
   ('xp', '???'),
   ('xe', 'Who is the man singing with the high pitched voice?'),
   ('ex',
    'Kikiviori pupiva-ia karapivira kovaroe uva viapau oirara uvuipae ra oisi karapivira kovaave.'),
   ('xp',
    'Kikiviori em i singsing kaul antap tru olsem na ol man i no inap singsing antap olsem.'),
   ('xe', 'Kikiviori ???.')]),
 ('karapivira',
  [('alt', 'karapipavira'),
   ('rt', 'karapi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'high-pitched'),
   ('eng', 'high-pitched'),
   ('eng', 'soprano'),
   ('tkp', 'antap'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Karapivira erata.'),
   ('xp', 'Yupela sing antap.'),
   ('xe', 'You guys sing in a high pitch.'),
   ('ex', 'Karapivira erapai riakora.'),
   ('xp', 'Ol meri ol i singsing antap.'),
   ('xe', 'Women sing in a high pitch.'),
   ('ex', 'Riakora karapivira erapaveira eraara rutu-ia vovokiro rutu-ia.'),
   ('xp',
    'Ol meri ol i save singsing olsem antap long olgeta singsing olgeta dei.'),
   ('xe', 'The women sang soprano ???.')]),
 ('karara',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'crab'),
   ('tkp', 'kuka'),
   ('nt', 'lives in the beach sand'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Karara vo toupaiveira avaka rogaraia.'),
   ('xp', 'Dispela kuka em i stap long solwara.'),
   ('xe', 'The beach crab lives in saltwater.'),
   ('ex', 'Kararakare vo toupaiveira rogarauaia.'),
   ('xp', 'Bis kuka ol i save i stap long wet sen.'),
   ('xe', 'Beach crabs are found in wet sand.'),
   ('ex',
    'Karara garevavi okoe iria rogaraua-ia toupaeveira vo avaka vatuaro-ia.'),
   ('xp',
    'Karara em i liklik kuka i save stap long waisand arere bilong solwara.'),
   ('xe', '???')]),
 ('karata',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'deal out'),
   ('ge', 'divide up'),
   ('ge', 'apportion'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('nt',
    'When much is to be divided the reduplicated form is used, karakarata.'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Aioara karatareva.'),
   ('xp', 'Em i dilim ol kaikai.'),
   ('xe', 'He divided up the food.'),
   ('ex', 'Sera kukara guru karataevoi.'),
   ('xp', 'Sera i dilim ol kon.'),
   ('xe', 'Sera is dealing out the corn.')]),
 ('karato',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'small stream of water'),
   ('tkp', 'wara ol i pulim long wanpela samting'),
   ('dt', '30/Oct/2005'),
   ('cmt', 'Double-check translation of example'),
   ('ex', 'Vopae karepieivora karatoa-ia.'),
   ('xp', 'Ol i bin pulim wara Vopae long paip.'),
   ('xe', 'They diverted Vopae to a small stream of water.'),
   ('ex',
    'Karato vosia oavu-ia uukovi ivuri oisio osia aue vetapariero uvavu rutu-re. Oisio osia rariketo ora vopae aitere-ia karatoarei puraivora vaiterei karaoro atoia-re vegoarova.'),
   ('xp',
    'Karato i olsem sapos yu pulim wara long wanpela samting i go long wei olsem Rariketo wantaim Vopae ol i bin pulim dispela tupela wara i kam long bus na kam long ples.'),
   ('xe', '???')]),
 ('karavau',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'mushroom'),
   ('tkp', 'papai'),
   ('eng', 'small type of mushroom'),
   ('dt', '29/Jan/2005'),
   ('ex', 'Karavauara vo purapapiroveira tetevu tatagaro-ia.'),
   ('xp', 'Masrum i save kamap long sting saksak.'),
   ('xe', 'Mushrooms appear on the ??? of sago.')]),
 ('karavisi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'angry'),
   ('ge', 'upset'),
   ('tkp', '???'),
   ('vx', '1'),
   ('sc', 'EMOTION'),
   ('dt', '11/Feb/2007'),
   ('cmt', 'Check out this vstem + ovoi construction.'),
   ('ex', 'Ikauvira raga karavisi ovoirovere.'),
   ('xp', '???'),
   ('xe', 'Quickly he becomes very upset.'),
   ('ex', 'Saimon ira karavisi ovoiparoveira.'),
   ('xp', 'Saimon man bilong kros kwik.'),
   ('xe', 'Saimon is a man who quickly gets angry.'),
   ('ex',
    'Karavisi ovoiparoveira Pita. Viapau uvuipa ra reoara roviriei rovopareve. Ari kasipua ikau ovoipaiveira rera-ia.'),
   ('xp',
    'Wan tu tasol Pita i save koros o wari em i no save skelim pastaim ol toktok. Tasol koro i save kam hariap long em.'),
   ('xe', '??? but anger quickly gets to him.')]),
 ('karavisito',
  [('rt', 'karavisi'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'person who is easily angered or upset'),
   ('tkp', '???'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Riro karavisito ro oirato garepara-IA reoreoara kasipupaoro.'),
   ('xp', '???'),
   ('xe', 'Easily at just a little talk that man gets angry.'),
   ('ex', 'Devid riro ikaupa karavisito garepa reo rovia.'),
   ('xp', 'Devid man bilong kros hariap long liklik toktok.'),
   ('xe', 'David is a very hot-tempered man ???.')]),
 ('karavuru',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'get dusty'),
   ('tkp', 'kamap das'),
   ('vx', '1'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Karavuruiraouei poupou-ia.'),
   ('xp', 'Yu bagarap tru long das.'),
   ('xe', "You're really dusty."),
   ('ex', 'Kakaeto karavururoe uvare poupou-ia uusiparoe visikopaoro.'),
   ('xp', 'Pikinini em i das bikos em i slip long das taim em i pilai.'),
   ('xe', 'The child is dusty because he slept in dust while playing.')]),
 ('kare',
  [('ps', 'FFP'),
   ('ge', 'animals'),
   ('tkp', 'plenti'),
   ('dt', '17/Oct/2005'),
   ('ex', 'O karevu koie kare kouevo ita akova.'),
   ('xp', 'Pik mama i karim ol narapela pik.'),
   ('xe', 'The mother carried the other pigs.')]),
 ('kare',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'return'),
   ('tkp', 'bekim'),
   ('tkp', 'kam bek'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('sc', 'MOTION'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kareparai atoi iare.'),
   ('xp', 'Mi go bak gen long ples.'),
   ('xe', 'I am going back to the village.'),
   ('ex', 'Petero kareroe eisi atoia.'),
   ('xp', 'Petero i go pinis long ples.'),
   ('xe', 'Petero went to the village.'),
   ('ex', 'Ovu iava kareuei, auoro?'),
   ('xp', 'Yu kam long we?'),
   ('xe', 'Where are you coming from, man?'),
   ('ex', 'Rarasori kare kepa iare uvare ogoeroi uva kare kepa iare aiosia.'),
   ('xp',
    'Rarason em i go long haus bikos em i hangere na em i go long haus bai em i kaikai.'),
   ('xe',
    'Robinson returned to the house because he was hungry and he returned to eat.')]),
 ('karekare',
  [('rt', 'kare'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('sn', '1'),
   ('rdp', 'full'),
   ('ge', 'itch'),
   ('tkp', 'skrap'),
   ('vx', '1'),
   ('sc', '???'),
   ('cmt', "What's the deal with 'voka voiva'? What about kaskas?"),
   ('dt', '30/Oct/2005'),
   ('ex', 'Ae, karekarerai kokeva-ia voka voiva.'),
   ('xp', 'Mi skirap bikos mi raun long ren.'),
   ('xe', 'Hey, I itch from the rain ???.'),
   ('ex',
    'Oirato kaskas iava karekareparoe uvare rera vavaearo aioevo kaskas.'),
   ('xp',
    'Man em i kirap long kaskas bikos kaskas em i kaikaim han bilong em.'),
   ('xe', 'The man is itchy from ??? because he ate the hand of the ???')]),
 ('karekare',
  [('rt', 'kare'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('sn', '2'),
   ('rdp', 'full'),
   ('ge', 'return'),
   ('tkp', 'go bek'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Oirara karekare atoia-re uvare pupipa iare avaera Asitaipa-ia.'),
   ('xp',
    'Ol man i kam bek long ples bikos ol i bin go long singsing kaul long Asitaipa.'),
   ('xe',
    'The men return to the village because they went to sing in Asitaipa.'),
   ('ex', 'Oirara rutu karekareae eisi atoia.'),
   ('xp', 'Olgeta  man i bin go long ol ples.'),
   ('xe', 'Everyone went back to their villages.')]),
 ('karekarererava',
  [('rt', 'karekarearava'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'slit gong signal of patrol officer'),
   ('tkp', 'garamut krai bilong klap'),
   ('dt', '15/Apr/2006'),
   ('ex', 'Karekarearava togaivoi toiva eisi Ibu, ari iravu kopiiroi.'),
   ('xp', 'Ol i paitim garamut long Ibu, bikos wanpela man i dai.'),
   ('xe', 'They beat drums in Ibu because somebody died.')]),
 ('karekareto',
  [('rt', 'karekare'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'person with grille or ringworm'),
   ('tkp', 'man i gat girire'),
   ('dx', 'Pipipaia'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Ririuvea karekareto.'),
   ('xp', 'Ririuve em i girireman.'),
   ('xe', 'Ririuvea is a man with grille/ringworm.'),
   ('ex',
    'Riakora airoa oisioa reasipave oirato apa uvare karekaretoa oisio teapi iriai-re vuri gesiparo oa iava oisoa reasipave riakora rera-pa.'),
   ('xp',
    'Ol meri ol i save les long man bikos em grille man. Olsem nogut em i smel nogut long wanpela meri olsem an ol meri ol i save les long em.'),
   ('xe', '???')]),
 ('kareke',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'appear'),
   ('tkp', 'kamap'),
   ('eng', 'appear'),
   ('eng', 'happen'),
   ('eng', 'come to be'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '03/Jun/2005'),
   ('ex', 'Viapau oiso vearoavai karekepape kovoa oaio-ia.'),
   ('xp', '???'),
   ('xe', 'Nothing good would come from our (exclusive) work.'),
   ('ex', 'Vape oa oisiri avu karekepere rara.'),
   ('xp', 'Larim bai em i mas kamap yet.'),
   ('xe', '???')]),
 ('karekepie',
  [('rt', 'kareke'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'show'),
   ('tkp', 'soim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Opesioto ira kokopuoto karekepiepareveira.'),
   ('xp', 'Em binatang i save kamapim bata palae.'),
   ('xe', 'The butterfly becomes a butterfly.'),
   ('ex', 'Gavman vii-pa tauva karekepiereve.'),
   ('xp', '???'),
   ('xe', 'The government would come up with some help for you.'),
   ('ex', 'Tugoropato uraurato vigei varaaro vuriara karekepiepareveira.'),
   ('xp', 'Holi spirit i save kamapim ol pasin nogut bilong yumi.'),
   ('xe', 'The holy spirit shows our sins.'),
   ('ex',
    'Rarasori rera vo kovoaro karekepiesia kare rovere rera rirotoaro ruvaraia.'),
   ('xp',
    'Rarason bai em i go bek na soim wok bilong em long bikman bilong em.'),
   ('xe', 'Robinson will return to show his work to the bigman.')]),
 ('karekova',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'vine'),
   ('tkp', 'rop i gat wara'),
   ('nt', 'water can be extracted from it'),
   ('dt', '29/Jan/2005'),
   ('ex',
    'Karekova iriaia varia tupaveira auere kora. Ora uvuipau ra oira tokori ra ukaiou.'),
   ('xp',
    'Rop bilong bus ol i save wokim trap bilong kisim kapul, tu sapos yu raun long bus yu ken katim na dring long em.'),
   ('xe', '???')]),
 ('kareo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'penetrate'),
   ('ge', 'pierce through'),
   ('tkp', 'sutim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Riopeiri kakau kareore oira iiaoro.'),
   ('xp', 'Riopeiri i sutim dok na spia i kamap long narapela sait.'),
   ('xe', 'Riopeiri is shooting the dog spearing it.'),
   ('ex', 'Oirato kaakau kareorevora rera iiaoro koetava-ia.'),
   ('xp', '???'),
   ('xe', 'The man shooting the dog with a bow and arrow pierced it through.'),
   ('ex', 'Riopeiri kaakau kareorevo sigoa-ia uva kopioe.'),
   ('xp', 'Riopeiri em i sutim dok long naip i dai.'),
   ('xe', 'Riopeiri stabbed the dog with a knife and it died.')]),
 ('kareovira',
  [('alt', 'kareopavira'),
   ('rt', 'kareo'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'clear through-like'),
   ('tkp', '???'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kareovira vusire.'),
   ('xp', '???'),
   ('xe', 'He burst throught from one side to the other.'),
   ('ex', 'Kareovira toupaevoi rauruva.'),
   ('xp', 'Spia i stap long tupela sait wantaim.'),
   ('xe', '???'),
   ('ex',
    'Oirato kekeavo kapua-va oa revasipae uva kareovira va vurapaavo oisio sigoa-ia rera kareoivo.'),
   ('xp',
    'Man mi lukim wantaim sua em i wok long bulut. Na mi lukim olsem ol i sutim long naip.'),
   ('xe',
    'I saw the man with a sore that bled and I saw him stabbed as they stabbed him with a knife.')]),
 ('karepie',
  [('rt', 'kare'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'return'),
   ('tkp', 'bekim'),
   ('eng', 'return'),
   ('eng', 'pay back'),
   ('eng', 'send back'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Vii vaaro vukua oa vii iare karepieavere.'),
   ('xp', '???'),
   ('xe', 'I will return your book to you.'),
   ('ex', 'Oire vii varo vukua karepieavere.'),
   ('xp', 'Em i orait, bai mi bekim buk bilong yu.'),
   ('xe', "Okay, I'll give you your book back.")]),
 ('karepieto',
  [('rt', 'kare'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'entrance to village'),
   ('ge', 'village entrance'),
   ('tkp', 'maus rot'),
   ('dx', 'Central'),
   ('dx', 'Pipipaia'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Aia vavo situepasivere karepieto.'),
   ('xp', 'Hei, bai yutupela i was long maus rot.'),
   ('xe', '???'),
   ('ex',
    'Oiratoarei karepietoa-ia ora aivaropiesiei uva aioara-ia oratavariosiei.'),
   ('xp', 'Tupela man i bung long maus rot na tupela i senis long ol kaikai.'),
   ('xe',
    'The two men met at the entrance to the village and exchanged food.')]),
 ('Karepirie',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ig', 'yes'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('dt', '25/Jan/2005')]),
 ('kari',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'rip'),
   ('ge', 'tear'),
   ('tkp', 'bruk'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dcpv', 'true'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Ragai varoa karia.'),
   ('xp', 'Mi brukim laplap.'),
   ('xe', 'I ripped the clothes.'),
   ('ex', 'Sera, varoa kari eva ra kapua tukea va-ia teapi revasipe.'),
   ('xp', 'Sera, yu brukim laplap bai mi pasim sua nogut em i blut.'),
   ('xe',
    "Sera, rip the clothes and I'll cover my sore with them lest it bleed.")]),
 ('karia',
  [('rt', 'kari'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'hole'),
   ('ge', 'rip'),
   ('ge', 'tear'),
   ('tkp', 'hol'),
   ('tkp', 'bruk'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Karia eva varoa-ia.'),
   ('xp', 'Bruk i stap long laplap.'),
   ('xe', 'There is a rip in the clothes.'),
   ('ex',
    'Risevi Pairi tavievo oisio vii reraro-ia koros karia toupaivoi rera verari ra rera turua vii-pa.'),
   ('xp',
    'Risievi tokim Pairi olsem long kolos bilong yu i gat hul, yu raosim na bai mi somapim bilong yu.'),
   ('xe',
    "Risevi told Pairi, your clothes have holes, take them off and I'll sew them for you.")]),
 ('kariava',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'lizard'),
   ('tkp', 'palai'),
   ('nt', 'dark body with tan to yellow scales, no spines, skin used for ???'),
   ('cmt', 'What does kundu  mean in Tok Pisin?'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kariava rakariaro oupaiveira aue iare kundu touro vara tovopasia.'),
   ('xp', 'Ol i save kisim skin bilong palai na putim long kundu.'),
   ('xe', 'They get the skin of the ??? lizard in order to put it on drums.'),
   ('ex',
    'Kariava iria oisio kekevira toupaeveira osia oteote iria iava rakaria oupaiveira aue purapasia kundu touro aue-re vara-ia kovapasia.'),
   ('xp',
    'Palai i save luk olsem pukpuk. Skin bilong em ol save kisim na mekim ol kundu bilong singsing.'),
   ('xe',
    'The ??? lizard resembles a crocodile from which they get the skin to make drums for singing with.')]),
 ('karikari',
  [('rt', 'kari'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'tear'),
   ('ge', 'shred'),
   ('tkp', 'brubru kim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Pepa karikariri rera-pa va vatesia.'),
   ('xp', 'Yu brubrukim pepa na givim long em.'),
   ('xe', 'Shred the paper and give it to him.'),
   ('ex', 'Pepa karikari eva ra va vate Sera-pa.'),
   ('xp', 'Yu brukbrukim pepa na givim long Sera.'),
   ('xe', 'Rip the paper and give it to Sera.')]),
 ('karirapa',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'young banana leaf'),
   ('tkp', 'yangpela lip bilong banana'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Airepa guruva itotoa iava oaoupaiveira varatarata pasia tuitui kasiro-ia. Evaoa vaisipaiveira oisio karirapa ra vo guruva-ia aioara tovotovopaive vasiovar-ia ra voru tovoive aveke oriia ivaraia ra vo vagi rakuive.'),
   ('xp',
    'Yangpela lip banana ol i save kisim na boenim long paia. Na long dispela lip ol i save putiputim ol kaikai insait long em. Na dispela karamap bai ol i putim long ston i paia na bai oli karamapim dispela mumu.'),
   ('xe', '???')]),
 ('karisito',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'Christ'),
   ('tkp', 'krais'),
   ('ig', 'yes'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Karisito vigei varaaro-ia vuriara kopiroepa.'),
   ('xp', 'Krais i bin dai long ol sin bilong yumi.'),
   ('xe', 'Christ died for our sins.')]),
 ('karivai',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'have an appetite'),
   ('tkp', '???'),
   ('eng', 'have an appetite'),
   ('nt', 'also has a sexual dimension'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Karivaiparai.'),
   ('xp', '???'),
   ('xe', 'I have a desire for food.'),
   ('ex', 'Rirovira rutu karivaiparai.'),
   ('xp', 'Mi sikirap tru long koap.'),
   ('xe', 'I am very horny.'),
   ('ex',
    'Avukato rirovira karivaiparoe oisio ra varuavai aioreve oisio osia aue kokotu ora koie.'),
   ('xp', 'Lapun man em i skirap tru long kaikai abus olsem kakaruk o pik.'),
   ('xe',
    'The old man had a big appetite to eat meat such as chicken or pig.'),
   ('ex',
    'Oirato kaakau rirovira karivaiparoe uvare riakova kaakau geesirevo ovusia kekira va toupaevo.'),
   ('xp',
    'Dok man em i skirap tru taim em i smelim dok meri taim em i gat sik mun.'),
   ('xe',
    'The male dog he was really horny because he smelled the female dog while she was in heat.')]),
 ('karivaito',
  [('rt', 'karivai'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'man who always desires sex'),
   ('tkp', 'man bilong skirap'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Era vuri osireito karivaito.'),
   ('xp', 'Ai nogut ia man bilong sikirap long koap.'),
   ('xe', 'The horny man has a bad eye.'),
   ('ex',
    'Oirato riro karivaito ira toeivoi uvare riakora-va vuri pitupitu purapareveira riro karivaitoa rutu.'),
   ('xp',
    'Man bilong skirap ol i katim em long wanem em i save mekim pasin nogut wantaim ol meri. Man bilong skirap tru.'),
   ('xe',
    'The very horny man, they cut him because he was always up to no good with the women, the big sex fiend.')]),
 ('karivara',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'Buka'),
   ('tkp', 'Buka'),
   ('nt', 'name given to Buka area or Buka people by Rotokas'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Avapaiei eisi-re Karivara-ia.'),
   ('xp', 'Mipela go long Buka.'),
   ('xe', 'We went to Buka.'),
   ('ex',
    'Vaisia vao-ia Rotokasipairara oa vateiva Bukapairara-pa oisio karivara.'),
   ('xp', 'Nem ia ol Rotokas i bin givim long ol Buka olsem Karivara.'),
   ('xe', 'The name that the Rotokas gave to the Buka is Karivara.')]),
 ('karo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'spoon out a liquid'),
   ('tkp', 'sipunim'),
   ('dt', '30/Oct/2005'),
   ('arg', 'O'),
   ('vx', '2'),
   ('ex', 'Uukoavai karori rerapa.'),
   ('xp', '???'),
   ('xe', 'Spoon out some water for him.'),
   ('ex', 'Ukoa karo eva evoa rakapa tou.'),
   ('xp', 'Yu kisim wara long dram.'),
   ('xe', '???'),
   ('ex',
    'Ragai roviva karoavoi sipuru-ia tauo iare uvava vatapipavoi rara ogoera.'),
   ('xp',
    'Mi kapim sup long sipun i go long pleit long hap bai mi dringim bihain.'),
   ('xe',
    "I am spooning the ??? with a spoon to the plate from which I'll drink later ???.")]),
 ('karokaropo',
  [('rt', 'karopo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'deal out'),
   ('eng', 'deal out'),
   ('eng', 'distribute'),
   ('eng', 'send'),
   ('tkp', 'tilim'),
   ('ge', 'skelim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '30/Oct/2005'),
   ('rdp', 'partial'),
   ('ex', 'Rarasori vukuara karokaropoparevoi.'),
   ('xp', 'Rarason em i tilim ol buk.'),
   ('xe', 'Robinson is dealing out the books.'),
   ('ex',
    'Sipito oirara karokaroporevoi kovoara purasia uvare riroara kovoara oa iava vearovira voea karokaroporevoi kovoara rutu iare.'),
   ('xp',
    'Sip em i tilim ol man i go wokim ol wok bikos ol wok i planti olsem na em i tilim gut ol man i go long ol wok.'),
   ('xe',
    'The chief sent out the men to do work because there is a lot of work and therefore he send them all to the work.')]),
 ('karopato',
  [('rt', 'karo'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'spoon'),
   ('tkp', 'spun'),
   ('avm', 'pa_required'),
   ('sc', 'INSTRUMENT'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Sera Viki-re kerapoe, uriou karopatoa-va ra aio pitoka karoa ra va aiope.'),
   ('xp',
    'Sera em i singautim Viki, yu kam wantaim spun na bai mi spunim sospen kaikai bai yumi kaikai.'),
   ('xe',
    "Sera called out to Viki, come with a spoon and I'll spoon the sauce pan and we'll eat.")]),
 ('karopo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'portion out'),
   ('ge', 'divide up'),
   ('tkp', '???'),
   ('nt', 'Repeated action is marked by the reduplicated form karokaropo.'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Aioara karopori voea-pa.'),
   ('xp', '???'),
   ('xe', 'Divide up the food for them.'),
   ('ex', 'Vitera kotokotoara karopoevoi.'),
   ('xp', 'Vitera em i dilim aut ol kago.'),
   ('xe', 'Vitera dealt out the cargo.'),
   ('ex',
    'Vitera vaio ora Jackie varoara karopoerevoi ora vairei vuruvuruia roia oisio ra vairei rutu vaeavira varoara-va toupaere.'),
   ('xp',
    'Vitera wantaim Jackie tupela i tilim ol laplap namel long tupela yet. Olsem bai tupela wantaim stap wantaim ol laplap.'),
   ('xe', 'Vitera and Jackie ??? the clothes and ???.')]),
 ('karot',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('dx', 'Central'),
   ('ge', 'carrot'),
   ('tkp', 'carot'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '18/Nov/2006'),
   ('ex',
    'Karotara ora iava rakariara turuevira toupaiveira orakesikesikevira oara aiopaiveira opopotepairara ora oisio kekepapeira osia upirikoara.'),
   ('xp',
    'Ol karot skin bilong ol i ret o ielo ol waitman i save kaikaim. Na i save luk olsem kaukau.'),
   ('xe', '??? white people eat them and they resemble a sweet potato.')]),
 ('karoto',
  [('rt', 'karo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'rafters'),
   ('tkp', 'sparen'),
   ('dt', '15/Feb/2004')]),
 ('karova',
  [('rt', 'karo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'shelf'),
   ('ge', 'table'),
   ('tkp', 'tebol'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Tauoara tovotovori evoa karova.'),
   ('xp', 'Yu putim ol pelet long selv.'),
   ('xe', 'Put the plates there on the shelf.'),
   ('ex',
    'Ririuto karova purare erako tovopasia. Ra vo karova-ia erakoara posiposipaive kuparetoa-ia ra vara-ia tuitui kasiave.'),
   ('xp',
    'Ririuto mekim bet bilong putim paia wut. Na long dispela bet bai ol paia wut i drai long simuk.'),
   ('xe',
    'Ririuto is making a shelf to put firewood. And on this shelf firewood would dry from smoke and they make fire from them.')]),
 ('karu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'open'),
   ('ge', 'unlock'),
   ('ge', 'untie'),
   ('ge', 'unhook'),
   ('tkp', 'opim op'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cm', '???'),
   ('cmt',
    'I think that this verb takes -re case-marking. Also, it looks like the object follows the verb in the first example! Gotta look into this more.'),
   ('dt', '30/May/2005'),
   ('ex', 'Masta Kostro ira vao karu rovoreva Rotokas taere.'),
   ('xp', '???'),
   ('xe', 'Mr. Kostro opened up the Rotokas area initially.'),
   ('ex', 'Samson rataoa karu.'),
   ('xp', 'Samson, yu opim doa.'),
   ('xe', 'Samson, open the door.')]),
 ('karuka',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'mat'),
   ('tkp', 'mat'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Karuka iria iava mat purapaveira riakora.'),
   ('xp', 'Ol meri i save wokim mat long padanas.'),
   ('xe', 'From the wild pandanus women make mats.'),
   ('ex',
    'Ragai karuka tokosia avaparai ra urupavavai puraa ra iria-ia uusipara.'),
   ('xp',
    'Mi go katim wail pandanats. Bai mi mekim wanpela mat bai mi save slip long wanpela.'),
   ('xe', 'I am going to cut wild pandanus and make a mat and sleep on it.')]),
 ('karukaru',
  [('rt', 'karu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'open'),
   ('tkp', 'opim'),
   ('nt', 'describes action done repeatedly or to many thing'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Ora vurevureoro oira karukarureva.'),
   ('xp', '???'),
   ('xe', 'Moving himself about, he opened up (his bindings).'),
   ('ex', 'Pauro iroiro karukarurevoi uva koie avaoe.'),
   ('xp', 'Pauro em i lusim rop na pik i go.'),
   ('xe', 'Pauro loosened the rope and the pig got away.')]),
 ('karukava',
  [('rt', 'karuka'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'rat'),
   ('tkp', 'mumut'),
   ('eng', 'type of large bush rat'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Karukava riropava iria aiopaiveira oisio kekeva osia isike.'),
   ('xp', 'Rat bilong bus ol i save kaikai, em i wankain olsem rat.'),
   ('xe',
    'The ??? is a large one which they eat, since it looks like a rat.')]),
 ('Karuru',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('dx', 'Central'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('eng', 'name'),
   ('ig', 'yes'),
   ('dt', '15/Feb/2007')]),
 ('karutu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'divide up'),
   ('ge', 'portion out'),
   ('tkp', 'tilim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Moniara rutu karukaruturi.'),
   ('xp', '???'),
   ('xe', 'Divide up all of the money.'),
   ('ex', 'Mak, e monia karuturivo.'),
   ('xp', 'Mak, yu hapim moni.'),
   ('xe', 'Mark, divide up the money.'),
   ('ex',
    'Ravereto voriara karutere uva oaravu vatatoporevoi oarava ava pa eisi-re Buka-ia kotokoto vorisia ari oaravu kavuparevoi kakau orisia vo atoia.'),
   ('xp',
    'Ravereto em i tilim ol mani na em i redim narapela bai em i go baem kago long Buka na bai em i lusim narapela bilong baem kakau long ples.'),
   ('xe', '???')]),
 ('karuvira',
  [('alt', 'karupavira'),
   ('rt', 'karu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'open'),
   ('tkp', 'olsem op'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Rataoa karuvira toupai kepa-ia.'),
   ('xp', 'Doa i op i stap long haus.'),
   ('xe', 'The door on the house is open.')]),
 ('kasi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'build.fire'),
   ('eng', 'start a fire'),
   ('eng', 'make a fire'),
   ('tkp', 'boinim'),
   ('vx', '1'),
   ('sc', '???'),
   ('dcpv', 'true'),
   ('cmt', 'Is it kasi or eto kasi?'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Oire eto kasiaepa, oripaaepa.'),
   ('xp', '???'),
   ('xe', 'They built a fire and cooked.'),
   ('ex',
    'Aakova avaoe kovoa iare kepa tupaoro oa iava oira varaaro varoara kasipiro.'),
   ('xp',
    'Mam em i pasim doa na go long gaden olsem na ol laplap bilong em i paia.'),
   ('xe',
    'Mother went to work, closing up the house, and her clothes burned.')]),
 ('kasi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'fire'),
   ('tkp', 'paia'),
   ('dt', '18/Nov/2006'),
   ('ex', 'Tom, etokasi kasi araisi orisia.'),
   ('xp', 'Tom yu wokim paia na kukim rais.'),
   ('xe', 'Tom, start a fire in order to cook rice.'),
   ('ex',
    'Tuituia kasi purapaavoi ra rirovira vuvuve ra vo kasi-ia ito pitoka oria.'),
   ('xp', 'Bai mi laitim bai i lait bai mi kukim sospen banana.'),
   ('xe',
    "I'll make a fire so that it burns and I'll cook a saucepan of bananas on the fire.")]),
 ('kasi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'burn'),
   ('tkp', 'boinim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Dec/2005'),
   ('ex',
    'Pita kepa kasirevora tuituia-ia uva auero rutu kasipirora kepa sovaraia.'),
   ('xp',
    'Pita i bin paiaim haus long paia. Na olgeta samting i bin paia insait long haus.'),
   ('xe',
    'Peter burnt the house with fire and everything burned inside of the house.'),
   ('ex', 'Kepa kasirevo Veruve eisi Rarova.'),
   ('xp', 'Veruve em i bin paiaim haus long Rarova.'),
   ('xe', 'Veruve burned down a house in Rarova.')]),
 ('kasiarao',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'limbum'),
   ('tkp', '???'),
   ('nt', "large leaf variety, used for making pakou 'fighting ???"),
   ('cmt', 'Double-check original ex. sentence (from Timothy).'),
   ('dt', '30/Oct/2005'),
   ('alt', 'kasarao'),
   ('ex', 'Kasarao kaku puraveira upo purapasia oreke.'),
   ('xp', 'Kasarao limbum blong wokim pako blong part.'),
   ('xe', '???'),
   ('ex', 'Kasiarao iria tatupaiveira auere uru purapasia vo keparaia.'),
   ('xp', 'Limbum ol i save paitim bilong wokim poloa long haus.'),
   ('xe', '???'),
   ('ex',
    'Kasiarao iria govukoaro tekapaiveira kakuara purapasia auere upo purapasia.'),
   ('xp',
    'Wuel limbum ol save sarim namel bilong em bilong wokim pako bilong pait.'),
   ('xe', '???')]),
 ('kasiava',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'tree'),
   ('tkp', 'diwai bilong wokim ???'),
   ('eng', 'Milkwood (Alstonia scholaris)'),
   ('nt', 'Used for making canoes. This type of tree is also used to be ???'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kasiava opuru purapaveira vars avakava kepa paupasia.'),
   ('xp', 'Kasiava tree blong workim canoes na house.'),
   ('xe', '???'),
   ('ex',
    'Kasiava ira sovara riroa ukoa toupaiveira oisio kekevira osia roroa. Uva va iava eva evaova opuruara purapaiveira oraauero kepaara.'),
   ('xp',
    'Diwai kanu insait bilong em i gat bikpela wara insait em. Wara i save stap insait i luk olsem susu.'),
   ('xe', '???')]),
 ('kasikasi',
  [('rt', 'kasi'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('rdp', 'full'),
   ('ge', 'angry'),
   ('tkp', 'kros'),
   ('tkp', 'bel i hat'),
   ('eng', 'cross'),
   ('eng', 'angry'),
   ('eng', 'difficult'),
   ('eng', 'diligent'),
   ('vx', '1'),
   ('sc', 'EMOTION'),
   ('cmt', 'Is the 2nd sentence lacking conjugation?'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Vii kasikasiu.'),
   ('xp', 'Yu kros.'),
   ('xe', "You're angry."),
   ('ex', 'Oirato kasikasi uva upo pura.'),
   ('xp', 'Man i kros na em i pait.'),
   ('xe', 'The man was angry and wanted to fight.'),
   ('ex', 'Auro, kasikasirai, ari oravatatopou.'),
   ('xp', 'Hei, yu lukaut, mi kros nau.'),
   ('xe', "Hey, I'm mad, so be prepared."),
   ('ex',
    'Vii kasikasiuei ikauvira osia viapau reoreoara iava vii tavipaavoi.'),
   ('xp', 'Yu bel hat kuik taim mi no tokim yu yet long ol toktok.'),
   ('xe', "You get angry quickly when I don't tell you about the words.")]),
 ('Kasikasiua',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'name'),
   ('cmt',
    "Is this a place name? Where is it? And what's up with rara in the example?"),
   ('dt', '12/Feb/2005'),
   ('ex',
    'Kasikasiua uva oisioa oirara rarearo tovopaive rara voea kasirovopaive.'),
   ('xp',
    'Ples we ol i save putim bun bilong ol man taim ol i kukim ol pastaim long paia.'),
   ('xe', 'Kasikasiua is where they put the bones of people ??? burn them.')]),
 ('kasipie',
  [('rt', 'kasi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'care for'),
   ('tkp', 'lukautim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('cmt', 'Is this derived from kasi? Check vowel length.'),
   ('dt', '15/Nov/2005'),
   ('ex', 'Voea vearovira kasipiepaive.'),
   ('xp', 'Ol i lukautim ol gut.'),
   ('xe', 'They would diligently care for them.'),
   ('ex', 'Kakaevure kasipieparivere.'),
   ('xp', 'Bai yu lukautim gut ol pikinini.'),
   ('xe', 'You will take care of the children.'),
   ('ex',
    'Ovoteivi kasipievira o kovo rovu rutu purapaeveira vo vokiro rutu-ia. Viapau uveipaoi ra o kovovu-re ovauo vara rutu rugopaevere vara purapaoro.'),
   ('xp',
    'Ovoteivi em i save wokim olgeta wok. Em i inap lus tingim wanpela wok em i mas tingting na wokim olgeta wok.'),
   ('xe', '???'),
   ('ex',
    'Aiteto kakaeto kasipieparevo teapi kovero riku-ia uvare kepa ruvara-ia toupaevoi.'),
   ('xp',
    'Papa i lukluk gut long pikinini nogut em i pundaon long hul bikos hul em i stap klostu long haus.'),
   ('xe',
    'Father looked out for the child lest he fell down in the hole because it is close to the house.'),
   ('ex', 'Ragai kasipievira rutu evaova tokiava uva riroepa vearopievira.'),
   ('xp', 'Mi bin lukautim gut diwai na em i gro gut.'),
   ('xe', '???')]),
 ('kasipu',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'angry'),
   ('eng', 'angry'),
   ('eng', 'cross'),
   ('eng', 'pissed off'),
   ('tkp', 'kros'),
   ('tkp', 'bel i hat'),
   ('dcsv', 'true'),
   ('cm', '-re'),
   ('vx', '2'),
   ('arg', 'OBL'),
   ('sc', 'EMOTION'),
   ('cmt', 'Is this related to <kasi>'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Ragai kasipurai.'),
   ('xp', 'Mi kros.'),
   ('xe', "I'm angry."),
   ('ex', 'Evo reoreo uvuragapaoro kasipupaopa.'),
   ('xp', '???'),
   ('xe', 'Not paying attention to those words she was angry.'),
   ('ex', 'Areipiri kasipu oirara-re ora riakora.'),
   ('xp', 'Areipiri em i kros long ol man na meri.'),
   ('xe', 'Areipiri is angry at the men and women.'),
   ('ex',
    'Areipiri oirara-re kasipu uvare viapau ikauvira urioae osia vero gaurevo kovo ousia.'),
   ('xp',
    'Areipiri em i koros long man na meri bikos ol i no kam kuik taim belo i krai bilong kisim wok.'),
   ('xe',
    "Areipiri is angry at people because they didn't come quickly when he cried [vero???] to get work.")]),
 ('kasipupie',
  [('rt', 'kasipu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'anger'),
   ('eng', 'anger'),
   ('eng', 'enrage'),
   ('tkp', 'mekim kros'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Teapi rera kasipupieive orareoreopaoro.'),
   ('xp', '???'),
   ('xe', "Don't make him angry talking among yourselves."),
   ('ex', 'Ae, visii ragai kasipupietavoi.'),
   ('xp', 'Hei, yupela i mekim mi kros.'),
   ('xe', 'Hey, you guys are making me angry.'),
   ('ex',
    'Viapau oisio ra Vavaviri kasipupiepata. Teapi vigei uporeve rara kasipuro eva-pa oa aiotavo rera varo vaviokoa.'),
   ('xp',
    'No ken mekim koros Vavaviri. Nogut em i paitim yumi long dispela popo bilong em yupela i kaikaim.'),
   ('xe', "Don't anger Vavaviri. ???.")]),
 ('kasipuvira',
  [('alt', 'kasipupavira'),
   ('rt', 'kasipu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'sharp flavor'),
   ('ge', 'angrily'),
   ('tkp', '???'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Sipeipai kasipuvira.'),
   ('xp', '???'),
   ('xe', 'It has a sharp flavor.'),
   ('ex', 'Kasipuvira rutu toupae.'),
   ('xp', 'Em i kros nogut tru.'),
   ('xe', 'He is really angry.'),
   ('ex', 'Era ita kasipuvira rutu toupare.'),
   ('xp', 'Man ia kros nogut tru na i stap.'),
   ('xe', 'He is very angry.')]),
 ('kasirao',
  [('rt', 'kasi'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'hot'),
   ('eng', 'hot'),
   ('tkp', 'hot'),
   ('vx', '1'),
   ('sc', '???'),
   ('cmt', 'Not clear whether this is distinct sense or lexicalized or what.'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Siori rirovira kasiraoroi uvare rirovira upiaparoi.'),
   ('xp', 'Jon em i skin hot nogut tru bikos em i sik nogut tru.'),
   ('xe', 'John is really hot because he is really sick.')]),
 ('kasiraopie',
  [('rt', 'kasi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'heat up'),
   ('ge', 'make hot'),
   ('tkp', 'hatim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('cmt',
    'Not sure whether kasirao is really a separate stem. It seems -pie has some freedom of movement, occuring either before or after -pie.'),
   ('dt', '24/Aug/2005'),
   ('ex', 'Uukoo kasiraopie aioa kasiraopie ra va aiori.'),
   ('xp', 'Hotim wara na kaikai bai yu kaikai.'),
   ('xe', '???'),
   ('ex', 'Aio kuro ro kasiraopieri, Dokas.'),
   ('xp', 'Dokas, yu hotim ol hap kaikai.'),
   ('xe', 'Dokas, heat up the leftovers.'),
   ('ex', 'Ukoo rovu kasiraopie etokasi-ia.'),
   ('xp', 'Yu hotim wara long paia.'),
   ('xe', 'Heat up the water on the fire.')]),
 ('kasiraovira',
  [('alt', 'kasiraopavira'),
   ('rt', 'kasirao'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'hot'),
   ('ge', 'energetically'),
   ('ge', 'diligently'),
   ('tkp', '???'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Kasiraovira toupaivoi voia.'),
   ('xp', 'Em i hot tumas long hia.'),
   ('xe', "It's too hot here."),
   ('ex', 'Kasiraovi kovopaiovoi.'),
   ('xp', 'Mipela work hat tu.'),
   ('xe', "We're working hard."),
   ('ex', 'Kasiraovira kovopaavoi.'),
   ('xp', 'Mi wok hat tru.'),
   ('xe', 'I am working very hard.'),
   ('ex',
    'Vakora-ia riro kasiraovira toupaiveira oa iava oirara voa-pa reasipaveira voa toupaarapa.'),
   ('xp',
    'Wakunai em i ples hot tru olsem na ol man i save les long stap long hap.'),
   ('xe',
    "In Wakunai it is very hot and therefore people don't want to be there."),
   ('ex', 'Kasiraovira uusirae vokiaro, viapau rirovira uteopae.'),
   ('xp',
    'Las nait mi silip hot long nait bikos i no gat bikpela kol tumas long nait.'),
   ('xe',
    "Last night it was really hot sleeping because it wasn't very cold.")]),
 ('kasiu',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'cassowary'),
   ('tkp', '???'),
   ('nt', 'Tok Pisin loan'),
   ('ig', 'yes'),
   ('dt', '15/Sep/2004')]),
 ('kasiura',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'fence'),
   ('tkp', 'banis'),
   ('cmt', 'Need a full sentence.'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kasiura auere koie kare.'),
   ('xp', 'Banis bilong ol pik.'),
   ('xe', '???'),
   ('ex',
    'Kasiura iria purapaiveira koie kare tovopasia ra viapau ratau aio kovoro aiopaive rara kasiura sovaraia toupaive.'),
   ('xp',
    'Banis ol i save mekim bilong putim ol pik insait long em na bai ol i no ken kaikaim ol gaden autsait taim ol i stap insait long banis.'),
   ('xe', '???')]),
 ('kasivari',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'scavenge'),
   ('ge', 'be diligent'),
   ('tkp', 'mumutim'),
   ('tkp', 'paipainim'),
   ('tkp', 'kiskisim'),
   ('vx', '???'),
   ('cmt', 'Is this also a name? Need better examples.'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kasivari vorouto vovouva riakova.'),
   ('xp', '???'),
   ('xe', '???')]),
 ('kasuari',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'cassowary'),
   ('tkp', 'muruk'),
   ('nt', 'Tok Pisin loan'),
   ('ig', 'yes'),
   ('dt', '05/Feb/2005'),
   ('ex', 'Ira oiso toupareveira osa kokai. Viapau vo toupare vo.'),
   ('xp', 'Em I save luk olsem kakaruk. Em i no save i stap long hia.'),
   ('xe', "It looks like a rooster. It doesn't live here."),
   ('ex',
    'Kasiuari ira oisio toupare osia kokai uva eisi toupareveira turue vego.'),
   ('xp', 'Kasiowari em i olsem kakaruk na em i stap long highlands.'),
   ('xe', 'The cassowary is like a rooster and lives in the highlands.')]),
 ('kata',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'exhaust'),
   ('tkp', 'skin dai'),
   ('dcpv', 'true'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '31/Jan/2007'),
   ('cmt', 'Need an example without -piro'),
   ('ex', 'Ragai iava varaua katapiro uvare riro kaekaevira vokaavo.'),
   ('xp', 'Skin bilong mi i dai bikos mi wakabaut longwe hap.'),
   ('xe', 'My skin was exhausted because I walked a long way.'),
   ('ex',
    'Kakaeto Tovariri katarevoi uvare kakaeto kaepaoro kareroe riro kaekae tapi-va uva varaua katarevoi uvare riro vavatatoa kakaeto.'),
   ('xp',
    'Pikinini mekim Tovariri skin dai bikos em i wok long karim pikinini i kam longwe hap na em i skin dai bikos pikinini i hevi tumas..'),
   ('xe',
    'The boy exhausted Tovariri because he was carrying the boy from a far away place and his it exhausted his body because he was a heavy boy.')]),
 ('katai',
  [('ps', 'NUM'),
   ('ge', 'one'),
   ('tkp', 'wan'),
   ('tkp', 'wanpela'),
   ('dt', '15/Sep/2004'),
   ('ex', 'Vieia rovoaroa vaoia katai.'),
   ('xp', 'Stat bilong kauntim em i wan.'),
   ('xe', 'Counting starts with one.')]),
 ('kataitoarei',
  [('rt', 'katai'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'two small things'),
   ('ge', 'limited to two only'),
   ('tkp', '???'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kataitoareivi raga kokai vaio aiterea ouavo.'),
   ('xp', 'Mi kisim tupela kakaruk tasol.'),
   ('xe', 'I only got two little chickens.')]),
 ('katakatai',
  [('ps', '???'),
   ('dx', 'Central'),
   ('ge', 'one.by.one'),
   ('tkp', 'wanwan'),
   ('eng', 'one by one'),
   ('eng', 'individually'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Katakatai kovoro iare avapata voa kovosia visii rutu.'),
   ('xp', 'Bai yupela olgeta bai go wok long wanwan gaden.'),
   ('xe', '???')]),
 ('katakataivira',
  [('rt', 'katakatai'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'individually'),
   ('eng', 'individually'),
   ('eng', 'one by one'),
   ('eng', 'separately'),
   ('tkp', 'wanwan'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Katakataipavira erako kaeuoro karepatai visi rutu vo-re kepa.'),
   ('xp', 'Yupela olgeta bai karim wanwan paia wut go long haus.'),
   ('xe', '???'),
   ('ex', 'Katakataivira toupa vo keparaia - usipata - kovopata ikaupata.'),
   ('xp', 'Istap wanwan tong all dispela house sleep wonwon work wonwon.'),
   ('xe', '???'),
   ('ex', 'Visii rutu katakataivira uriopatavere.'),
   ('xp', 'Yupela olgeta bai yupela kam wanwan.'),
   ('xe', 'All of you come individually.'),
   ('ex', 'Katakataivira voia roopierivere aue-ia kovoara ra vara puraive.'),
   ('xp', 'Bai yu givim ol wanwan long ol wok na bai ol i wokim.'),
   ('xe', 'Give everyone their work they will do it.'),
   ('ex',
    'Oirara katakataivira paupae paupara-ia uvare kepa aitearo voea-re keeraroe aiosia rera vo kepaaro-ia.'),
   ('xp',
    'Ol man i sindaon wanwan long ol chia bikos papa bilong em em i singautim ol long kaikai long haus bilong em.'),
   ('xe',
    'The men came one by one to the chairs because the head of the house called out for them to eat at his house.')]),
 ('kataraua',
  [('rt', 'katarau'),
   ('ps', '???'),
   ('ge', 'chest'),
   ('ge', 'ribs'),
   ('ge', 'sternum'),
   ('tkp', 'brus'),
   ('tkp', 'banis'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Viava kataraua vagaiava karaua upopai.'),
   ('xp', '???'),
   ('xe', '???')]),
 ('katarauto',
  [('rt', 'katarau'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'chest'),
   ('ge', 'rib'),
   ('ge', 'sternum'),
   ('tkp', 'bros'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Katarauto Luke iava upiaparoi.'),
   ('xp', 'Banis bun bilong Luke i pen.'),
   ('xe', "Luke's ribs hurt."),
   ('ex', 'Katarauto ira kasiuravira toupareveira vovou isi-re.'),
   ('xp', 'Bros i save stap olsem banis bilong hat.'),
   ('xe', 'The chest fences in the heart.')]),
 ('katavira',
  [('alt', 'katapavira'),
   ('rt', 'kata'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'mashed and mixed together'),
   ('tkp', 'malumalum'),
   ('tkp', 'meme'),
   ('cmt', 'Check vowel length. What is adv modifying in example?'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kataavira aiopapi vo aio.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Katavira kakaeto aiopieri.'),
   ('xp', 'Yu simasim pastaim na givim long pikinini.'),
   ('xe', 'Feed it to the children mashed up.'),
   ('ex',
    'Arais katavira aiopaoe uvare Pide rirovira uuko tovoroe oa iava katavira aiopaovi araisara ragai gisipoaro-ia votopai uvare katavira orioviro riroa-ia ukoa.'),
   ('xp',
    'Rais i kaikai olsem malumalu bikos Fide em i putim bikpela wara. Olsem na rais i kaikai olsem malumalum rais i pas long maus bilong mi. Bikos em i ??? long bikpela wara.'),
   ('xe', '???')]),
 ('katokato',
  [('rt', 'kato'),
   ('ps', 'N'),
   ('pt', '???'),
   ('rdp', 'full'),
   ('ge', 'black'),
   ('ge', 'filthy'),
   ('tkp', 'blak'),
   ('tkp', 'doti'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Bogenvilpairara katokatoirara.'),
   ('xp', 'Ol lain bilong Bogenvil ol i blak.'),
   ('xe', 'Bougainvilleans are black.'),
   ('ex',
    'Kepara rutu katokatopapeira rara kupareto vara katokatopiepareve katokatoirara voeao votoko iava Bogenvil.'),
   ('xp',
    'Olgeta haus smuk i save mekim blak. Ol blak man ol i bilong dispela ailan Bogenvil.'),
   ('xe', '???')]),
 ('katokatoto',
  [('rt', 'katokato'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'black person'),
   ('ge', 'negro'),
   ('tkp', 'netif'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Oirato katokatoto.'),
   ('xp', 'Black man.'),
   ('xe', 'Blakman.'),
   ('ex', 'Ragai katokatoto.'),
   ('xp', 'Mi blakman.'),
   ('xe', "I'm a black man."),
   ('ex', 'Igeia katokatoirara ora apirikapairara.'),
   ('xp', 'Mipela wantaim ol Apirika i blak.'),
   ('xe', '???'),
   ('ex',
    'P. Ona katokatoto ira votoko-ia upoa rovoreva aue-pa rasia uvare popotepai rara vo rasi eriiva oa iava upoa rovoepa.'),
   ('xp',
    'P. Ona em i blak man. Long dispela ailan em i bin statim pait long graun bikos ol waitman i bin digim ol dispela graun olsem na pait i bin stat.'),
   ('xe', '???')]),
 ('katokatovira',
  [('alt', 'katokatopavira'),
   ('rt', 'katokato'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'black-like'),
   ('ge', 'dirty'),
   ('tkp', 'blak'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Katokatovira kekepi.'),
   ('xp', 'Em i luk black.'),
   ('xe', 'It looks black.'),
   ('ex', 'Katokatovira paruvoi uukovi.'),
   ('xp', 'Wara i blak nau.'),
   ('xe', 'The river is flowing black now.'),
   ('ex',
    'Revasiva katokatovira kekepaoviro oiratoa iava uvare rera vavaearo toeivo torara-ia.'),
   ('xp',
    'Blut bilong man i luk olsem blakpela bikos ol i katim han bilong em.'),
   ('xe',
    'The blood of the man looked black because they cut his hand with an axe.')]),
 ('katokoi',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'strip of limbum used for sago leaf shingle'),
   ('tkp', '???'),
   ('nt', 'The sago leaves are folded over and fastened onto the strip.'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Katokoi-ia tetevua turari vii.'),
   ('xp', 'Yu samapim saksak long limbun.'),
   ('xe', 'Sew up the sago with limbum.'),
   ('ex', 'Asia, uriou katokoi vatesia ra oira-ia tetevuara turaa.'),
   ('xp',
    'Asia, yu kam wantaim stik bilong samapim saksak bai mi samapim saksak long em.'),
   ('xe', '???')]),
 ('katopato',
  [('rt', 'kato'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'skin-and-bones'),
   ('tkp', '???'),
   ('eng', 'skinny person'),
   ('eng', 'thin person'),
   ('eng', 'bony animal'),
   ('nt', 'Insulting when applied to humans. Otherwise used for animals.'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Kaakau era katopato.'),
   ('xp', 'Dok hia em i bun nating.'),
   ('xe', 'This dog is skin-and-bones.'),
   ('ex',
    'Katopato oirato ira raga-ia puterevo ovusia pauuparae raivaro. Ragai rera vurapaavo rirovira uvare katopatoa raga viapau varu aravai rera varaaro-ia uvare riro kaekaevira upiaroepa oa iava katopatoa rera oirato.'),
   ('xp',
    'Bu nating man em i pas by long mi taim mi sundaon long rot. Mi lukluk stron glong em bikos em i bun nating man tasol. Em i nogat mit long bodi bilong em. Bikos em i bin sik longpela taim olsem na em i bun nating man.'),
   ('xe', '???')]),
 ('katoto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'rib (sternal)'),
   ('ge', 'rib cage'),
   ('tkp', 'banis'),
   ('dt', '29/Jan/2005'),
   ('ex', 'Pauto katoto oureva Adam iava Evi purasia.'),
   ('xp', 'God i kisim banis bun bilong Adam na wokim Eve.'),
   ('xe', 'God took a rib from Adam in order to make Eve.')]),
 ('katuara',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'scour'),
   ('tkp', 'brosim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Saspan katuarari?'),
   ('xp', '???'),
   ('xe', 'Are you scouring the saucepan?'),
   ('ex', 'Pitokava katuarari oira riruuoro aue-ia rogara.'),
   ('xp', 'Yu klinim sospen long weitsan.'),
   ('xe', 'Scour the saucepan ???-ing it with ???'),
   ('ex',
    'Sera pitokara katuaraevoi pitoka rirupa-ia uvare vara rutu pitokara katokatoerao.'),
   ('xp',
    'Sera em i brasim ol sospen long stil wul bikos olgeta sospen i bilak.'),
   ('xe',
    'Sera is scrubbing the saucepans with steel wool because all of the saucepans are dirty.')]),
 ('katuarato',
  [('alt', 'katuarapato'),
   ('rt', 'katuara'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'brush'),
   ('tkp', 'bras'),
   ('sc', 'INSTRUMENT'),
   ('dt', '30/Oct/2005'),
   ('ex',
    'Ragai viapau vaaro katuarapatoavai. Uva ruipaparai ra irai voria ra ira-ia varoara katuara.'),
   ('xp',
    'Mi nogat wanpela bras bilong brasim ol laplap. Na mi laik baim wanpela bilong brasim ol laplap.'),
   ('xe', '???')]),
 ('katukatu',
  [('rt', 'katu'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('rdp', 'full'),
   ('ge', 'rot away'),
   ('ge', 'flake off'),
   ('ge', 'unfastened'),
   ('tkp', 'brukbruk'),
   ('tkp', 'sting'),
   ('nt',
    'Clasp breaks or knot becomes untied and binding becomes unfastened'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Katukaturoviro.'),
   ('xp', '???'),
   ('xe', 'His skin has rotted and is flaking off.'),
   ('ex', 'Kaakau katukaturovi eruoro.'),
   ('xp', 'Dok i sting na i lus.'),
   ('xe', '???'),
   ('ex',
    'Varoa Pita varo oa katukature uvare va raga-ia tuuparoveira vokiara rutu-ia.'),
   ('xp', 'Laplap bilong Pita i sting bikos olgeta taim em i save werim.'),
   ('xe', '???')]),
 ('katuta',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'long spear'),
   ('tkp', 'longpela spia'),
   ('dt', '20/Sep/2004'),
   ('ex',
    'Katuta riro kaekaeva iria-ia oisioa oirara togapaive oraupopaoro tuariri.'),
   ('xp',
    'Longpela supsup ol i save yusim long sutim ol man long taim bilong pait bipo.'),
   ('xe',
    'The katuta is a long spear that people used when fighting each other long ago.')]),
 ('kau',
  [('ps', '???'),
   ('ge', 'solid'),
   ('ge', 'hard'),
   ('tkp', '???'),
   ('cmt', 'Ex. sentence not a sentence.'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Kauva rutu oia asiva.'),
   ('xp', 'Strongpela buai.'),
   ('xe', '???'),
   ('ex', 'Oia asiva kauva rutu oa iava viapau uvuiparai ra oira aioa.'),
   ('xp', 'Dispela buai i strong tumas olsem na mi no inap kaikaim.'),
   ('xe', "??? and that's why I can't eat it.")]),
 ('kaukau',
  [('rt', 'kau'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'sweet potato'),
   ('tkp', 'kaukau'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '30/Oct/2005'),
   ('ex', 'Rotokasi-pa riako rirovira kaukau paupaveira.'),
   ('xp', 'Ol meri Rotokas i save planim plat kaukau.'),
   ('xe', '???')]),
 ('kaukaupie',
  [('rt', 'kaukau'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'shine intensely'),
   ('ge', 'intense sunlight'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Ravireo kaukaupieparevoi.'),
   ('xp', '???'),
   ('xe', 'The sun shines intensely.'),
   ('ex', 'Ravireo ragai kaukaupiere.'),
   ('xp', 'San i hatim skin bilong mi.'),
   ('xe', 'The sun is ???-ing me.'),
   ('ex',
    'Poroirie oisio ruipapaoe ra ravireo kaukaupiereve, ra kovoa iare avao kovosia.'),
   ('xp',
    'Poroirie em i laik bai san i hat pastaim na bai em i go long gaden.'),
   ('xe',
    'Poroirie wants the sun to shine for him to go to the garden to work.')]),
 ('kaukauvira',
  [('alt', 'kaukaupavira'),
   ('rt', 'kaukau'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'intensely'),
   ('tkp', '???'),
   ('nt', 'For example, in the manner of a very bright hot sun.'),
   ('sa', 'kaukaupievira'),
   ('dt', '15/Oct/2005'),
   ('ex', 'Kaukauvira rutu roroparevoi ravireo.'),
   ('xp', '???'),
   ('xe', 'The sun is shining intensely.'),
   ('ex', 'Ravireo kaukauvira rororevoi 9 kiloki-ia.'),
   ('xp', 'San i hat strong tru long 9 klok.'),
   ('xe', "The sun shines intensely at 9 o'clock."),
   ('ex',
    'Vovokio oisio kekepai kaukauvira roroparevoi ravireo ora vokua tapo vearpiea.'),
   ('xp', 'Tudei i luk olsem san bai lait strong na dei tu i naispela dei.'),
   ('xe', 'Today it looked like the sun was shining strongly and ???.')]),
 ('kaukovo',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'water cress'),
   ('tkp', 'kango'),
   ('cmt', 'According to Firchow, not originally Rotokas. Why not?'),
   ('dt', '05/Feb/2005'),
   ('alt', 'Is kaukovu an alterate form?'),
   ('ex', 'Kaukovo kouro eisi toupaiveira tuvuara-ia.'),
   ('xp', 'Kangkong i save gro long ol tais.'),
   ('xe', 'Water cress grows in the mud.')]),
 ('kauo',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'jump'),
   ('tkp', 'kalap'),
   ('tkp', 'mama'),
   ('sc', '???'),
   ('vx', '1'),
   ('dt', '15/Oct/2005'),
   ('ex', 'Ragai rorupaoro kauoparai.'),
   ('xp', 'Mi amamas na mi kalap.'),
   ('xe', 'I jumped out of happiness.'),
   ('ex',
    'Verisevi riro kauova vosia oaravu-ia roropaeve, ra kauopao rorupaoro.'),
   ('xp',
    'Verisevi em meri bilong danis taim em i hamamas long ol sampela samting.'),
   ('xe',
    'Verisevi is a celebrator if she is happy about things, and she dances happily.')]),
 ('kauokauo',
  [('rt', 'kauo'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('rdp', 'full'),
   ('ge', 'jump up and down'),
   ('tkp', 'kalkalap'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '15/Oct/2005'),
   ('ex', 'Vii kauokauo.'),
   ('xp', 'Yu jumpjump.'),
   ('xe', 'Jump up and down!'),
   ('ex', 'Rirovira rutu rorupaoro kauokauoparai.'),
   ('xp', 'Mi amamas tru na mi kalapkalap.'),
   ('xe', 'I jumped up and down truly happy.'),
   ('ex', 'Riakora kauokauopaveira vosia iriavu aire-pa kakae kavaueve.'),
   ('xp',
    'Ol meri i save hamamas na kalakalap taim wanpela meri i karim niupela pikinini.'),
   ('xe',
    'The women dance for joy when someone gives birth to a new child.')]),
 ('kaureo',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'stubborn'),
   ('tkp', 'bikhet'),
   ('tkp', 'sakim tok'),
   ('eng', 'contradict'),
   ('eng', 'disagree'),
   ('eng', 'stubbornly against'),
   ('eng', 'rebellious'),
   ('cm', '-va'),
   ('am', 'false'),
   ('dcsv', '???'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('cmt', 'Is kauro an alternate form?'),
   ('dt', '15/Oct/2005'),
   ('ex', 'Rirovira oisoa rera-va kaureopaave.'),
   ('xp', '???'),
   ('xe', 'They were always in much disagreement with him.'),
   ('ex', 'Kakaevure rirovira kaureopaai uva avuka voea-re kasipuoe.'),
   ('xp', 'Ol pikinini i bikhet tumas na lapun meri i krosim ol.'),
   ('xe',
    'The children are very stubborn and the old woman is angry at them.')]),
 ('kaureoto',
  [('rt', 'kaureo'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'braggart'),
   ('ge', 'back talker'),
   ('tkp', 'hambakman'),
   ('dt', '15/Oct/2005'),
   ('ex', 'Via kaureoto viapau uvupariveira.'),
   ('xp', 'Yu bikhat man yu no save harim tok.'),
   ('xe', "You're stubborn, you don't listen."),
   ('ex', 'Raraviri riro kaureoto akova-re.'),
   ('xp', 'Raraviri man bilong sakim bilong mama.'),
   ('xe', 'Raraviri is very stubborn to his mother. (???)'),
   ('ex', 'Kaureoto ro oirato ira viapau reo uvuparoveira, riro reo atoto.'),
   ('xp',
    'Bikhet man em kain man i no save harim tok o man bilong bekim tok.'),
   ('xe', "A braggart is a man who doesn't listen, and is a bit ???.")]),
 ('kausiopa',
  [('alt', 'kausopa'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'stubborn'),
   ('ge', 'unrelenting'),
   ('ge', 'concerned'),
   ('ge', 'anxious'),
   ('tkp', '???'),
   ('cmt',
    "Need to check translation. Is it just stubborn? What about 'concerned' and 'anxious'?"),
   ('cm', '-re'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('nt', "From the two words kau `strong' and sopa `inside'."),
   ('dt', '15/Oct/2005'),
   ('ex', 'E va-re kausopapau?'),
   ('xp', '???'),
   ('xe', 'Are you very concerned about it?'),
   ('ex', 'Kausiopa vovoutoa rutu ro oirato.'),
   ('xp', 'Dispela bikhet man.'),
   ('xe', '???'),
   ('ex',
    'Isivairi kausiopairaoparoe oisio ra voriavai oureve kovoa-ia oa purarevora.'),
   ('xp',
    'Isivairi em i strong bikos em i laikim man i long wok em i bin wokim.'),
   ('xe', 'Isivairi is unrelenting because ???.')]),
 ('kausiovira',
  [('alt', 'kausiopavira'),
   ('rt', 'kausio'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'anxiously'),
   ('tkp', '???'),
   ('nt', 'archaic, possibly dialectal or idiolectal'),
   ('cmt',
    'Sera claims the use of this term is confined to a small group of people--dubious'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Kausiopavira icepa paurivoi viva.'),
   ('xp', 'Man yu wanpela tasol yu wokim dispela house.'),
   ('xe', '???'),
   ('ex', 'Kausiopavira rutu vao erakoa kaerivoi.'),
   ('xp', 'Yu strong inap na yu karim paia wut.'),
   ('xe', '???'),
   ('ex',
    'Oirato kausiopavira rutu reoreoparoe ovusia rera-re koruragapae riropairara.'),
   ('xp', 'Man i bikhet tru long toktok taim ol bikman i stopim em nating.'),
   ('xe', 'The man spoke stubbornly when the big men tried to stop him.'),
   ('ex',
    'Kausiovira rugoopareveira oirato oa iava viapau vii-pa ra oaravu vatepareve oearovu-pa.'),
   ('xp',
    'Man i gat strongpela tingting na em i no save givim ol samting long ol man.'),
   ('xe', "The man is thinking anxiously and that's why ???.")]),
 ('Kava',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('eng', 'name'),
   ('dt', '24/Feb/2004')]),
 ('kavakavau',
  [('rt', 'kavau'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'partial'),
   ('ge', 'reproduce'),
   ('tkp', 'karikarim planti pikinini'),
   ('eng', 'reproduce'),
   ('eng', 'bear many children'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '31/Jan/2007'),
   ('ex', 'Vovokio-ia rirovira kakae kavakavaupae.'),
   ('xp', 'Tude ol i wok long bonim plenti pikinini.'),
   ('xe', "Today they're having a lot of children."),
   ('ex', 'Rirovira oisoa kavakavaupaive tuariri.'),
   ('xp', '???'),
   ('xe', 'In the past they reproduced greatly.'),
   ('ex',
    'Riakora rirovira kakae kavakavaupaai. Oa iava rasivaruro potepaiovoi.'),
   ('xp', 'Ol meri i karikarim planti pikinini olsem na ol hap graun i sot.'),
   ('xe',
    "Women are having many children and that's why land is in short supply.")]),
 ('kavatao',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'newlywed.house'),
   ('eng', 'newlywed house'),
   ('tkp', 'haus bilong niupela marit'),
   ('dt', '26/Sep/2006'),
   ('ex', 'Kavatao kepa vao vaitei varo.'),
   ('xp', 'Niupela marit i stap insait.'),
   ('xe', '???'),
   ('ex', 'Kavatao kepa airepa ora outorei vokeparo.'),
   ('xp', 'Haus bilong tupela nupela marit.'),
   ('xe', '???'),
   ('ex',
    'Vokepao kavatao kepa oa oisio toupaiveira. Vosia rovopa vuta-ia riakova aivaropieive oiratoa taporo, ra kavatao kepa-ia vaiterei koatapieive.'),
   ('xp',
    'Dispela haus niupela haus i save stap olsem. Sapos nambawan taim ol bungim meri wantaim man. Bai ol ol i putim tupela insait long dispela haus.')]),
 ('Kavatara',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('eng', 'name'),
   ('dt', '24/Feb/2004')]),
 ('kavau',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'be born'),
   ('tkp', 'bon'),
   ('cmt', 'Example needed'),
   ('vx', '1'),
   ('dt', '08/Dec/2005')]),
 ('kavau',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'give birth'),
   ('tkp', 'bonim'),
   ('tkp', 'karim pikinini'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Kakaeto kavauevora.'),
   ('xp', 'Em i bonim pikinini man.'),
   ('xe', 'She gave birth to the child.'),
   ('ex', 'Oriri kakaeto kavauevo.'),
   ('xp', 'Oriri i bonim pikinini man.'),
   ('xe', 'Oriri gave birth to a son.'),
   ('ex',
    'Aakova kakaeto kavauevo vovokio-ia uva aiteto riroa-va rorua toupare.'),
   ('xp', 'Mama i karim pikinini tudei na papa i stap wantaim hamamas.'),
   ('xe', 'Mother gave birth to a boy today and father is really happy.'),
   ('ex', 'Kiuwi kaakau kakaero kavauevo.'),
   ('xp', 'Kiuwi em i bonim ol pikinini dok.'),
   ('xe', 'Kiuwi gave birth to puppies.')]),
 ('kavau asiava',
  [('rt', 'kavau'),
   ('ps', '???'),
   ('ge', 'barren woman'),
   ('tkp', 'meri i no ken karim pikinini'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Viia kakae kavau asiava.'),
   ('xp', 'Yu no save kalim pikinini.'),
   ('xe', '???'),
   ('ex', 'Oia riakova kakae kavau asiva.'),
   ('xp', 'Dispela meri i no save bonim pikinini.'),
   ('xe', '???'),
   ('ex', 'Riakova kakae kavau asiava uva okakaerovu raga tokipaeveira.'),
   ('xp',
    'Meri em meri i no save karim pikinini, em save lukautim pikinini bilong ol narapela ol narapela man yet.'),
   ('xe', '???')]),
 ('kave',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'whisper'),
   ('ge', 'reduce the strength or heat of something'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '17/Oct/2005'),
   ('cmt', 'Who is angry in the first sentence? John, no?'),
   ('ex', 'Pita ira Jon kaveepare uvare kasipupa.'),
   ('xp', 'Pita i givim bel isi tok long Jon bikos em i kros.'),
   ('xe', 'Pita whispered to John because he was mad.'),
   ('ex', 'Vii kaveeparevere.'),
   ('xp', '???'),
   ('xe', 'He will whisper to yo.'),
   ('ex',
    'Pita ira soripa kave vovou vatepare uvare rirovira kasipuroe uva rera tapo toupare Pita rera-pa kave vovou vatepaoro.'),
   ('xp',
    'Pita em man i wok long givim bel isi long Jon bikos em i kros tumas olsem na Pita i stap wantaim em, na givim bel isi long em.'),
   ('xe', '???')]),
 ('kavee',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'cool off in a shaded spot'),
   ('tkp', 'karamap namel long lait bilong san na kol'),
   ('vx', '1'),
   ('sc', '???'),
   ('cmt',
    'Double-check vowel length. Is there really a kave/kavee minimal pair?'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Kave voki viapau ravire vai ova kokevavai.'),
   ('xp', 'Kave nogat sun hat tumas na i no gat ran.'),
   ('xe', '???'),
   ('ex', 'Vo kavesia uriou ari ravireoa eva virovira roropare.'),
   ('xp', 'Yu kam kul liklik pasitaw long hia bikpela sun i hot.'),
   ('xe', '???'),
   ('ex',
    'Kaveea oatoupaiveira ravireo kasikasiaro vurvuruia roia ora uteoa.'),
   ('xp', 'Karamap i save i stap namel long hot bilong san na kol.')]),
 ('kaveepaa',
  [('rt', 'kavee'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'shade'),
   ('tkp', 'hap i no gat planti san moa long ol narapela hap'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Kavepa guvaguva vao-ia.'),
   ('xp', 'Em cool tru long dispeta diwai.'),
   ('xe', '???'),
   ('ex', 'Vate toupare kaveepa kepa-ia.'),
   ('xp', 'Vate i stap long haus win.'),
   ('xe', '???'),
   ('ex',
    'Igei vo paupaiei evaova vituaroi uva viapau rirovira ravireo roropare.'),
   ('xp',
    'Mipela i sindaun adanit ol dispela diwai i no gat bikpela lait o hot bilong san.'),
   ('xe', '???')]),
 ('kaverui',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'gourd container'),
   ('tkp', 'sel kabang'),
   ('nt', "used for keeping akoro `lime powder'"),
   ('dt', '17/Oct/2005'),
   ('ex', 'Kaverui iria purapaiveira aue iava opita isi ro, akoro tovopasia.'),
   ('xp', 'Kointena ol i save wokim long kokonas bilong putim kabang.'),
   ('xe', '???'),
   ('ex', 'Kaverui iria purapaiveira aue iava opita isiro akoro tovopasia.'),
   ('xp',
    'Sel kabang ol save mekim long sel bilong kokonat bilong putim kabang long em.'),
   ('xe',
    'They make gourd containers from coconut shells in order to put lime (in them).')]),
 ('kaveruko',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'hold in arms'),
   ('tkp', 'holim pikinini'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Oira kaverukopaoro vorevira karero oira-ia rorupaoro.'),
   ('xp', '???'),
   ('xe', 'Holding her in his arms he returned being very pleased with her.'),
   ('ex', 'Henry, Rosa kaverukopaoro ava kepa iare, oira ia rorupaoro.'),
   ('xp', 'Henry i amamas long Rosa na holim em i go long haus.'),
   ('xe', '???')]),
 ('kavesi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'cabbage'),
   ('tkp', 'kabis'),
   ('dt', '29/Jan/2005'),
   ('ex', 'Aputu riro kavesi pauto eisi Sisisivi.'),
   ('xp', 'Aputu man bilong planim kebis long Sisisivi.'),
   ('xe', 'Aputu is a big cabbage farmer in Sisisivi.')]),
 ('kavikavi',
  [('rt', 'kavi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'combine'),
   ('ge', 'work together'),
   ('tkp', 'ol i wok wantaim'),
   ('cm', '-re'),
   ('vx', '2'),
   ('arg', 'OBL'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Uva oavu-re kavikaviiva. Va vaisiaro avata.'),
   ('xp', '???'),
   ('xe',
    "They worked together for something else. Its name is, single men's house."),
   ('ex',
    'Sipito oirara rutu ora riakora tavirevoi aio kavikavisia riroa aioa purasia.'),
   ('xp',
    'Sip i tokim olgeta man na meri long kisim kaikai bilong wokim bikpela pati.'),
   ('xe',
    'The chief told all of the men and women to gather food in order to have a feast.')]),
 ('kavikaviru',
  [('rt', 'kaviru'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'partial'),
   ('ge', 'steal'),
   ('tkp', 'stil stil'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Sorato aio kuroro kavikaviruparevoi.'),
   ('xp', 'Poison man em i stilim ol haphap kaikai.'),
   ('xe', 'The poison man steals crumbs of food.'),
   ('ex',
    'Ro oirato ira kavikaviruparo reraera riro pokapokato. Ira oirara vuripieparevere kavikaviru kovo purapaoro.'),
   ('xp',
    'Em dispela man i save stilstil. Em dispela kain man em les man, em i laik wokim wok stil tasol na bagarapim ol man.'),
   ('xe', '???')]),
 ('kavikaviru',
  [('rt', 'kaviru'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('dx', 'Central'),
   ('ge', 'thief'),
   ('tkp', 'stilman'),
   ('dt', '17/Jul/2005'),
   ('ex', 'Riro kavikaviruirara visi, visigoa atarikare kavirutavora.'),
   ('xp', 'Yupela ol man bilong stil, yupela i bin stilim ol pis.'),
   ('xe', "You're big thieves, you stole the fish.")]),
 ('kaviko',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'love intensely'),
   ('tkp', 'laikim tumas'),
   ('cm', '-pa'),
   ('vx', '2'),
   ('arg', 'OBL'),
   ('dt', '17/Oct/2005'),
   ('cmt', 'Why no case marking on second example?'),
   ('ex', 'Ira oviitoa-pa oisoa kavikoiraopareve.'),
   ('xp', '???'),
   ('xe', 'He was always intensely loving his son.'),
   ('ex', 'Raratuiri ira oirara rutu kavikopareveira voea aio vatepaoro.'),
   ('xp', 'Pol i save lovim ol man na givim kaikai long ol.'),
   ('xe', 'Paul loves mankind and gives food to everyone.'),
   ('ex',
    'Oirato ro ira oirara rutu-pa siraoparo rera era kaviko-pa vovouto.'),
   ('xp',
    'Man i save sori long ol man, em dispela man i gat marimari long bel bilong em.'),
   ('xe', '???')]),
 ('kavikoa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'love'),
   ('ge', 'filial love'),
   ('tkp', 'laik'),
   ('dt', '29/Jan/2005'),
   ('ex',
    'Riroa rutu kavikoa oa purareva Pauto Jisu varapieoro vo rasio iare.'),
   ('xp',
    'Bikpela i bin marimari tru na em i bin salim Jisas i kam daun long graun.'),
   ('xe', '???')]),
 ('kaviru',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'steal'),
   ('ge', 'rob'),
   ('tkp', 'stil'),
   ('vx', '1'),
   ('sc', '???'),
   ('cmt', "Is it possible to specify what's stolen?"),
   ('dt', '17/Oct/2005'),
   ('ex', 'Opeita kavirupau.'),
   ('xp', 'Yu no ken stil.'),
   ('xe', "Don't steal!")]),
 ('kaviru',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'steal'),
   ('ge', 'rob'),
   ('tkp', 'stil'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Opeita oaravu avu vai kavirupari.'),
   ('xp', '???'),
   ('xe', 'Do not be stealing anything.'),
   ('ex', 'Oeavu kaukau kovo kaviruivo uva vuraraga sia vokie eisi kovoa.'),
   ('xp',
    'Sampela man i stilum gaden kaukau na moning taim mipela i go lukluk nating long gaden.'),
   ('xe', '???')]),
 ('kaviruto',
  [('alt', 'kavirupato'),
   ('rt', 'kaviru'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'thief'),
   ('ge', 'robber'),
   ('tkp', 'stilman'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Era oirato riro kaviruto.'),
   ('xp', 'Dispela man em i stil man tru.'),
   ('xe', 'This man is a big thief.'),
   ('ex',
    'Kaviruto ira vokiaro urioparoveira oaravu kavirapasia, oirara vararo.'),
   ('xp',
    'Stilman i save kam long nait na stilim ol sampela samting bilong ol man.'),
   ('xe', '???')]),
 ('kaviruvira',
  [('alt', 'kavirupavira'),
   ('rt', 'kaviru'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'stealthily'),
   ('ge', 'secretly'),
   ('tkp', 'olsem stil'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Opeita kaviruvira oaravu oupari oearovu vararo.'),
   ('xp', 'Yu no ken stil na kisim nating samting bilong narapela man.'),
   ('xe', '???')]),
 ('kavo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'collect'),
   ('eng', 'scavenge'),
   ('eng', 'pick up'),
   ('eng', 'collect'),
   ('tkp', 'mumutim'),
   ('tkp', 'kisim ol samting ol narapela i tromoem'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '26/Sep/2006'),
   ('ex', 'Uva oisoa rera keruaro kavoive rareto guruoro.'),
   ('xp', '???'),
   ('xe',
    'They would always hunt around for the bones collecting the charred parts.'),
   ('ex',
    'Sorairara oirara varaaro kavopaiveira aio kuroro vosia vara vikipaive ra vara kavopaive vara tovopasia okavikavi rovu sovaraia vara tovopasia vuri kavikaviro.'),
   ('xp',
    'Ol poisin man i save kisim ol hap kaikai bilong ol man taim ol tromoem na bai ol i kisim na putim insait long ol sampela kain marasin i nogut.'),
   ('xe', '???')]),
 ('kavokavo',
  [('rt', 'kavo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'perform sorcery'),
   ('ge', 'work black magic'),
   ('tkp', 'posin'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Eera kavokavoto ira oirara kavokavoparoveira.'),
   ('xp', 'Poison man em i save poisonim ol man.'),
   ('xe', 'That sorcerer is always working black magic on people.'),
   ('ex',
    'Rakaisirea ora Reviteri aiterea riroirara oirara ora riakora oisioa kavokavopasi ra kopiipave uvare oisioa vuri goruro-ia pitupasi.'),
   ('xp',
    'Rakaisirea wantaim Reviteri tupela i bin poisim ol planti man na meri na ol i save dai. Bikos tupela i save holim ol bilak paua.'),
   ('xe', '???')]),
 ('kavokavoa',
  [('rt', 'kavokavo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'sorcery'),
   ('ge', 'black magic'),
   ('tkp', 'posin'),
   ('dt', '17/Oct/2005'),
   ('ex',
    'Aakova kakaeto tavipaeva oisio teapi evoa avapau uva kavokavoa iava tavipaera.'),
   ('xp',
    'Mama i tokim ol pikinini olsem no ken go long hap i gat poisin ol man i toktok long em.'),
   ('xe', '???')]),
 ('kavokavoto',
  [('rt', 'kavokavo'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'sorcerer'),
   ('tkp', 'posin man'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Kavokavoto sorato oira kopipiepato.'),
   ('xp', 'Posin man i man blong mekim man i dai.'),
   ('xe', 'A black magic sorcerer kills people.'),
   ('ex',
    'Rakaisirea iravu kavokavoto. Ira kavokavoa-ia vuria rutu oisioa pitupareve. Oa goruaro uvuipai ra katai voki raga-ia upiau ra kopiu ovoiu.'),
   ('xp',
    'Rakaisirea em wanpela poisin man. Em i save holim poisin i nogut tru. Strong bilong em inap wanpela dei tasol bai yu sik na yu dai olgeta.'),
   ('xe',
    "Rakaisirea is a sorcerer. He always has really bad black magic, and it's strength is such that you will you get sick and die in one day.")]),
 ('kavora',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'red rash on skin'),
   ('ge', 'ringworm'),
   ('ge', 'tinea'),
   ('ge', 'grille'),
   ('tkp', 'skin i bagarap grile'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Vii kavora kavorato.'),
   ('xp', 'Yu grille man.'),
   ('xe', '???'),
   ('ex',
    'Kavorara oara oirara ora riakora varaararo-ia orapurapapeira oa rakariara kaekaepiepaiveira ra vuri kekepave oirara or riakora.'),
   ('xp',
    'Ol grile i save kamap long skin bilong ol man meri. Ol dispela grile i save hantapim skin na bai ol man meri i luk nogut.'),
   ('xe', '???')]),
 ('kavorato',
  [('rt', 'kavora'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'person with ringworm or tinea'),
   ('tkp', 'grile man'),
   ('nt', 'common insult'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Vii kavorato.'),
   ('xp', 'Yu grille man.'),
   ('xe', 'You you have grille.'),
   ('ex',
    'Riakova kavoratoa-pa reasipaopa oa iava rera arova torioro voreopa.'),
   ('xp',
    'Meri i bin les long grile man na em i ranawe long em na em i ranawe i go bek.'),
   ('xe', '???')]),
 ('kavori',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'crayfish'),
   ('ge', 'lobster'),
   ('tkp', 'kindam'),
   ('dt', '11/Feb/2007'),
   ('cmt', 'What does <kakare> mean in last example sentence?'),
   ('ex', 'Kavori votoupai ukoviroia.'),
   ('xp', 'Kindam i save istap blong wara.'),
   ('xe', 'Lobsters are usually in the water.'),
   ('ex', 'Ragai kavoriparai ukova sirova uvare rirovira kavori ruipaparai.'),
   ('xp', 'Mi kisim kindam arere long wara bikos mi laik kaikai tru kindam.'),
   ('xe',
    'I am catching crayfish by the water because I really like crayfish.'),
   ('ex',
    'Riropa kakare kavori okarea avakava sovaraia toupaiveira uva vokare ritapaiveira.'),
   ('xp',
    'Ol bikpela kindam i save stap insait long solwara we ol i save sutim ol.'),
   ('xe',
    'The big ??? crayfish, they live in the ocean where we spear them.')]),
 ('kavori',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'collect crayfish or lobster'),
   ('tkp', 'kisim kindam'),
   ('vx', '1'),
   ('cmt', 'Need example to illustrate agreement'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Kavorisia avau eisi ukovi.'),
   ('xp', 'Yu go kisim kindam long wara.'),
   ('xe', 'You go lobster-hunting in the water.')]),
 ('kavorou',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'covet'),
   ('ge', 'keep something intended for another'),
   ('ge', 'intercept'),
   ('tkp', '???'),
   ('cmt', 'Find an example without a discontinous NP'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('cm', '-ia'),
   ('sa', 'ogo'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Eakere ragai vaaro-ia kavorouuei monia.'),
   ('xp', '???'),
   ('xe', 'Why are you keeping my money?'),
   ('ex',
    'Opeita oearovu vararo-ia kovorou ragapata vara oupasia voea siraopiepaoro.'),
   ('xp',
    'Noken putim nating skin nating long samting bilong narapela na putim nating long ol na mekim ol sori.'),
   ('xe', '???')]),
 ('kavovoa',
  [('rt', 'kavovo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'valley'),
   ('ge', 'ravine'),
   ('tkp', 'maunten i stap olsem banis'),
   ('tkp', 'maunten i stap raonim ol narapela'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Kavovao va vao varata viapau karuviva toupai.'),
   ('xp', 'Dispela valley i tumas open.'),
   ('xe', '???'),
   ('ex',
    'Opukuirovu oara kavovoa purapaiveira opukui rovupa raviapau opukuiro? vurapape.'),
   ('xp',
    'Ol sampela maunten i save stap olsem banis long ol narapela maunten olsem na yumi no inap lukim ol narapela maunten.'),
   ('xe', '???'),
   ('ex', 'Kavovoa aue opukuirovu oara opukuirovu-re utopaiveira.'),
   ('xp', 'Ol maunten i stap olsem banis long ol narapela maunten.'),
   ('xe', '???'),
   ('ex', 'Kavovovira pukuiara toupaiveira Sureko urui-re.'),
   ('xp', 'Ol maunten i save stap raunim peles Sureko.'),
   ('xe', '???')]),
 ('kavovovira',
  [('alt', 'kavovopavira'),
   ('rt', 'kavovo'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'open slightly'),
   ('tkp', 'bikpela open spes'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Kavovovira pukuiare toupai.'),
   ('xp', 'Twopela mauntem istap op opew??.'),
   ('xe', '???'),
   ('ex',
    'Sureko urui kavovovira toupaiveira uvare pukuiara vo uroi utopaivera.'),
   ('xp',
    'Ples Sureko i stap insait long banis bikos ol maunten i pasim dispela ples.'),
   ('xe', '???')]),
 ('kavu',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'left.over'),
   ('eng', 'left behind'),
   ('eng', 'left over'),
   ('tkp', 'lusim bihain'),
   ('tkp', 'larim'),
   ('vx', '1'),
   ('sc', '???'),
   ('dcpv', 'true'),
   ('dt', '17/Oct/2005'),
   ('cmt', 'Is there a transitive usage?'),
   ('ex', 'Vokipavira-re varao kavupapiroi.'),
   ('xp', '???'),
   ('xe', 'These are left for tomorrow.'),
   ('ex', 'Varao kavu vokipavirare oara aiopere.'),
   ('xp', 'Ol dispela samting bai yu lusi bilong tumoro bai yumi kaikai.'),
   ('xp', '???')]),
 ('kavu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'leave.behind'),
   ('eng', 'leave behind'),
   ('tkp', 'larim'),
   ('tkp', 'lusim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('sc', '???'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Vaipeiri kavuoro kareroe Palavi eisi Vakora.'),
   ('xp', 'Palavi em i bin lusim Vaipeiri long Wakunai.'),
   ('xe', 'Palavi returned to Wakunai, leaving Vaiperi behind.')]),
 ('kavuava',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'limbum'),
   ('tkp', '???'),
   ('nt', 'small but very strong trunk'),
   ('cmt', "What's the plural?"),
   ('dt', '15/Apr/2006'),
   ('ex', 'Koeta purapaveira aueia kavuova ova aue keta.'),
   ('xp', 'Ol save wokum bubnana mhouse long piapela limbum??.'),
   ('xe', '???'),
   ('ex',
    'Kavuava iria vosarao iava sirivuko siara uva garepaoi rutu sirivuko arapa.'),
   ('xp', 'Wanpela kain limbum na i liklik tru long ol limbum.'),
   ('xe', '???')]),
 ('kavupie',
  [('rt', 'kavu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'leave.behind'),
   ('tkp', 'wanpela man tasol oli lusim i dai'),
   ('eng', 'leave behind'),
   ('vx', '2'),
   ('arg', 'O'),
   ('nt', 'nickname of man left after all his relatives had died'),
   ('dt', '08/Dec/2005'),
   ('ex',
    'Koikea iravu ira vaisipaiveira oisio kavupie aue iava uvare rera tatariako ora araoko irara rutu kopiiaepa rera raga arova oa iava rera vaisipaiveira oisio kavupie.'),
   ('xp',
    'Koike em wanpela man ol i save kolim em olsem kavupie long wanem olgeta lain. Brata na sista i dai lusim em tasol. Olsem na oli save kolim em olsem kavupie.'),
   ('xe', '???'),
   ('ex', 'Kavupietoavi era ira arova kopikopiaepa.'),
   ('xp', 'Ol i bin dai lusim dispela man.'),
   ('xe', '???'),
   ('ex', 'Kavupietoavia era patoto ira kavuivo.'),
   ('xp', 'Dispela pato ia em ol i lusim na ol i go nambaut.'),
   ('xe', '???')]),
 ('kavura',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'copra'),
   ('tkp', 'kopra'),
   ('dt', '17/Oct/2005'),
   ('ex',
    'Opita kouro iava kavura purapaveira ra vara sipopaive vara-ia vori oupasia.'),
   ('xp',
    'Ol kokonat ol i save mekim kopra long em na ol i save salim bilong kisim moni.'),
   ('xe',
    'From coconuts they make copra and send them in order to make money from them.')]),
 ('kavurao',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'dust'),
   ('tkp', 'das'),
   ('dt', '17/Oct/2005'),
   ('ex',
    'Vori isi koveavo urua reroaro uva vaisi-re tararagapaoro kavurao-pa vavaerae.'),
   ('xp',
    'Moni mi mekim pundaun handanit long bet na mi painim nating na han bilong mi i pulap long das.'),
   ('xe', '???')]),
 ('kavusi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'spit_out'),
   ('tkp', 'spet'),
   ('eng', 'spit forcefully towards mark'),
   ('eng', 'spit out'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '31/Jan/2007'),
   ('ex', 'Oira kopaoro rera kavusireva.'),
   ('xp', '???'),
   ('xe', 'Swallowing it, he spit it (something else) out.'),
   ('ex',
    'Tugarato riakorirei kopaoro uva okaotoova guruvaro kavusireva aue-ia revasi.'),
   ('xp',
    'Masalai i bin daonim pinis tupela meri na em i spetim lip bilong diwai talisa long blut.'),
   ('xe', '???')]),
 ('kavuvo',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'valley'),
   ('ge', 'ravine'),
   ('tkp', 'ples name long tupela maunten'),
   ('dt', '29/Jan/2005')]),
 ('kea',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'confused'),
   ('eng', 'think mistakenly'),
   ('eng', 'confused'),
   ('eng', 'deceived'),
   ('tkp', 'giaman'),
   ('cm', '-ia'),
   ('arg', 'OBL COMP'),
   ('cmt', 'Can you also get simply OBL for an argument?'),
   ('vx', '2'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Uva eva-ia keasiepa oiso iria ita uusipaoi.'),
   ('xp', '???'),
   ('xe', 'They both though in error (confused) that she was just sleeping.'),
   ('ex',
    'Aisi ragai kea-re uva voria oure oisio puraoro vavatepaavoi ikauvira.'),
   ('xp',
    'Aisi em i giamanim mi na em i kisim moni. Em i tok olsem bai mi givim hariap.'),
   ('xe', '???')]),
 ('keakea',
  [('rt', 'kea'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'lie'),
   ('tkp', 'tok giaman'),
   ('eng', 'lie'),
   ('arg', '???'),
   ('vx', '???'),
   ('cmt', "What's going on with the second sentence?"),
   ('dt', '17/Feb/2006'),
   ('ex',
    'Vokiara rutu-ia ragai keakeapaiveira oisio vii vatepaio aue vori vii varo-ia kovoa.'),
   ('xp',
    'Olgeta dei ol i save giagiamanim mi olsem bai mipela i givim yu moni long wok bilong yu.'),
   ('xe', 'Every day they deceive me ???.'),
   ('ex', 'Voea keakea reo Vairi oisio eisi-va kareroe Vakora.'),
   ('xp', 'Vairi em i giamanim ol olsem em i kam long Wakunai.'),
   ('xe', '???')]),
 ('keakeato',
  [('rt', 'keakea'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'deceiver'),
   ('tkp', 'man bilong giaman'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Riro keakeato vii.'),
   ('xp', 'Yu man bilong giaman.'),
   ('xe',
    'Pita kakaevure taviparevo oisio keakeato era ira urioparoi raiva-ia vokopaoro.'),
   ('xp',
    'Pita i tokim ol pikinini olsem em giaman ia i wokabaut i kam long rot.'),
   ('xe', '???')]),
 ('keari',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'bamboo'),
   ('tkp', 'mambu bilong wokim spia'),
   ('nt', 'used for making arrow shafts'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Keari aue paupasia rauru iapasi aue-ia koeta.'),
   ('xp', 'Blong putim arrow na bai shut long bamboo.'),
   ('xe', '???'),
   ('ex',
    'Kearia oa paupaiveira aue purapasia rauru vara-ia orekerovu riitapasia vara-ia.'),
   ('xp',
    'Tiktik ol i save planim bilong mekim spia bilong sutim ol narapela samting long em.'),
   ('xe', '???')]),
 ('kearito',
  [('rt', 'keari'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'long arrow'),
   ('tkp', 'longpela spia'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Uriou kearitoa-va ira-ia rauruva puravere oira-ia kora ritapasia.'),
   ('xp', 'Kam wantaim tiktik bai mi mekim spia bilong sutim kapul long em.'),
   ('xe',
    "Come with an arrow from which I'll make a spear with which to spear possums.")]),
 ('keavira',
  [('alt', 'keapavira'),
   ('rt', 'kea'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'deceptively'),
   ('ge', 'confused'),
   ('tkp', 'olsem giaman'),
   ('dt', '17/Oct/2005'),
   ('cmt',
    'Is there an objevt missing in the second clause of the 2nd example sentence?'),
   ('ex', 'Keavira uusisia avau.'),
   ('xp', 'Yu go slip giaman long hap.'),
   ('xe', 'You go pretend to be asleep.'),
   ('ex', 'Vao keavira tousia avau.'),
   ('xp', 'Yu go i stap giaman long hap.'),
   ('xe', '???'),
   ('ex',
    'Suparai keavira tuutaa taparevoi uva kepa rutu kovei uvare viapau oiraopavira rutu vataparevo.'),
   ('xp',
    'Suparai em i giaman nilim pos na haus i pundaun olgeta bikos em i no nilim tru.'),
   ('xe',
    "Suparai didn't really nail the post and the house fell down because he didn't nail it properly.")]),
 ('kee',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'shatter'),
   ('ge', 'fracture'),
   ('ge', 'chip'),
   ('tkp', 'brukim'),
   ('nt', 'applies to brittle objects'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Botol tou keerevora va kovepaoro.'),
   ('xp', '???'),
   ('xe', 'He shattered the bottle, dropping it.'),
   ('ex', 'Aguvi tou keepi Pita votouaru uvare rasito iare kove.'),
   ('xp', 'Botol i bruk bikos i pundaon i go daon long graon.'),
   ('xe', '???')]),
 ('keekee',
  [('rt', 'kee'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('rdp', 'full'),
   ('ge', 'chipped'),
   ('ge', 'shattered'),
   ('tkp', '???'),
   ('vx', '1'),
   ('cmt', "What's up with -viro?"),
   ('dt', '27/Oct/2005'),
   ('ex', 'Sisiro keekeeoviropa.'),
   ('xp', '???'),
   ('xe', 'The mirror is shattered.'),
   ('ex', 'Sisiro keekee eva evoa toupai rasito ivaraia.'),
   ('xp', 'Brukbruk mirror ia istap hantap long graon.'),
   ('xe', '???')]),
 ('keekeepa',
  [('rt', 'kee'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'shattered object'),
   ('ge', 'fractured object'),
   ('tkp', '???'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Keekeepa sigoa.'),
   ('xp', 'Naip i bruk pinis.'),
   ('xe', 'The knife is broken.'),
   ('ex', 'Pita keekeepa sigoa ari ragai vearoa sigoa.'),
   ('xp', 'Pita i gat naip i brukbruk tasol mi gat gutpela.'),
   ('xe', 'Peter has a broken knife but I have a good one.')]),
 ('keekeeri',
  [('rt', 'keeri'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'spear'),
   ('tkp', '???'),
   ('nt',
    'similar toraavai but only small area fitted with flying fox bones,'),
   ('dt', '07/Sep/2006'),
   ('ex',
    'Keekeerin puraiveira aueiva gari araivo ra auetapo tukepaiveira aatu keruro.'),
   ('xp', 'Spia ol i sone putim makmak long en wantim bones blon blak bokis.'),
   ('xe', '???'),
   ('ex', 'Rauru kae keekeeri-pa.'),
   ('xp', 'Spia i gat makmak long em.'),
   ('xe', 'The spear has marking.')]),
 ('keekeerito',
  [('rt', 'keekeeri'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'insert to hold arrow tip'),
   ('tkp', 'het bilong mekpas bilong spia'),
   ('dt', '07/Sep/2006'),
   ('ex',
    'Keekeerito ira purapaiveira aue iava aari ova aatu keruro rauru kaero-ia.'),
   ('xp',
    'Ol save workim ol makmak na putim i go bowes bilong [flying fox] long ol spia; na kamapim ol makmak.'),
   ('xe', '???, they made it from ??? flying fox bones on the spears.')]),
 ('keera',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'call.for'),
   ('eng', 'call for'),
   ('eng', 'beckon to'),
   ('eng', 'signal for meeting'),
   ('tkp', 'singaut'),
   ('tkp', 'kolim'),
   ('cm', '-re'),
   ('vx', '2'),
   ('arg', 'OBL'),
   ('sc', 'EMISSION.SOUND'),
   ('cmt',
    "What's the story with the last sentence, Pita keerapa viire pausia?"),
   ('dt', '15/Nov/2005'),
   ('ex', 'Rerare keeraiepa, Auoro Soraviri, varau!'),
   ('xp', '???'),
   ('xe', 'We (exclusive) called to him, Hey Soraviri, come down!'),
   ('ex', 'Keera avakava sirova toupareveira atari aiopaaro.'),
   ('xp', 'Keera em i save i stap tasol solwara kaikaim of liklik fish.'),
   ('xe', '???'),
   ('ex', 'Gurukoa eva toivaiva oa oirara-re keerapai.'),
   ('xp', 'Krai bilong garamut i singautim ol man.'),
   ('xe', 'The garamut is a ??? that signals people.'),
   ('ex', 'Keerapa ra avasi eisi karoa.'),
   ('xp', 'Tun i singaut long yu bai yupela i go long gaden.'),
   ('xe', '???'),
   ('ex', 'Pita keerapa vii-re pausia.'),
   ('xp', 'Pita sigaut long yu bai yupela sitdown.'),
   ('xe', 'Pita calls for you to sit down.')]),
 ('keeriva',
  [('ps', '???'),
   ('ge', 'crest of cockatoo'),
   ('tkp', 'kangai'),
   ('cmt',
    'glossed in Tok Pisin as "gras bilong koki ol i save brukim na pas bilogn putim long het"'),
   ('dt', '27/Oct/2005'),
   ('ex',
    'Keeriva iria purapaiveira kokio kare oruaro-ia oisio osia kakata ra vara-va pupipaive.'),
   ('xp',
    'Gras bilong feather ol i save mekim gras bilong ol pisin olsem koki na ol i save singsing kaul wantaim.'),
   ('xe', '???')]),
 ('keesi',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'box'),
   ('eng', 'box'),
   ('eng', 'case'),
   ('tkp', 'kes'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '30/Apr/2006')]),
 ('keetaa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'jaw'),
   ('tkp', 'wasket'),
   ('eng', 'chin'),
   ('eng', 'jaw'),
   ('eng', 'mandible'),
   ('dt', '27/Oct/2005'),
   ('ex',
    'Keetaa oirato iava gasiivo ora upopaoro vokiaro. Uva viapau vearovira aioparevo uvare keetaa grasipiro.'),
   ('xp',
    'Wisket bilong man ol i brukim taim ol i pait long nait. Na em i no wok long kaikai bikos wisket em i bruk.'),
   ('xe', '???')]),
 ('keevuru',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'glide'),
   ('tkp', '???'),
   ('cmt', 'Check classification. Example has Class B agreement, no?'),
   ('vx', '1'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Keevuruparoi keravo.'),
   ('xp', '???'),
   ('xe', 'The hawk is gliding (through the air).')]),
 ('keevuruvira',
  [('rt', 'keevuru'),
   ('ge', '???'),
   ('tkp', '???'),
   ('dt', '18/Apr/2007'),
   ('ex',
    'Kokio kare keevuruvira papapaiveira keauvere oisio koveroi rara kevururo vara???.'),
   ('xp',
    'Ol pisin i save plai isisi antap. Na ai yu ting olsem bai em i pundaun taim i plai ???.'),
   ('xe', 'Birds fly gliding ???')]),
 ('kegi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'taunt'),
   ('tkp', 'duim'),
   ('tkp', 'posim'),
   ('vx', '2'),
   ('arg', 'O COMP'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Ragai kegiparevoi va purasia osa reasiparai va puraarapa.'),
   ('xp', '???'),
   ('xe', 'He is taunting me to do it, as I am not wanting to do it.'),
   ('ex',
    'Virievi iria ragai kegipaoro ragai tapo avaoe eisi-re kovoa osia reasiragaparae.'),
   ('xp',
    'Virievi i wok long posim mi na em i go wantaim mi long gaden taim mi wok long les.'),
   ('xe', '???')]),
 ('kei',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'house with a single pitch roof'),
   ('tkp', 'haus i gat wanpela sait rup bilong haus'),
   ('nt', 'Full form is <kei kepa>.'),
   ('cmt', 'Is "lean-to" a good translation?'),
   ('dt', '10/Jan/2007'),
   ('ex', 'Avaisisi ira kei kepa-ia uusiparoveira vegoaro.'),
   ('xp', 'Avaisisi em i save slip long wansait haus long bus.'),
   ('xe', 'Avaisisi sleeps in a lean-to in the jungle.')]),
 ('keke',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'look_at'),
   ('eng', 'look at'),
   ('tkp', 'lukim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('cmt', 'Example needed'),
   ('sc', 'PERCEPTION'),
   ('dcpv', 'true'),
   ('dt', '15/Feb/2007'),
   ('ex',
    'Aakova kakaeto tavipaevo oisio keke osia vitavipaveira oisio opeita sigoa-ia pitupari teapi oratoeu.'),
   ('xp',
    'Mama i tokim pikinini olsem, Lukim nau, olsem mi save tokim, yu no ken holim naip nogut yu katim yu yet.'),
   ('xe', '???'),
   ('ex', 'Asia ira Arari kekere ovusia gaupare aakova-pa.'),
   ('xp', 'Asia em i lukim Arari taim emi krai long mama bilong em.'),
   ('xe', '???'),
   ('ex', 'Ragai keke ovusia oravikira vo rovua-ia.'),
   ('xp', 'Lukim mi taim mi kalap long wara.'),
   ('xe', '???')]),
 ('keke',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'look'),
   ('tkp', 'luk'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('dcpv', '???'),
   ('dadv', '???'),
   ('dt', '19/Mar/2006'),
   ('ex', 'Vearo kekepau ragai osireiaro-ia vii oupa ovusia ragai taviri.'),
   ('xp',
    'Yu luk gut tru long ai bilong mi bai mi maritim yu sapos yu tok orait.'),
   ('xe', 'You look good in my eyes I will marry you if you tell me.'),
   ('ex', 'Vo araorei kataivira kekepasiei.'),
   ('xp', 'Tupela brata i luk wankain.'),
   ('xe', 'The two brothers look the same.')]),
 ('kekepie',
  [('rt', 'keke'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'show'),
   ('tkp', 'soim'),
   ('tkp', 'kamapim'),
   ('arg', '???'),
   ('vx', '???'),
   ('dt', '08/Dec/2005'),
   ('ex',
    'Rarasori oirara kekepiepare kovoa-ia oisio osia ragavira va sirova utuavere va puraoro.'),
   ('xp', 'Rarason em i soim ol man long hao bai ol lukim na bihainim.'),
   ('xe',
    'Robinson is showing people the work so that they can do it in the same way.'),
   ('ex', 'Kekepie sopa rutu vavao oa karekepiepareva Rarasori.'),
   ('xp', 'Wanpela kain samting tru ia Rarason wok long soim.'),
   ('xe', '???')]),
 ('kekeputu',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'nearly'),
   ('ge', 'almost'),
   ('tkp', 'klostu tru'),
   ('vx', '1'),
   ('cmt', 'Can this have a real subject, as opposed to a dummy subject?'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Kekeputui vuuta ra karesi.'),
   ('xp', '???'),
   ('xe', 'The time is nearly present for you both to return.'),
   ('ex',
    'Vuuta kekeputupaoro avapai Rarasori-pa ra vatuava kekesia karero atoia.'),
   ('xp',
    'Taim i wok long klostu nao bilong Rarason bai em go bek long ples na lukim meri bilong em.'),
   ('xe', '???')]),
 ('kekeputuvira',
  [('alt', 'kekeputupavira'),
   ('rt', 'kekeputu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'close'),
   ('ge', 'nearly'),
   ('tkp', '???'),
   ('nt', 'Alternate forms kekeputu and keputu are used.'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Kekeputuvira va tovorivo.'),
   ('xp', '???'),
   ('xe', 'You put it very close to the edge.'),
   ('ex',
    'Vuuta vurapaavoi kekeputuvira kokeva vo vuutaro. Oa-ia viapau avarae eisi-re kovoa.'),
   ('xp',
    'Taim em mi lukim olsem i klostu taim bilong ren bai pundaon olsem na mi no go long gaden.'),
   ('xe', '???')]),
 ('kekeraokovira',
  [('alt', 'kekeraokopavira'),
   ('rt', 'kekeraoko'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'nicely'),
   ('tkp', '???'),
   ('dt', '12/Dec/2006'),
   ('ex', 'Kekeraokovira rutu va puraparivoi.'),
   ('xp', '???'),
   ('xe', 'You are making it very nicely.'),
   ('ex',
    'Tevire ira torara iava kakurao purarevo kekeraukovira uva eva raga oa iava vaa voriavo.'),
   ('xp',
    'Tevire i wokim handol bilong tamiok i luk olsem naispela tru. Long dispela tasol na m baim.'),
   ('xe', 'Tevire made a nice handle for the axe ???.'),
   ('ex', 'Kekeraokovira Tevire kakuraoro purapareveira torarara iare.'),
   ('xp', 'Tevire em i save wokim ol gutpela handol bilong ol tamiok.'),
   ('xe', '???')]),
 ('kekesopa',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'tree'),
   ('tkp', '???'),
   ('nt', 'Its fragrant leaves used when cooking opossum.'),
   ('sf', 'FLORA'),
   ('dt', '28/Jan/2005'),
   ('ex',
    'Kekesopa riro sesi arut va guruvaro ia kova oripaveira ora oaravu varuere ara.'),
   ('xp',
    'Kekesopa wanpola kain diwai isave simell get toy ol save kukua wantam olmeat blong kapul na ol kain hamusi.'),
   ('xe', '???')]),
 ('kekevoto',
  [('ps', '???'),
   ('dx', 'Central'),
   ('ge', '???'),
   ('tkp', '???'),
   ('dt', '27/Aug/2005'),
   ('nt',
    'hypothetical root which does not occur without derivational morphology--<ora>- or -<vira>'),
   ('cmt', 'Sera not sure about word---possibly archaic')]),
 ('kekevotovira',
  [('alt', 'kekevotopavira'),
   ('rt', 'kekevoto'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'out of sight'),
   ('ge', 'slyly'),
   ('tkp', 'luk olsem holim pas'),
   ('tkp', 'i no inap larim'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Kekevotovira va kaeri.'),
   ('xp', '???'),
   ('xe', 'Carry it so that no one sees it.'),
   ('ex',
    'Vuataevi iria kekevotovira voriara-ia pitupaeveira viapau uvuipaoi ra ikauvira oa-ia vorio.'),
   ('xp',
    'Vuataevi em meri holim pas moni. em i no nap baem kuik samting long moni.'),
   ('xe',
    "Vuataevi always holds on to money secretely and won't go shopping with it.")]),
 ('kekira',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'moon'),
   ('ge', 'month'),
   ('tkp', 'mun'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Kekira viei vo katai iva sovaraia.'),
   ('xp', 'Namben blong ol moon long wonpela year.'),
   ('xe', '???'),
   ('ex', 'Eraopiepatoa-ia kekira ira-ia kakaevure sikuru rovopaveira.'),
   ('xp', 'Nabatu mun ol pikinini i save statim skul bilong ol.'),
   ('xe', '???')]),
 ('keo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'give a party'),
   ('ge', 'make a feast'),
   ('tkp', 'mekim pati'),
   ('vx', '???'),
   ('arg', '???'),
   ('cmt', 'Clearer examples needed.'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Vao vioia keo uva aioro opoa.'),
   ('xp', 'Yo kaikai diapela hap taro.'),
   ('xe', '???'),
   ('ex', 'Keosia uriouei vo aio-ia oa puraiovoi vii-pa riro aio rutu.'),
   ('xp', 'Mipela mekim dispela bikpela party long yu.'),
   ('xe', '???'),
   ('ex', 'Sisivarea kookai rutu aiorevo rera-ia keoro uva kopiiroe.'),
   ('xp', 'Sisivarea em i kaikaim kakaruk olgeta na em dai.'),
   ('xe', 'Sisivarea ate all of the roosters ??? and he died.')]),
 ('keoka',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'left over'),
   ('ge', 'surplus'),
   ('tkp', 'lep'),
   ('vx', '???'),
   ('dt', '27/Oct/2005'),
   ('cmt', 'Archaic?'),
   ('ex', 'Vao oa putepaivo voviei ivaraia.'),
   ('xp', 'Waneme I left over long wamber??.'),
   ('xe', '???'),
   ('ex',
    'Keoka iraorai riroara rutu aiooro varuara osia viapau vara opesipieavo uvare vukurae uva oaravu touvira toupaivo.'),
   ('xp',
    'Mi napim laik bilong kaikai planti habus tru. Tasol mi no pinisim olgeta bikos mi pulap olsem na sampela i stap yet.'),
   ('xe', '???')]),
 ('keopa',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'taste good'),
   ('tkp', 'swit'),
   ('am', 'false'),
   ('vx', '1'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Vearo supu rutu vaoia atari supu oa-ia ora keopa.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Keoparoi rutu rovo supu oia va aioro.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex',
    'Keopa rutu eva varua oavaterevora Rarasori Mateas-pa. Uva va-ia keopa va aiopaoro.'),
   ('xp',
    'Em wanpela gutpela mit Rarason i bin givim long Matias. Nea em i wok long testim gut tru taim em i kaikai.'),
   ('xe', '???')]),
 ('keovira',
  [('alt', 'keopavira'),
   ('rt', 'keo'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'sweetly'),
   ('tkp', '???'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Keopavira rutu vo supue??purariroi oa aiopaiovoi.'),
   ('xp', 'Dispela sop emi sweet tra mipalal ikaikaim.'),
   ('xe', '???'),
   ('ex',
    'Kokai pitoka keopavira rutu gesiei uvare va iava ivita verarevo kakaeto.'),
   ('xp',
    'Sospen kakaruk i olsem smel suit tru, bikos pikinini i raosim lit bilong pot.'),
   ('xe', '???')]),
 ('kepa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'house'),
   ('tkp', 'haus'),
   ('eng', 'house'),
   ('eng', 'building'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Uusipa kepa.'),
   ('xp', 'Haus bilong sleep.'),
   ('xe', 'Sleep house.'),
   ('ex', 'Kepa oavivu rutu oa oirara rutu tauvapaiveira rasitoa rutu-ia.'),
   ('xp',
    'Haus em wanpela samting tru i save halivim tru olgeta man olgeta hap bilong graun.'),
   ('xe', '???')]),
 ('kepa toupato',
  [('rt', 'kepa'),
   ('rt', 'tou'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('dx', 'Central'),
   ('ge', 'homebody'),
   ('tkp', 'man i save stap long haus tasol'),
   ('dt', '02/Sep/2005')]),
 ('kepetai',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'tall woman'),
   ('tkp', '???'),
   ('dt', '28/Jan/2005')]),
 ('kepi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'fracture'),
   ('ge', 'break'),
   ('tkp', 'brukim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '20/Nov/2006'),
   ('ex', 'Teapi ragai kokotoaro kepiive.'),
   ('xp', '???'),
   ('xe', "It wouldn't be good if they fractured my leg."),
   ('ex', 'Vosia kokotoa vii iava kepipiro ra viapau uvuipauei ra vokapari.'),
   ('xp', 'Sapos lek bilong yu i bruk bai yu no inap wokabaut.'),
   ('xe', "If your leg is broken, you can't walk around.")]),
 ('kepia',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'nest'),
   ('tkp', '???'),
   ('nt',
    'made of sticks and woven into loose platform, built by crows, hawks, etc.'),
   ('dt', '27/Oct/2005'),
   ('ex',
    'Kepia oa purapaiveira kokio kare aue-ia garepa raorovi rutu evaoraoro ra voa takura kouave vo kepi-ia.'),
   ('xp',
    'Haus ol pisin i save wokim long ol liklik han tru bilong diwai. Na bai oli putim kiau long dispela haus.'),
   ('xe', '???')]),
 ('kepikepi',
  [('rt', 'kepi'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('cl', 'isi'),
   ('ge', 'tree fern'),
   ('tkp', '???'),
   ('sf', 'FAUNA'),
   ('nt', 'small with thorns and edible new shoots'),
   ('dt', '12/Feb/2005'),
   ('ex',
    'Kepikesi oiso toupai oga evava?? Votoupai veira regoarorirovira kupei aro karupiepai vosa aire paraoro purasa auepa pe ra kupeivira toupaive ra kavupaive.'),
   ('xp',
    'Em iotsen?? Wanpela diwai??em isave kamapim ol new pela hand blong plaint istap long bush.'),
   ('xe', '???')]),
 ('kepiko',
  [('rt', '???'),
   ('ps', '???'),
   ('ge', '???'),
   ('tkp', 'wanpela plant'),
   ('dt', '27/Oct/2005'),
   ('ex', '???'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Votoupai vegoao riro kaekaepa guruvaro.'),
   ('xp', 'Kepiko isave istap long ol diwai em long pela leaf.'),
   ('xe', '???'),
   ('ex',
    'Kepikoa oa kovapapaeira evaoara kovuaro-ia. Vara ivaratapo korakare toupaiveira.'),
   ('xp',
    'Kepiko emi wanpela kain plant i save gro long namel bilong diwai. Na ol kapul i save stap hantap long em tu.'),
   ('xe', '???')]),
 ('kepiriko',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'betel pepper vine with fragrant leaf'),
   ('tkp', 'sampela kain diwai'),
   ('nt', 'A wild type and not eaten.'),
   ('dt', '27/Oct/2005'),
   ('ex',
    'Kepiriko evaova vaisiaro oa oupaiveira va-va pupipasia ovusia gesipape vearopievira.'),
   ('xp',
    'Kepiriko i nem bilong wanpela diwai ol i save kisim bilong singsing kaul wantaim na bai save smel gut tru.'),
   ('xe',
    'Kepiriko is the name of a tree that they get to singsing with while it smells good.')]),
 ('kepiro',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'cicada'),
   ('tkp', 'wanpela kain binatang'),
   ('nt', 'largest type'),
   ('sf', 'FAUNA.INSECT'),
   ('dt', '27/Oct/2005'),
   ('ex',
    'Kepiro vosia raoiava sikokare vosiararo. Riro kukueva irova ora putaeuarei pokopokopiepaeveira ra gurukopaive.'),
   ('xp',
    'Cicada em i wanpela binatang long ol lain bilong grasopa. Em i gat bikpela het na tu emi sae pairapim tupela wing bilong em i save krai.'),
   ('xe', '???')]),
 ('kepisi',
  [('ps', 'V'),
   ('pt', '???'),
   ('dx', 'Central'),
   ('ge', 'dam'),
   ('tkp', 'pasim wara'),
   ('vx', '???'),
   ('dt', '22/Nov/2005'),
   ('ex', '???'),
   ('xp', '???'),
   ('xe', '???')]),
 ('kepisiva',
  [('rt', 'kepisi'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('dx', 'Central'),
   ('ge', 'dam'),
   ('tkp', 'dam'),
   ('dt', '21/Aug/2005'),
   ('ex', '???'),
   ('xp', '???'),
   ('xe', '???')]),
 ('kepita',
  [('rt', 'kepi'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'firewood stick'),
   ('tkp', '???'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Kepitara evao raortro gareparavi.'),
   ('xp', 'Liklik hand blong diwai.'),
   ('xe', '???'),
   ('ex', 'Sopia kepitara ouri ra tuitui kasi puraa.'),
   ('xp', 'Spia, yu kisim ol pai wut na bai  mi wokim paia.'),
   ('xe', '???')]),
 ('kepitai',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'small girl'),
   ('tkp', '???'),
   ('nt',
    'Also used as a nickname which stays with the female even as an adult.'),
   ('dt', '27/Oct/2005'),
   ('ex',
    'Kepitai vosia garevisivi aakova vo kakae kavaueve ra eisi oira vaisipaive oisio kepitai.'),
   ('xp',
    'Liklik bun nating meri taim mama i karim em ol i save kolim liklik longpela bun nating meri.'),
   ('xe', '???'),
   ('ex',
    'Kepitai riro kaekaeva riakova, iria vararo viapau riro vututuvira toupaive.'),
   ('xp', 'Liklik longpla meri, skin bilong em i no pat.'),
   ('xe', '???')]),
 ('kepito',
  [('alt', 'kepipato'),
   ('rt', 'kepi'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'crippled person'),
   ('tkp', 'man i gat bun i bruk'),
   ('nt', 'can be a broken leg or a broken arm'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Ro oirato ira kepivira vokapareve.'),
   ('xp', 'Man i no wokabaut gut.'),
   ('xe', "A man who can't walk well."),
   ('ex',
    'Oirato ro ira iava kokotoa kepivira toupaive. Oire reraera ira viaispaivere oisio kepito.'),
   ('xp',
    'Man em lek bilong em i bruk. Orait em dispela kain man ol i save kolim olsem lek bruk man.')]),
 ('kepo',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'two-sided'),
   ('tkp', 'i gat tupela sait'),
   ('vx', '???'),
   ('cmt', 'Need ex illustrating verb agreement.'),
   ('dt', '27/Oct/2005'),
   ('ex',
    'Pita Sera taviparevoi oisio uriou ragai-pa asi kepo reiva. Ravarei ura uvare aasi ruipaparai.'),
   ('xp',
    'Pita i tokim Sera olsem, yu kam wantaim tupela hap buai. na bai mi kaikai bikos mi laik kaikai buai.'),
   ('xe', '???')]),
 ('kepoi',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'small type of shell'),
   ('tkp', 'wanpela kain sel'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Kepoi vo toupai avakona savaraa.'),
   ('xp', 'Kepoi shell em i save i stap long sol wara.'),
   ('xe', '???'),
   ('ex', 'Kepoi garevavi iria avakava sovara-ia toupaeveira.'),
   ('xp', 'Kepoi em i wanpela liklik sel, i save stap insait long solwara.'),
   ('xe', '???')]),
 ('keposi',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'wood chips from a cut'),
   ('ge', 'sliver of wood'),
   ('tkp', 'haphap diwai akis i wokim'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Keposiara oara verapaiveira vosia evaoara toepaive.'),
   ('xp',
    'Ol liklik haphap diwai ol save raosim taim ol i save katim ol diwai.'),
   ('xe', '???')]),
 ('kepoto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'vulva opening'),
   ('ge', 'labia'),
   ('tkp', 'bokis'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Kepoto ira toupareveira riakora rutu-ia vataupa tavukia.'),
   ('xp', 'Hap i save stap op long olgeta meri long ples tambu.'),
   ('xe', '???')]),
 ('kera',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'albatross-like bird'),
   ('tkp', 'pisin i save stap long solwara'),
   ('nt', 'sea bird, sharp pointed wings, probably a Frigate Bird'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '28/Jan/2005'),
   ('ex', 'Kokioto kera votoupaiveira avakava-ia.'),
   ('xp', 'Kera em wanpela pinis Isave istap long solwara.'),
   ('xe', 'The bird called kera likes being on the beach.')]),
 ('kerakera',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'mushroom'),
   ('tkp', '???'),
   ('nt', 'very large edible variety'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Kerakera arua aiopara.'),
   ('xp', 'Em I olsem kum tasol blong kaiken??.'),
   ('xe', '???'),
   ('ex',
    'Kerakera aruato iraora puraparoveira rasitoa-ia. Ira oripaiveira rera aiopasia.'),
   ('xp',
    'Masrum em wanpela kain kumu i save gro long graon. Ol save kukim na kaikaim.'),
   ('xe', '???')]),
 ('kerari',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'all vegetation'),
   ('ge', 'glory of God'),
   ('tkp', 'olgeta purpur samting'),
   ('dt', '27/Oct/2005'),
   ('ex', 'Keraria purareva oisio ravaia toupaoro rera vaisaro kaepiepape.'),
   ('xp',
    'Dispela kriesen god i bin wokim olsem bai olgeta man i mas givim glori na bik nem i go long em.'),
   ('xe', '???')]),
 ('keraria',
  [('rt', 'kerari'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'all vegetation'),
   ('ge', 'glory of God'),
   ('tkp', '???'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Keraria purari oveu atosia popita atosia.'),
   ('xp', 'Wokem hook blong downim kapiaka na kokosi.'),
   ('xe', '???'),
   ('ex',
    'Pauto ira keraria purareva vearopiea oa-ia touparo rera-pa uvuipai ra riro vaisi vorepiepaive oirara pautoa-pa.'),
   ('xp',
    'God i bin wokim dispela gutpela creation bai ol man stap long em na givim bik nem hona i go bek long God.'),
   ('xe', 'God created all of the plants ???')]),
 ('kerau',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'stiff'),
   ('ge', 'rigormortis'),
   ('ge', 'rigid'),
   ('tkp', '???'),
   ('vx', '1'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Keraurai.'),
   ('xp', '???'),
   ('xe', "I'm stiff."),
   ('ex',
    'Viapau uvuiparai ra vavaea pukoa uvare kerae riro va-ia kokeva vokavoiva.'),
   ('xp',
    'Mi no inap bendim han bilong mi bikos mi wokabaut long bikpela ren.'),
   ('xe', "I can't bend my hand because I walked in a big rain.")]),
 ('kerauto',
  [('rt', 'kerau'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'paralyzed person'),
   ('tkp', '???'),
   ('dt', '11/Feb/2007'),
   ('ex', 'Kerauto ro ira viapau uvuipa ra vokapareve.'),
   ('xp', 'Man i no inap wokabaut.'),
   ('xe', "A paralyzed person is one who can't walk."),
   ('ex',
    'Vosia kerauto oirato era ira viapau uvuipa ra vokapareve ari era urua raga-ia uusiparovere.'),
   ('xp',
    'Sapos paralyzed man, em dispela man i no inap wokabaut em bai slip tasol long bet.'),
   ('xe',
    "If a man is paralyzed, he can't walk but instead will just sleep in bed.")]),
 ('keravisi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'plough under'),
   ('ge', 'turn soil over'),
   ('tkp', 'hukim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Rasia keravisipaivorao.'),
   ('xp', '???'),
   ('xe', 'They turned the soil over yesterday.')]),
 ('keravisia',
  [('rt', 'keravisi'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'hook'),
   ('ge', 'plough'),
   ('tkp', 'huk'),
   ('cmt',
    "I don't think va-ia should be there twice in second ex. sentence."),
   ('dt', '28/Oct/2005'),
   ('ex', 'Ragai keravisi purapaa ra oveu atora rara utupaua.'),
   ('xp', 'Mi wokim huk bai mi daonim kapiak long bihain.'),
   ('xe', '???'),
   ('ex', 'Keravisia oupaavoi ra va-ia opita isi rovai keravisia va-ia.'),
   ('xp', 'Bai mi kisim huk na bai mi hukim ol kokonat long em.'),
   ('xe', "I'm going to get a hook and with it I'll hook it with it.")]),
 ('keravo',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'hawk'),
   ('tkp', 'tarangau'),
   ('eng', 'Little Eagle (Hieraaetus morphnoides)'),
   ('sa', 'avaruka'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Kokioto keravo viaraia raga papapareveira.'),
   ('xp', 'Pinis i save plai tasol long antap.'),
   ('xe', '???'),
   ('ex',
    'Keravo ira tuura vegoro sovaraia toupareveira. Ra voa varuere kare upopareve oea aiopasia.'),
   ('xp',
    'Tarangau em i save stap long ol bik bus. na em i save kilim ol abus na kaikaim.'),
   ('xe', '')]),
 ('Kereaka',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('eng', 'name'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Kereaka uruia vaisiaro oisio uva ravireo rokopareveira.'),
   ('xp', 'Kereaka em i nem bilong ples istap long hap bilong san i go daon.'),
   ('xe', '???')]),
 ('kerekoi',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'fly'),
   ('tkp', 'liklik kain palai i gat makmak'),
   ('sf', 'FAUNA'),
   ('nt', 'small type found in jungle area'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Kerekoi votupaiveiro vegoaro.'),
   ('xp', 'Em i save i stap tasol long bik bush.'),
   ('xe', '???'),
   ('ex', 'Kerekoi-ia toupaiveira korikoriara oira avuaro-ia.'),
   ('xp',
    'Kerekoi emi wanpela liklik kain palai i save gat makmak long beksait bilong em.'),
   ('xe', '???')]),
 ('kerere',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'arrow made of sago leaf midrib'),
   ('tkp', 'supsup of i wokim long saksak nok'),
   ('cmt', 'Does this word actually exist? Is it misspelled?'),
   ('dt', '28/Oct/2005')]),
 ('kerereua',
  [('rt', 'kerere'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'arrow made of sago leaf midrib'),
   ('tkp', '???'),
   ('dt', '18/Feb/2004'),
   ('ex', 'Kerereua purapai aue iava tetevu guruva iapasia.'),
   ('xp', 'Ol i mekim spia bilong sut long em.'),
   ('xe', 'He is making a spear in order to shoot with it.')]),
 ('kerete',
  [('ps', 'POST'),
   ('ge', 'inside out'),
   ('ge', 'reverse'),
   ('tkp', 'tanim links'),
   ('dt', '23/Feb/2005'),
   ('cmt',
    "Described by Firchow, but haven't been able to find any examples")]),
 ('kerete',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'turn around'),
   ('tkp', 'tanim'),
   ('cmt', "Where's the subject agreement on the second example?"),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Vii keretea.'),
   ('xp', 'Mi tanim yu.'),
   ('xe', 'I am turning you around.'),
   ('ex', 'Ragai-ia pituoro ragai kerete.'),
   ('xp', 'Mi holim yu bai mi tanim yu.'),
   ('xe', '???'),
   ('ex',
    'Pita koie korirevo uva oira iava kovuto kereterevo oisio ra voava rereoviro tuitui kasi-ia.'),
   ('xp',
    'Pita em i katim pik na em i tanim bel bilong em olsem bai drai long paia.'),
   ('xe', '???'),
   ('ex', 'Oirato riakova kereterevoi oira-va vuri pitupitu purasia.'),
   ('xp', 'Man i tanim meri na mekim pasin nogut long em.'),
   ('xe',
    'The man turned around the woman in order to do naughty things to her.')]),
 ('Kerevaru',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'river'),
   ('tkp', 'wanpela raun wara'),
   ('nt',
    'river behind Sisisivi village which flows over high double waterfalls'),
   ('dt', '28/Jan/2005'),
   ('ex', 'Ukova kerevaru esis toupa sisivi.'),
   ('xp', 'Em wampela river em istap long sisivi villege.'),
   ('xe', 'It is a river that is in Sisivi village.')]),
 ('keri',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'make enemies with'),
   ('ge', 'reject friendship'),
   ('tkp', '???'),
   ('cm', '-va'),
   ('vx', '2'),
   ('arg', 'OBL'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Ragai-va keripauveira.'),
   ('xp', '???'),
   ('xe', 'You are always being my enemy.'),
   ('ex', 'Pita ragai-va keriparoveira oa iava viapau vearovira toupave.'),
   ('xp',
    'Pita i save birua wantaim mi, olsem na mitupela i no save stap gut.'),
   ('xe', '???')]),
 ('Keriaka',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('eng', 'name'),
   ('dt', '28/Jan/2005')]),
 ('kerikerisi',
  [('rt', 'kerisi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'partial'),
   ('ge', 'evaluate'),
   ('ge', 'judge carefully'),
   ('tkp', 'jas'),
   ('tkp', 'skelim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '12/Dec/2006'),
   ('ex', 'Vearovira rera vo reoaro kerikerisiri.'),
   ('xp', '???'),
   ('xe',
    'Mark his words well. Think carefully about his talk evaluating it.'),
   ('ex', 'Karurua reo kerikerisipato vo-va Rotokasi taere-ia.'),
   ('xp', 'Karuru em i man bilong jasim tok long Rotokas area.'),
   ('xe', 'Karuru is a judge from the Rotokas area.')]),
 ('kerio',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'coconut leaf'),
   ('tkp', 'lip bilong kokonas'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Sirosia riro keriopato auere pekuri purapasia.'),
   ('xp', 'Sirosia em boi bilong katim lip kokonat bilong wokim basket.'),
   ('xe', '???')]),
 ('kerioua',
  [('rt', 'kerio'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'coconut torch'),
   ('ge', 'stalk of palm leaf'),
   ('tkp', 'bumbum pangal'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Kerioua kovei opitato iava uva kakaeto upovoi.'),
   ('xp', 'Pangal bilong kokonat i pundaon na paitim pikinini.'),
   ('xe', 'A palm leaf stalk fell from the coconut tree and hit the boy.')]),
 ('keripaara',
  [('rt', 'keripa'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'enemies'),
   ('tkp', 'birua'),
   ('cmt', 'Double-check vowel length'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Keripara oisioa purapaive voari tuariri vo osia osia ra reivu ora kasipupe ra viapau ora voeava vearovira toupaive.'),
   ('xp',
    'Pasin birua ol i bin save mekim long taim bipo. Sapos tupela klen i kros bai ol i no inap stap gut.'),
   ('xe', '???')]),
 ('keripato',
  [('rt', 'keripa'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'enemy'),
   ('tkp', 'birua'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Viva keripato. Ira vii uposia auepaavoi vii-va.'),
   ('xp', 'Em i enemi blong yu. Em i laik paitim yu.'),
   ('xe', 'He is your enemy. He wants to fight with with you.'),
   ('ex', 'Keripato oirato ro ira iravu-va vurivira toupareve.'),
   ('xp', 'Birua man em i man istap nogut wantaim narapela man.'),
   ('xe', '???')]),
 ('kerisi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'evaluate'),
   ('tkp', 'skelim gut tok'),
   ('eng', 'discern'),
   ('eng', 'evaluate'),
   ('eng', 'judge talk or situation well'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Rera kerisiiva.'),
   ('xp', '???'),
   ('xe', 'They evaluated him.'),
   ('ex',
    'Kerisi oa oisio toupaivoi, vosia iravu vuria purareve, rariropai rara rera vo vavataro-ia pauavere va kerisisia vosia oirapa o kuuvua.'),
   ('xp',
    'Skelim gut, sapos wanpela man i mekim rong bai ol bikman i sindaun long hevi bilong em na skelim sapos i tru o nogat.'),
   ('xe', '???')]),
 ('kerisito',
  [('alt', 'kerisipato'),
   ('rt', 'kerisi'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'discerning person'),
   ('tkp', '???'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Kerisipato oirato ira oaravu-ia taraipaoro vara eveiparevoi oara karekepai.'),
   ('xp', 'Man i save lukim na luksave long ol samting i wok long kamap.'),
   ('xe', '???')]),
 ('kerisivira',
  [('alt', 'kerisipavira'),
   ('rt', 'kerisi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'wisely'),
   ('ge', 'discerningly'),
   ('tkp', '???'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Kerisivira reoparoi.'),
   ('xp', '???'),
   ('xe', 'He talks with good judgement.')]),
 ('keritara',
  [('rt', 'kerita???'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'trash'),
   ('tkp', 'pipia'),
   ('eng', 'trash'),
   ('eng', 'rubbish'),
   ('eng', 'garbage'),
   ('dt', '05/Feb/2005'),
   ('ex', 'Kouekare aio kovoroia keritara puraivoi.'),
   ('xp', 'Ol pik i kaikai nogut ol gaden bilong yu.'),
   ('xe', "Pigs eat your garden's garbage.")]),
 ('keriva',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'hawk feather'),
   ('tkp', '???'),
   ('nt', "used in coded leaf letter to mean keripa 'enemy'"),
   ('sa', 'keripa'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Keriva iria purapaiveira aue iava kokio  kare oruaro ra oira takuvuive oira tukeoro votovopasia kukue pupia vutaro-ia.'),
   ('xp',
    'Bilas ol i save mekim long gras bilong pisin. na ol i sae olgeta gras pisin na pasim wantaim. na putim long het bilong ol na singsing wantaim.'),
   ('xe', '???')]),
 ('keroroi',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'lean-to'),
   ('tkp', '???'),
   ('nt', 'archaic'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Keroroi kepa vao oa-ia viapau aue kovuaka vakukuearo-ia.'),
   ('xp', 'Haus i nogat rup long het bilong em.'),
   ('xe', '???')]),
 ('kerosiri',
  [('alt', 'karasiri'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'kerosene'),
   ('tkp', 'kerasin'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Pita ira ragai vaterevo karasiria uvare ragai oiraro vuravu viapau aviavipaevo.'),
   ('xp', 'Pita i givim mi kerosin bikos lam bilong mi i no lait.'),
   ('xe', "Peter gave me kerosene because my lamp won't light.")]),
 ('keru',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'harden like bone'),
   ('tkp', 'bun'),
   ('vx', '1'),
   ('cmt', "What's the telicity?"),
   ('dt', '28/Oct/2005'),
   ('ex', 'Semen keruovere.'),
   ('xp', '???'),
   ('xe', 'The cement will become hard as bone.'),
   ('ex',
    'Tasia tapiakara vagievo aveke kasi-ia uva rovae oa iava kerue uva viapau uvuipai va aiosia.'),
   ('xp',
    'Tasa em i mumuim kapiok long ston. na i paia olsem na i no inap long kaikaim.'),
   ('xe', '???')]),
 ('keru',
  [('rt', 'keru'),
   ('wf', 'kerua'),
   ('wf', 'keruara'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'bone'),
   ('tkp', '???'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Koue kerua.'),
   ('xp', 'Bun blong pig.'),
   ('xe', 'Pig bone.'),
   ('ex', 'Ragai-va kerua.'),
   ('xp', 'Em bun blong mi.'),
   ('xe', "That's my bone."),
   ('ex',
    'Keruara oirara rutu-ia toupaiveira oara voea vararo gorupiepaiveira ravokapaive.'),
   ('xp',
    'Ol bun istap long olgeta man ol dispela bun i save strongim bodi bilong taim ol i wokabaut.'),
   ('xe', '???')]),
 ('kerui',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'thin'),
   ('ge', 'bony'),
   ('ge', 'skinny'),
   ('tkp', 'bun nating'),
   ('vx', '1'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Em ro ira viapau sopeiavoi toupa.'),
   ('xp', 'Man i no gat mit i blong bodi.'),
   ('xe', '???'),
   ('ex',
    'Ragaia keruito ragoa-ia viapau varuaravai toupaveira ora aue tuga ragai vararo-ia.'),
   ('xp', 'Mi bun nating man mi nogat mit na gris istap long bodi bilong mi.'),
   ('xe', '???')]),
 ('keruiato',
  [('rt', 'keruia'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'determination'),
   ('tkp', 'man i save laik painim aut'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Keruiato ira oisio ruipaparoveira oaravu kekepareve o vara-ia tarai ruipaparo rutu.'),
   ('xp', 'Man bilong laik lukim ol samting na save tru tru.'),
   ('xe', '???')]),
 ('keruito',
  [('rt', 'kerui'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'skinny person'),
   ('ge', 'thin person'),
   ('tkp', '???'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Vosia viapau Pita vearovira aiopareve rakeruitovira toupareve.'),
   ('xp', 'Sapos Pita i no kaikai gut bai em i stap olsem bun nating man.'),
   ('xe', '???')]),
 ('kerupiua',
  [('rt', 'kerupi???'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'tree'),
   ('tkp', '???'),
   ('cmt', 'archaic?'),
   ('nt', 'is a source of salt when burned'),
   ('dt', '28/Oct/2005')]),
 ('keruria',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'persistent'),
   ('ge', 'stubborn'),
   ('ge', 'determined'),
   ('tkp', 'strong yet'),
   ('vx', '1'),
   ('nt', 'archaic?'),
   ('cmt', 'Does this take a sentential predicate? Can it occur alone?'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Keruriaparoi avaarapa.'),
   ('xp', '???'),
   ('xe', 'He is determined not to go.')]),
 ('keruriato',
  [('rt', 'keruria'),
   ('ps', '???'),
   ('ge', 'determination'),
   ('tkp', 'strong pasin'),
   ('dt', '18/Feb/2004')]),
 ('keruriavira',
  [('alt', 'keruriapavira'),
   ('rt', 'keruria'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'persistently'),
   ('ge', 'obstinantly'),
   ('tkp', '???'),
   ('dt', '28/Aug/2005')]),
 ('kesi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'limp'),
   ('tkp', 'wokabaut nogut'),
   ('cmt', 'No example'),
   ('vx', '1'),
   ('dt', '08/Dec/2005')]),
 ('kesie',
  [('ps', '???'),
   ('ge', 'yellow'),
   ('tkp', 'yelo'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Sem Akoitai vokeparo iava kaapara kesievira toupaiveira.'),
   ('xp', 'Kapa long haus bilong Sam Akoitai i yelo.'),
   ('xe', '???')]),
 ('kesievira',
  [('rt', 'kesie'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'yellow'),
   ('tkp', 'yelo'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Sem Akoitai vo kepaaro iava kaapara kesievira toupaiveira.'),
   ('xp', 'Kapa long haus bilong Sam Akoitai i yelo.'),
   ('xe', "The roof of Sam Akoitai's house is yellow.")]),
 ('kesikea vaaguru',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'Yellow Mangrove (Ceriops tagal)'),
   ('tkp', '???'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Kesiea vaagurua oa vo toupaiveira avakava sirova.'),
   ('xp', 'Yelopela mangur i save stap arere long solwara.'),
   ('xe', 'The Yellow Mangroves live along the coast.')]),
 ('kesio',
  [('ps', '???'),
   ('ge', 'clever'),
   ('tkp', 'i save tumas'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Kesiotoa rutuvi vosia ira iparo evaovaia via kesioti arutu vigo gareaia ipau iroa.'),
   ('xp', 'Yu man tru yu clarem no yo go antap long liplip rop tasol.'),
   ('xe', '???'),
   ('ex', 'Kesio rugoirara vearovira rutu oaravu purapaiveira.'),
   ('xp', 'Ol man i gat gutpela sae ol i save mekim gut ol samting.'),
   ('xe', '???')]),
 ('kesioto',
  [('rt', 'kesio'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'clever man'),
   ('tkp', 'man i save tumas'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Kesioto ro oirato ira riroa vatoupare taraia. Oara rutu purapasia votara iava.'),
   ('xp',
    'Man save i gat bikpela save em i save stap wantaim em i kain wokim ol kain wok long dispela save.'),
   ('xe', '???')]),
 ('kesivira',
  [('alt', 'kesipavira'),
   ('rt', 'kesi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'limping'),
   ('tkp', '???'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Kesivira tasiparevoi.'),
   ('xp', '???'),
   ('xe', 'He steps in a limping way.'),
   ('ex',
    'Avukato kesivira tasiparevo uvare riroa kapua rera kokotoaro-ia toupaivo.'),
   ('xp',
    'Lapun man i no wokabaut gut tumas bikos em i gat bikpela sua long lek bilong em.'),
   ('xe', 'The old man steps with a limp because many sores are on his leg.'),
   ('ex', 'Ruke kesivira raga rakoru-ia pituparevo uvare ururiparoe oira-pa.'),
   ('xp', 'Luk i no holim gut snek bikos em i wok long poretim.'),
   ('xe', '???'),
   ('ex', 'Kesivira vokapaoro avaroe Visia kokotu rausia.'),
   ('xp',
    'Visa em i wokabaut long ol pinga bilong lek bilong em long holim pasim kakaruk.'),
   ('xe', '???')]),
 ('keta',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'comb hair'),
   ('tkp', 'komim'),
   ('dx', 'Pipipaia'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '31/Jul/2005'),
   ('ex', 'Orui ketaparevoi kakaeto.'),
   ('xp', '???'),
   ('xe', 'The boy is combing his hair.')]),
 ('ketaka',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'notch out'),
   ('ge', 'make groove'),
   ('tkp', 'wokim liklik baret'),
   ('arg', 'O'),
   ('vx', '2'),
   ('nt', 'The notched end of the long pole is used to hold fruit'),
   ('dt', '28/Oct/2005'),
   ('ex', 'Tuuta ketakareva kovurui tovosia va-ia.'),
   ('xp', '???'),
   ('xe', 'He notched the pole in order to put up the rafter with it.'),
   ('ex',
    'Ragai tuuta ketakaarava-ia ravaivara-ia gareavi tovoa evaoa ra kepa puraa.'),
   ('xp',
    'Mi katim pos na mekim spes bai mi putim liklik diwai long na wokim.'),
   ('xe', '???')]),
 ('ketakaa',
  [('rt', 'ketaka'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'groove'),
   ('ge', 'niche'),
   ('tkp', '???'),
   ('cmt', 'Check vowel length'),
   ('dt', '28/Oct/2005')]),
 ('ketato',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'tusk of pig'),
   ('tkp', 'tit bilong pik'),
   ('dt', '28/Oct/2005'),
   ('cmt', 'Bad ex. sentence'),
   ('ex', 'Koueto ketato.'),
   ('xp', 'Pig i gat teeth i kam autsait.'),
   ('xe', '???'),
   ('ex', 'Vosia ketato orapuraro koietoa-ia rarirovira kasipuparo.'),
   ('xp', 'Sapos bikpela tit i kamap pik man em bai kros tumas.'),
   ('xe', 'When tusks appear on the male pig, he gets ??? angry.')]),
 ('ketoo',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'plant which came up from seed'),
   ('tkp', '???'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Kovato ira purapa roviroveira vosia evao ara iava kueisiro keeto ravaisi kakuve teetoa vusioro.'),
   ('xp',
    'Kur i sae kamap sapos sit bilong ol diwai i bruk na kur bai kamap.'),
   ('xe', '???')]),
 ('ketoo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'sprout from seed'),
   ('tkp', 'kru i kamap'),
   ('dcsv', 'true'),
   ('vx', '???'),
   ('arg', '???'),
   ('cmt', 'Check vowel length. Double-check verb class. A or B?'),
   ('dt', '28/Oct/2005'),
   ('ex',
    'Kukuato aiava kovei aisivu raisuarore uva rakaria eruei oire ketoovoi.'),
   ('xp',
    'Long diwai galip wanpela galip i pundaun na skin bilong em i sting orait i grow.'),
   ('xe', '???'),
   ('ex',
    'Vosia kakaupau ruipapau ra kakau takuraro keetopie rovorivere rakovape.'),
   ('xp', 'Sapos yu laik planim kakau pastaim yu mas mekim kur i gro.'),
   ('xe', '???')]),
 ('ketoopie',
  [('rt', 'ketoo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make sprout from seed'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Ka kau takuraro ketopie.'),
   ('xp', 'Seed blong cocaa em I igat sprout long seed.'),
   ('xe', '???'),
   ('ex', 'Auero rutu ketoopiepareveira.'),
   ('xp', '???'),
   ('xe', 'He makes all things (plants) sprout from seeds.')]),
 ('ketoopieara',
  [('rt', 'ketoopie'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'sprouts'),
   ('ge', 'buds'),
   ('tkp', '???'),
   ('cmt', 'Double-check vowel length'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Ketoopiea vao oa rovuvira pukupaive evao kakura iava oisio kovape.'),
   ('xp', 'Samting i solap liklik long pikinini diwai olsem bai gro.'),
   ('xe', '???')]),
 ('ketoroa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('dx', 'Central'),
   ('ge', 'kettle'),
   ('tkp', 'ketol'),
   ('eng', 'kettle'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '30/Apr/2006')]),
 ('ketu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'break.off'),
   ('ge', 'break off a piece'),
   ('tkp', 'brukim liklik hap'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '17/Oct/2005'),
   ('ex', 'Vii ragai-pa opooavai ketu uvare vii riroa rutu aiopari.'),
   ('xp', 'Yu brukim hap taro bilong mi bikos yu kaikaim bikpela hap.'),
   ('xe', '???')]),
 ('kevaita',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'kid'),
   ('ge', 'joke'),
   ('ge', 'jest'),
   ('tkp', 'tok pilai'),
   ('vx', '1'),
   ('dt', '28/Jan/2006'),
   ('ex', 'Oiso ruipaparai ra orakevaitaragave.'),
   ('xp', '???'),
   ('xe', 'I want to just joke around between us.'),
   ('ex',
    'Ragai kevaitaparai uva vii kasipuuei  osia vii kevaitapaavoi aioarepaoro.'),
   ('xp',
    'Mi wok long pilai tasol yu kros. Mi pilai long yu long askim kaikai.'),
   ('xe', 'I am kidding and you are angry as I kid you about giving food.')]),
 ('kevaita',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'kid'),
   ('ge', 'joke'),
   ('ge', 'jest'),
   ('tkp', 'tok pilai'),
   ('cmt', 'double check gloss'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '11/Dec/2006'),
   ('ex', 'Vii kevaitapaavoi aio arepaoro.'),
   ('xp', 'Mi pilai long yu long askim kaikai.'),
   ('xe', "I'm joking with asking for food.")]),
 ('kevaitato',
  [('rt', 'kevaita'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'jester'),
   ('ge', 'kidder'),
   ('tkp', '???'),
   ('nt', 'In the good sense of the word jokester.'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Oiratoa riro kevaitato oaravu puraporo oara viapau oiraopai.'),
   ('xp',
    'Man em man bilong tok pilai tasol long ol narapela samting em i save giaman.'),
   ('xe',
    "The man is a big joker making up things, things which aren't true."),
   ('ex',
    'Sisiovara riro ora kevaitairara ovusia ovutarovu-ia orakasipupiepaaveira rara putepaive ora kevaitapaoro.'),
   ('xp',
    'Ol Sisiovara ol i save tok pilai tumas long ol yet. Tasol sampela taim ol i save koros taim pilai bilong ol i go moa yet.'),
   ('xe',
    'People from Sisiovara are big jokers but sometimes they anger each other going too far with the joking.')]),
 ('kevira',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'dog'),
   ('tkp', 'dok'),
   ('dt', '31/Oct/2005'),
   ('cmt', 'How does this differ from kaakau?'),
   ('ex', 'Vo Siureko-ia uva rirovira kevira kare toupaiveira.'),
   ('xp', 'Long ples Siureko planti dok i save stap.'),
   ('xe', 'Many dogs are in Siureko.')]),
 ('kevisi',
  [('ps', 'N'),
   ('pt', '???'),
   ('dx', 'Central'),
   ('ge', 'cabbage'),
   ('tkp', 'kebis'),
   ('eng', 'cabbage'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '17/Nov/2005')]),
 ('kevoisi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'persistent'),
   ('ge', 'determined'),
   ('tkp', '???'),
   ('dt', '31/Oct/2005'),
   ('vx', '1'),
   ('cmt',
    'Does this take a sentence complement? Is it aspectual, "continue, keep" (e.g. blijf in Dutch)?'),
   ('ex', 'Kevoisiparoi va puraarapa reasipaoro.'),
   ('xp', '???'),
   ('xe', "He is determined not to do what he doesn't want to do."),
   ('ex',
    'Pita upiare kevoisipaoro kepa puraparevo oisio teapi kepa viapau opesipe rara upia rera rerepieve.'),
   ('xp',
    'Pita i sanap strong long sik na em i wokim haus. Olsem nogut sik i daunim em na haus i no inap pinis.'),
   ('xe', '???')]),
 ('kevoisivira',
  [('alt', 'kevoisipavira'),
   ('rt', 'kevoisi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'determined-like'),
   ('tkp', '???'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kevoisivira vovoupata eisi piepasa.'),
   ('xp', '???'),
   ('xe', 'You all desire determinedly to do it like that.'),
   ('ex',
    'Kevoisivira rera vurapaavo oisio rutu uvuipa ra pirutuva-ia iviroro ovaratavu iare.'),
   ('xp',
    'Mi lukim em olsem tru em nap strong long kalapim wara i go long hap sait.'),
   ('xe',
    'I saw him determined that he would cross the ??? to the other side.')]),
 ('kie',
  [('ps', 'EXCL'),
   ('ge', 'Watch out!'),
   ('ge', 'Be careful!'),
   ('tkp', 'lukaut'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Ora vatatopou e vatatopauei avu.'),
   ('xp', 'Yu must lukaut yu mas lukout gut.'),
   ('xe', '???'),
   ('ex', 'Aakova kakaeto tavipaeva oisio, Kie! Aiteto vii ragisia uriopa.'),
   ('xp', 'Mama i tokim pikinini olsem, Lukaut! Pap bai kam stikim yu.'),
   ('xe', "Mama told her boy, Watch out! Dad's going to come whip you.")]),
 ('kii',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'short of'),
   ('ge', 'lacking'),
   ('tkp', '???'),
   ('cmt',
    "Meaning in example sentence is opposite of Firchow's translation!"),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('dt', '31/Oct/2005'),
   ('cmt', 'Check construction type.'),
   ('ex', 'Kiiei oirara arova aioa.'),
   ('xp', '???'),
   ('xe', "There isn't enough food for people."),
   ('ex',
    'Kepa kiiei rutu oirara iava uva viapau ragai tavukiavai ra uusira.'),
   ('xp', 'Haus i pulap tru long ol man na mi nogat spes bilong slip.'),
   ('xe', 'The house is full of people and I have no ??? to sleep.')]),
 ('kiikariko',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'type of bird'),
   ('tkp', '???'),
   ('nt', 'duck-like, webbed feet and flat bill, lives in shallow holes, ???'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kiikariko iriavu kokiova iria eriara-ia toupaeveira. Oisio kokotova osia pato kare.'),
   ('xp',
    'Kiikariko em i wanpela kain pisin i save stap insait ol hul i go insait na lek bilong em i olsem pato.'),
   ('xe', '???')]),
 ('kiipie',
  [('rt', 'kii'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'cause to be short of'),
   ('ge', 'cause to be lacking'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Riro irara aioara kiipieivo tavete irara ragai-re.'),
   ('xp', '???'),
   ('xe', 'The visitors caused me to be short of food.')]),
 ('kiire',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'play tag'),
   ('tkp', '???'),
   ('vx', '1'),
   ('ex', 'Kakaevure kiirepaaepa.'),
   ('xp', '???'),
   ('xe', 'The children were playing tag.'),
   ('dt', '01/Jun/2005')]),
 ('kiire',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'tag'),
   ('tkp', 'wanpela kain pilai'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kiire purapaiveira kakaevure vosia iravu kakaeto iravu-ia pitureve raeraita iravu va tarioro.'),
   ('xp',
    'Tag em wanpela kain pilai ol mangi i save mekim. Wanpela mangi bai tasim narapela na em bai ran bihainim narapela mangi ???.'),
   ('xe', 'Children play tag as one boy hold ??? and another chases it.')]),
 ('kiiru',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'walking stick'),
   ('ge', 'cane'),
   ('tkp', 'stik bilong wokabaut'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kiiru iria kukuearo-ia otuotua toupaiveira iria purapaiveira aue iava koosiva.'),
   ('xp',
    'Stik bilong wokabaut i gat sap long het bilong em. Ol i save mekim wanpela kain limbum.'),
   ('xe', '??? which they make from ???.')]),
 ('kiiuto',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'type of ant'),
   ('tkp', 'anis'),
   ('dt', '01/Sep/2005'),
   ('ex',
    'Rupavira toupare kiiuto, vegoaro toupai vasito sovaraia, vigei aiopaiveira.'),
   ('xp', 'Em i bikpela anis i save kaikai yumi.'),
   ('xe',
    'This type of ant stays ???, lives in the jungle on the ground. They bite us.')]),
 ('kiki',
  [('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'kick'),
   ('tkp', 'kikim'),
   ('nt', 'Tok Pisin loan'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Pita peripa isi-va avaroi va isi kikisia kakae vure iare.'),
   ('xp', 'Pita i kisim bol i go na em i kikim i go long ol pikinini.'),
   ('xe', 'Peter is going with the ball to kick it to the boys.')]),
 ('kiki',
  [('cl', 'isi'),
   ('ps', 'N'),
   ('pt', 'CL'),
   ('ge', 'ball'),
   ('tkp', 'bal'),
   ('dt', '28/Jul/2005')]),
 ('kikipi',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('dx', 'Pipipaia'),
   ('ge', 'sago.midrib'),
   ('eng', 'sticks within sago stalk'),
   ('eng', 'arrow made of sago midrib'),
   ('tkp', 'rap bilong saksak'),
   ('tkp', 'supsup bilong saksak'),
   ('cmt', 'Check vowel length of final vowel: kikipi or kikipii?'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kikipi ira toupareveira tetevu guruvaro vuruvuru-ia roia. Ira verapaiveira ravo guruvaro turapaive.'),
   ('xp',
    'Stik i save stap namel long ol lip saksak ol i save rausim dispela stik na samapim ol lip saksak.'),
   ('xe', '???')]),
 ('kikipisi',
  [('rt', 'kikipi'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'iron spear'),
   ('ge', 'iron rod'),
   ('tkp', 'spia'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kikipisi aearisi riro kaekae isi oara purapaiveira popotepairara.'),
   ('xp', 'Hap haeian i longpela ol waitman i save wokim.'),
   ('xe', '???')]),
 ('kikira',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'mix meat and greens'),
   ('tkp', 'abusim abus na savor'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Oirato koorato kikirarevoi arua tai tapo rera orisia.'),
   ('xp', '???'),
   ('xe',
    'The man is mixing the possum with green vegetables in order to cook it.')]),
 ('kikiraeko',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'spider'),
   ('tkp', 'spaida'),
   ('nt',
    'Has green to yellow body, black legs and head, makes very large ???. It is much like the akave spider but larger.'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kikiraeko iriavu vaisiaro akave iria vararo-ia kokovara ora vurivuria toupaiveira. Rupara tapo kokotoara, oara kukue riropiepaiveira.'),
   ('xp',
    'Kikiraeko i nem bilong spaida i gat grinpela na yelo bodi na i gat ol blakpela lek i save mekim het i bikpela.'),
   ('xe', '???')]),
 ('kikisi',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'midrib of sago frond'),
   ('tkp', 'nok'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kikisi ira tetevu guruvaro isivaroia toupareveira ira verapaiveira vara turapasia ra vara-ia kepara purapaive.'),
   ('xp',
    'Bun i save stap beksait long lip saksak. Ol i save rausim na somapim saksak na mekim haus long em.'),
   ('xe', '???')]),
 ('kikisikae',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'broom'),
   ('tkp', 'brum'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kikisikae oa purapaiveira aue iava tetevu ora aue opita.'),
   ('xp', 'Brum ol i save mekim long stik bilong saksak na long kokonas.'),
   ('xe', 'Brooms are made from sago and coconut.')]),
 ('kikisiova',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'bow made for children'),
   ('tkp', 'banara bilong pikinini'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kikisiova koetava iria purapaiveira oira kakaero-pa aue iava veeta pariero.'),
   ('xp',
    'Liklik bunara ol i save mekim bilong liklik mangi man. Ol i save mekim long ol hap mambu.'),
   ('xe', '???')]),
 ('kikitausi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'tear off with teeth'),
   ('tkp', 'brukim'),
   ('nt', 'There is humor associated with the use of this word in this way.'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Oirato koue sope kikitausipare va aiopaoro.'),
   ('xp', '???'),
   ('xe', 'The man is tearing the flesh of the pig off as he is eating it.'),
   ('ex', 'Kakau kokai kikitausirevoi kurivira rutu rera aiooro.'),
   ('xp', 'Dok i bagarapim tru kakaruk taim em kaikaim.'),
   ('xe', 'The dog ripped flesh from the rooster while eating him.')]),
 ('kikoo',
  [('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'second-born'),
   ('tkp', 'numba tu pikinini'),
   ('eng', 'second-born (but not last) child'),
   ('cmt', 'What if someone is both second-born and last-born?'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Sira kikoova Vaeako sirovapava aue iava uvare Vaeako irapavira kavauopa uva Sira oira sirova kavauopa.'),
   ('xp',
    'Sira em i namba tu bikos Vaeako i bin bon pas na Sira i bon bihain long em.'),
   ('xe',
    'Sira is number two and Vaeako is ??? because Vaeako was born first and Sira was born ???.')]),
 ('kilia',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'clarify'),
   ('tkp', 'kiliaim'),
   ('nt', 'Tok Pisin loan'),
   ('arg', '???'),
   ('vx', '???'),
   ('cmt', 'Example needed'),
   ('dt', '08/Jun/2005')]),
 ('kio',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'nudge'),
   ('tkp', 'tasim'),
   ('eng', 'attract attention by touching, tapping, or scratching'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Ragai kiorevoi.'),
   ('xp', '???'),
   ('xe', 'He nudged me gaining my attention (discreetly).'),
   ('ex', 'Oirato riakova kiorevoi sipareo-ia oira uvuru sovaraia.'),
   ('xp', 'Man i tasim meri long pinga name long planti man.'),
   ('xe', 'He man touched the woman with his finger ???.')]),
 ('kipa',
  [('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'wan samting tru'),
   ('tkp', '???'),
   ('vx', '???'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Sikuruapa kipaavora laibri vukuro vateoro.'),
   ('xp',
    'Wanpela samting tru ia ol i bin mekim long givim ol laibri buk long skul.'),
   ('xe', '???')]),
 ('kipapie',
  [('rt', 'kipa'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'wan samting tru'),
   ('tkp', '???'),
   ('vx', '???'),
   ('dt', '22/Nov/2005'),
   ('ex',
    'Oirato riakova-pa kipapieiraoreva uvare oira vateieva sigoa uva va raga vokapaeveira viapau rava kavupaeve uvare viapau oira oisi oai.'),
   ('xp',
    'Man i bin mekim wanpela samting tru long taim em givim meri long wanpela naip. Na em i save wokabaut wantaim dispela naip tasol bikos em i nogat wanpela naip olsem.'),
   ('xe', '???')]),
 ('kipe',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'cut grass with a sickle'),
   ('tkp', 'saripim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '31/Oct/2005'),
   ('cmt', 'What does varao mean in example?'),
   ('ex', 'Kokapa kou kipe varao.'),
   ('xp', 'Yu katim grass.'),
   ('xe', 'Cut the grass.'),
   ('ex', 'Pita isisio kou kipere uvare kepa ruvaraia toupaivoi.'),
   ('xp', 'Pita em i katim gras bikos i stap klostu long haus.'),
   ('xe', 'Peter is cutting the grass because it is close to the house.')]),
 ('kipekipe',
  [('rt', 'kipe'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'point of land in water point at base of fridge'),
   ('tkp', 'poin'),
   ('dt', '31/Oct/2005'),
   ('cmt', 'Double check original'),
   ('ex', 'Uruvaovi ora Ivitu vo kipekipeo puraeve vo oraaivaropieovo.'),
   ('xp', 'Uruvaori riva na Ivitu riva bung na mekim dispela point yia.'),
   ('xe', 'Uruvaori and Ivitu meet to make a point here.'),
   ('ex',
    'Uruvaovi vaio ora Kikiovi airea Vurairiva-ia kipekipea puraereva oisio osia kariava.'),
   ('xp',
    'Uruvaovi wantaim Kikiovi tupela i bin mekim Vurairiva poin olsem palai.'),
   ('xe',
    'Uruvaovi and Kikiovi, the two of them made Vurairiva Point like a lizard.')]),
 ('kipekipea',
  [('rt', 'kipekipe'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'point of land in water'),
   ('ge', 'point at base of ridge'),
   ('ge', 'spine of lizard'),
   ('tkp', '???'),
   ('dt', '18/Feb/2004')]),
 ('kipeto',
  [('rt', 'kipe'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'sickle'),
   ('ge', 'scythe'),
   ('tkp', 'scrip'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Oavu varao iava kovoparaiva oa porokovira toupaiveira oisio osia siarepe isisio kipeto.'),
   ('xp',
    'Wanpela hap samting bilong ol samting bilong wok olsem gras naip bilong katim gras.'),
   ('xe', '???')]),
 ('kipu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'paint'),
   ('tkp', 'penim'),
   ('eng', 'paint'),
   ('eng', 'smear on surface'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Voea kipupaive vo taritaro-ia varao opita taritaro.'),
   ('xp', '???'),
   ('xe', 'They would smear them with the mashed coconut meat.'),
   ('ex',
    'Ragai varaaro kipuivo varasiri-ia uvare kasikasi ragai-ia orapuraroe.'),
   ('xp',
    'Ol i penim bodi bilong mi long marasign bikos kaskas i kamap long bodi bilong mi.'),
   ('xe',
    'They painted my body with medicine because scabies appeared on me.')]),
 ('kipukipu',
  [('rt', 'kipu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'rub.on'),
   ('tkp', 'rabim'),
   ('eng', 'rub on'),
   ('eng', 'smear on'),
   ('eng', 'massage'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Dec/2005'),
   ('ex',
    'Pita ira opita gatao-ia orakipukipu uvare poupouare toupaivoi rera kagavearo-ia.'),
   ('xp',
    'Pita em i rabim rabim meme bilong kokonat bikos ol das i stap long pes bilong em.'),
   ('xe', 'Peter painted himself with coconut ??? because ??? on his face.'),
   ('ex', 'Orakipukipuae aue-ia kukupira pupisia.'),
   ('xp', 'Ol i rabim kle long skin bilong singsing kaul.'),
   ('xe', 'They rubbed ??? on themselves for the singsing.')]),
 ('kipupaa',
  [('rt', 'kipu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'paint brush'),
   ('tkp', 'bros bilong pen'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Uriou kipupaa-va eva ragai vaaro.'),
   ('xp', 'Kam wantaim bras bilong mi.'),
   ('xe', 'Come with that brush of mine.')]),
 ('kipupato',
  [('rt', 'kipu'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'painter'),
   ('tkp', 'pen man'),
   ('avm', 'pa_required'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kipupato ro ira kepa kipupa.'),
   ('xp', 'Man blong paintim ol house.'),
   ('xe', 'A painter is one who paints houses.'),
   ('ex', 'Vokipavira kipupato urioparoi kepa kipusia.'),
   ('xp', 'Tumoro bai man bilong penim haus bai kam napenim haus.'),
   ('xe', 'Tomorrow the painter is coming to paint the house.')]),
 ('kipuvira',
  [('alt', 'kipupavira'),
   ('rt', 'kipu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'smeared'),
   ('tkp', 'bagarap long pen'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Vao varoa kipuvira toupai aue iava peri.'),
   ('xp', 'Dispela laplap em i gat paint long em.'),
   ('xe', 'This clothing is smeared with paint.')]),
 ('kira',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'kina'),
   ('tkp', 'kina'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kira oa-ia Papua Niugini siovaraia.'),
   ('xp', 'Kina ol i yusim insait long Papua Niugini.'),
   ('xe', 'The kina is (used) inside of Papua New Guinea.')]),
 ('kirava',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', '???'),
   ('tkp', 'no pilim gut'),
   ('vx', '1'),
   ('cmt',
    'Need to investigate meaning--Firchow\'s translation ("oblivious") seems wrong!'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kakaeto kiravaroi uvare viapau aakova ikauvira rera sisiuevo oa iava rirovira gauparevoi.'),
   ('xp',
    'Pikinini i no pilim gut bikos mama i no wasim em haraiap olsem na em i krai.'),
   ('xe', '???')]),
 ('kire',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'play tag'),
   ('tkp', 'wanpela kain pilai'),
   ('vx', '???'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kire kakaevure visikopareira kirepaoro.'),
   ('xp', 'Em wanpela kain pilai bilong ol pikinini.'),
   ('xe', '???'),
   ('ex',
    'Kire iria purpaiveira oratariopaoro rairavu vatereve raera ita iravuva tarioro.'),
   ('xp',
    'Em wanpela pilai ol i save ron bai narapela i givim narapel bai givim gen ran bihainim narapela.'),
   ('xe', '???'),
   ('ex',
    'Pita kirepa Tomas tapo ora rera araokoirararo. Tomas Pita vaterevoi rera-ia pituoro uva viapau uvuipa ra vorevira oira vatereve uvare viapau uvuiparoi ra ikaupareve.'),
   ('xp',
    'Pita i pilai tas wantaim Tomas na ol brata bilong em. Tomas em i givim Pita taim e i holim em na em i no inap givim bek gen bikos em i no inap ran.'),
   ('xe', '???')]),
 ('kiri',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'rip open'),
   ('ge', 'tear open'),
   ('tkp', 'bruk'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Araisi ruu kirikirirevoi.'),
   ('xp', '???'),
   ('xe', 'He rips open the rice bag.'),
   ('ex', 'Eapu kepa kiripaa eva ra avaave evaova kavuoro.'),
   ('xp', 'Bai mi brukim haus bilong kurakun na bai ol lusim diwai na go.'),
   ('xe', 'I will break open the ant house and they will leave the tree.')]),
 ('kirikaokao',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bird'),
   ('tkp', '???'),
   ('eng', 'Dollarbird (Eurystomus orientalis)'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Kirikaokao iria kokovaravira avuaua toupaivera ari putaearei reroaro avaka toupaiveira ora popoteara. Evaraiava vearopie kekepaoveira.'),
   ('xp',
    'Kirikaokao em wanpela dola pisin beksait bilong em i gat grinpela gras. Na handanit long tupela wing bilong em i gat brupela gras na long bel i gat ol liklik waitpela gras. Long ol dispela kala em i save luk nais tru.'),
   ('xe', '???')]),
 ('kirioto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'vulva and all parts inside'),
   ('tkp', 'kan'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Kirioto riakora iava osia sovaraia oaravu toupaivoi.'),
   ('xp', 'The vulva of a woman is ???.'),
   ('xe', '???')]),
 ('kiro',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'write'),
   ('tkp', 'rait'),
   ('tkp', 'raitim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Vii vaisiaro kirori.'),
   ('xp', 'Raitim nem bilong yu.'),
   ('xe', 'Write your name.'),
   ('ex', 'Ragai rovoparai kiropaoro vukua-ia.'),
   ('xp', 'Mi wok long stat long rait long buk.'),
   ('xe', 'I am starting to write in the book.')]),
 ('kirokiro',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bush used for sorcery'),
   ('tkp', 'tanget raft'),
   ('nt', '???'),
   ('cmt',
    'Kirokiro em wanpela kain tanget ol i save yusim long mekim ol kainkain marasin tumbuna'),
   ('dt', '31/Oct/2005')]),
 ('kirokiro',
  [('rt', 'kiro'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'write'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '31/Oct/2005'),
   ('ex', 'Ragai kirokiropaa pepa ivaraia.'),
   ('xp', 'Mi wok antap long teibol.'),
   ('xe', 'I am writing on paper.')]),
 ('kirokiropato',
  [('rt', 'kirokiro'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'pencil'),
   ('ge', 'pen'),
   ('ge', 'typewriter'),
   ('tkp', 'pensil'),
   ('avm', 'pa_required'),
   ('sc', 'INSTRUMENT'),
   ('dt', '31/Oct/2005'),
   ('ex',
    'Vosia viapau kirokiropatoavai toupareve ra viapau uvuipauei oavuavuvai karekepe pepa-ia.'),
   ('xp', 'Sapos pensol i no stap bai yu no gat samting bai kamap long pepa.'),
   ('xe', "If there isn't a pencil, ???.")]),
 ('kiroko',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'clock'),
   ('tkp', 'kilok'),
   ('nt', 'Tok Pisin loan'),
   ('ig', 'yes'),
   ('dt', '04/Oct/2004')]),
 ('kiru',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'have sore near mouth'),
   ('tkp', 'i gat soa long maus'),
   ('vx', '1'),
   ('cmt', 'Is there a transitive version?'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Orakiruroe rao aveke-ia eisi Ivitu sisiupaoro.'),
   ('xp', 'Em bin mekim soa long ston long Ivitu.'),
   ('xe', '???')]),
 ('kirukiru',
  [('rt', 'kiru'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'crisp'),
   ('tkp', 'drai pinis'),
   ('vx', '1'),
   ('dt', '01/Nov/2005'),
   ('cmt', "What's the -a doing in kirukiru?"),
   ('ex', 'Korasiope koue sope vao kirukirua.'),
   ('xp', 'Mit blong kapur na meat blong pig em stongpela mit tru.'),
   ('xe', '???'),
   ('ex',
    'Vao kirukiruiraopai koie sope uva viapau uvuipai reuriara-ia va taritasia.'),
   ('xp',
    'Dispela mit bilong pik i strong tumas olsem na mi no inap memeim long ol tit.'),
   ('xe', "The pig meat is ??? and I can't chew it with my teeth.")]),
 ('kirukirua',
  [('rt', 'kiru'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'biscuit'),
   ('ge', 'ground food parched on fire'),
   ('tkp', 'bisket'),
   ('dt', '01/Nov/2005'),
   ('cmt', 'Check original--example seems messed up.'),
   ('ex', 'Kirukirua vao oa viapau ukovira aipavi.'),
   ('xp', 'Oiso osa biscuit kaikai i no gat wara instap insat long em.'),
   ('xe', '???'),
   ('ex',
    'Aioa oisio osia aue visikete kirukirua oa-ia viapau aue uukoavai va-ia.'),
   ('xp', 'Kaikai olsem biskit i drai i nogat wara long em.'),
   ('xe', '???')]),
 ('kirupato',
  [('rt', 'kiru'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'person with sores around the mouth'),
   ('tkp', 'man i gat soa long maus'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Oivato/riarova kirupato-vii vigoa-ia kapuara taupai.'),
   ('xp', 'Nom on meri isat sore long em.'),
   ('xe', '???'),
   ('ex',
    'Oirato ira gisipoaro-ia tarivira toupaive aue kapua. Oire reraera kirupato.'),
   ('xp', 'Man sua i raonim maus bilong em. Orait em dispela em mas sua man.'),
   ('xe', '???')]),
 ('kitoiva',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'crayfish'),
   ('ge', 'shrimp'),
   ('tkp', '???'),
   ('nt', 'Very small shrimp, comes in with whitebait on surface of ocean.'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Kitoiva votoupaeveira avakava-ia garepavi kavoriva.'),
   ('xp', 'Em liklik kindam em i save i stap long solwara.'),
   ('xe', '???'),
   ('ex',
    'Kitoiva iria avakava iava uriopaoveira. Ra uukovi sirova ipapao. Ra voa oira oupaive oira oiripasia. Oire oira aioive.'),
   ('xp',
    'Liklik kain kindam i save kam long sol wara. Na bihainim wara i go antap. Na ol i save kisim na kukim kaikai.'),
   ('xe', '???')]),
 ('kitu',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'store'),
   ('ge', 'save'),
   ('tkp', 'hipim'),
   ('tkp', 'lusim hap bilong bihain'),
   ('vx', '???'),
   ('arg', '???'),
   ('cmt',
    'Need to double-check classification--old example (deleted) looked like class B'),
   ('cmt', 'Need a better example'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Ragai vao-ia kiturai aioa vore osia ogoera ra va aioa.'),
   ('xp',
    'Mi lusim dispela hap kaikai bilong bihain. Na sapos mi hangere bai mi kaikaim bihain.'),
   ('xe', '???')]),
 ('kitukitu',
  [('rt', 'kitu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'scrub clothes'),
   ('tkp', 'sikirapim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cmt', "Doesn't seem related to kitu--may not be a reduplication of kitu"),
   ('dt', '01/Nov/2005'),
   ('ex', 'Vearovira varoara kitukituerevora.'),
   ('xp', 'Tupela meri i wasim gut laplap.'),
   ('xe', 'The two of them washed the clothes well.'),
   ('ex',
    'Vosia varoa kitukituragari osia viapau aue siopu ra viapau vearo kekepape.'),
   ('xp',
    'Sapos yu wasim nating laplap taim i nogat sop, laplap bai i no nap luk gut.'),
   ('xe', "If you scrub clothing without soap it won't look good.")]),
 ('kiu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'insert'),
   ('eng', 'put in'),
   ('eng', 'insert'),
   ('tkp', 'go insait'),
   ('tkp', 'putim insait'),
   ('dcsv', 'true'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '18/Nov/2005'),
   ('ex', 'Vao kiupe evoa kepa.'),
   ('xp', 'Yu putim pik insait long house.'),
   ('xe', '???'),
   ('ex', 'Voea rutu kukueara kiuivere koatapaoro.'),
   ('xp', '???'),
   ('xe', 'They all will go head first, entering in.'),
   ('ex',
    'Pita koie kiurevoi kasiura sovaraia. Teapi aio kovo aioeve vosia ratau toupaeve.'),
   ('xp',
    'Pita i putim pik insait long banis. Nogut i kaikaim gaden sapos em i stap autsait.'),
   ('xe',
    'Peter put the pig inside of the fence. Unfortunately he woud eat the garden if he were outside.')]),
 ('kiupie',
  [('rt', 'kiu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'insert'),
   ('eng', 'put in first'),
   ('eng', 'insert first'),
   ('tkp', 'go insait'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '30/Apr/2006'),
   ('ex', 'Vavaea kiupieri rikui sovara iare.'),
   ('xp', '???'),
   ('xe', 'Thrust your hand into the hole.'),
   ('ex', 'Vavaea kiupie tarausisi iava oapa-ia.'),
   ('xp', 'Yu putim han i go insait long poket bilong trauses.'),
   ('xe', 'Put your hands inside the pockets of your trousers.'),
   ('ex', 'Evo turu kiupie rogori vavo-re.'),
   ('xp', 'Yu putim pastaim hap paia wut ya i go long hap.'),
   ('xe', '???')]),
 ('kiuto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of ant'),
   ('tkp', 'anis'),
   ('nt', 'has stinging bite'),
   ('sf', 'FAUNA.INSECT'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Kiuto rupato iro avupurato kasikasipievive.'),
   ('xp', 'Em anis em i save kaikai yumi.'),
   ('xe', '???'),
   ('ex',
    'Kiuto rupato ira vasipievira avu puraparoveira uvuipau ragauri rara viavureve.'),
   ('xp', 'Blakpela anis em inap kaikaim yu na bai yu inap krai.'),
   ('xe', 'The black ant ??? makes bites ???.')]),
 ('kiuve',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'tropical.ulcer'),
   ('eng', 'tropical ulcer'),
   ('tkp', 'bikpela sua'),
   ('nt', 'very large in size'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Kiuve kapua eva viia.'),
   ('xp', 'Yu gat bikpela sore em i stap long yu.'),
   ('xe', '???'),
   ('ex',
    'Kiuve kapuaro iava oisioa kopiipave voari tuariri uvare vuri kapuaro rutua.'),
   ('xp',
    'Bikpela sua tru ol i bin save dai long em long taim bipo bikos em ol sua nogut tru.'),
   ('xe',
    'From tropical ulcer sores they always died long ago because they are really bad sores.')]),
 ('kiuvu',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'wind'),
   ('tkp', 'win'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Riro varivarito kiuvu ira oisio-va urioparoveira Kireaka-ia.'),
   ('xp', 'Strongpela win i save kam olsem long Kereaka.'),
   ('xe', 'A very mighty wind always come this way ??? Kereaka.')]),
 ('koa',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'skin'),
   ('eng', 'bark'),
   ('eng', 'skin'),
   ('eng', 'peel'),
   ('tkp', 'raosim skin'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Rakariua koaivere evaova iava.'),
   ('xp', '???'),
   ('xe', 'They will take the bark off of the tree.'),
   ('ex',
    'Siriavi iria rakaria kaaevoi tegotoa iava uva vo iro-ia opita ogata tuuevoi.'),
   ('xp',
    'Siriavi em i pulim rop bilong wel banana na em i mekim woksak kokonat long em.'),
   ('xe',
    'Siriavi peeled the skin from the wild banana and sewed a coconut bag from the vine.')]),
 ('koai',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'clam'),
   ('tkp', 'sel'),
   ('tkp', 'kina'),
   ('nt', 'edible, found on the west coast of Bougainville'),
   ('dt', '01/Nov/2005'),
   ('ex',
    'Vo toupa avaka vogara savarain aiopa kave voea kasipai aue aicoro curapasa.'),
   ('xp',
    'Em i stap insade lang wetsan blong sol wara ol save kaikainu na makim kanag??.'),
   ('xe', '???'),
   ('ex',
    'Koai riro kaekaepato akoroto ira eisi toupareveira Kereaka-ia ira oupaiveira rera oripasia ra rera aiopaive.'),
   ('xp',
    'Koai em wanpela kain kina o sel i save stap long Kereaka. Ol i save kisim na kukim na kaikai.'),
   ('xe',
    'Koai is a kind of long clam that is found in Kereaka which they get in order to cook and they eat it.')]),
 ('koakoa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'plant'),
   ('tkp', '???'),
   ('sf', 'FLORA'),
   ('nt', 'has broad leaf like wild banana, used to make temporary shelter'),
   ('dt', '01/Nov/2005'),
   ('ex', 'Vosia avapau arua tuesia koakoa tokouvere tapo vagiva rakusia.'),
   ('xp', 'Taim yu go kisim kumu bai yu katim tu sampela ol wail banana lip.'),
   ('xe', 'When you go harvest vegetables, you also cut wild banana leaf.'),
   ('ex',
    'Koakoa oa guruvaroia kepapaupa vego keparo aio vagirotapo puropai riroara kovoaratoupai orueia koakoa.'),
   ('xp', 'Ol save ???iuol bush hause ?usav  isat pant work longeu.'),
   ('xe', '???'),
   ('ex',
    'Pita, rakariara koakoa eva ra evaova iava oara erakoera ra vara-ia tuitui kasivio.'),
   ('xp',
    'Pita, yu raosim ol skin bilong dwiai i bin drai diwai yet na bai yumi mekim paia long em.'),
   ('xe', "Peter, bark the tree ??? and from them we'll make fire.")]),
 ('koakoa',
  [('rt', 'koa'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'bark a tree'),
   ('ge', 'remove the skin'),
   ('tkp', 'rausim skin'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Rakariua koaivere evaova iava.'),
   ('xp', 'Bai ol i rausim skin bilong diwai.'),
   ('xe', 'They will take the bark off of the tree.')]),
 ('koara',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'put together'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('nt', 'Used when putting all of the songs on a recording tape.'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Vara rutu koaraavoi.'),
   ('xp', '???'),
   ('xe', 'I am putting them all together.'),
   ('ex',
    'Kova koara tovoivoi teipi ovaraia oisio ravara uvupaive teipi gaupiepaoro.'),
   ('xp',
    'Ol i putim planti singsing insait long tepe olsem bai ol i save harim taim ol i save pleim tepe.'),
   ('xe', '???')]),
 ('koarao',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'bird'),
   ('tkp', 'pisin'),
   ('eng', 'Yellow-eyed Cuckoo-shrike (Coracina lineata)'),
   ('sa', 'urieva'),
   ('dt', '03/Nov/2005'),
   ('ex',
    'Koarao kokiova iria iava kesievira toupasiveira osireitoarei ovaisivu iava tapo oira vaisipaiveira oisio urieva.'),
   ('xp',
    'Koarao em nem bilong wanpela pisin tupela ae bilong em i yelo na ol i save kolim em tu long narapela nem, urieva.'),
   ('xe', '???')]),
 ('koaraua',
  [('rt', 'koara'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'shelf'),
   ('tkp', '???'),
   ('eng', 'shelf'),
   ('eng', 'platform for keeping things up off of the ground'),
   ('eng', 'table'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Koaraua voa uva uvuipa aio tovipau.'),
   ('xp', 'Long hap yu can putim ol samhing.'),
   ('xe', 'On this table you can put food.'),
   ('ex',
    'Koaraua oa ivaraia oaravu tovopaiveira. Teapi rasito vara vuripiepareve.'),
   ('xp',
    'Tebol ol i save putim ol samting antap long em. nogut graun i bagarapim.'),
   ('xe', '???')]),
 ('koarava',
  [('rt', 'koara'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'table'),
   ('tkp', 'tebol'),
   ('cmt', 'How do koaraua and koarava differ?'),
   ('dt', '04/Nov/2005'),
   ('ex',
    'Vouva uvuipa aiopasia rera ivaraia ora kiropasia kaarava ivaraia tebol.'),
   ('xp', 'Hap yu can kaikai long na yu kan writer antap long en.'),
   ('xe', '???'),
   ('ex', 'Koarava iria ivaraia aiopaiveira ora rigatopaiveira.'),
   ('xp', 'Tebol ol i save kaikai antap long em, na rait antap long em.'),
   ('xe', 'The table, they eat and write on top of it.')]),
 ('koasio',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'clam'),
   ('tkp', '???'),
   ('nt',
    'edible clam like koai only longer, found on the west coast of [MISSING]'),
   ('dt', '26/Apr/2006'),
   ('sa', 'koai'),
   ('cmt', 'Are koasio and koai the same?'),
   ('ex',
    'Vo vaisirei koai ora koasio kataitoa raga-ia ira eisi toupareveira Kereaka-ia.'),
   ('xp',
    'Tupela nem i stap long wanpela kina tasol. Em koai na koasio, dispela i save stap long Kereaka.'),
   ('xe',
    "The two names 'koai' and 'koasio' refer to the same thing, which is found in Kereaka.")]),
 ('koata',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'enter'),
   ('tkp', 'go insait'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('sc', 'MOTION'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Pupipaoro koatapaiera.'),
   ('xp', '???'),
   ('xe', 'While dancing we (exclusive) entered (the village).'),
   ('ex',
    'Pita koata kepa siovara iare oisio ravoa uusiro uvare viapau uvuipa ra ratau uusiro. Ari uvuipa ra koataro voa usisia kepa-ia.'),
   ('xp',
    'Pita em i go insait long haus olsem bai em i slip long ap. Bikos em i no inap slip autsait, tasol em i nap i go slip insait long haus.'),
   ('xe', 'Peter went inside of the house ???.')]),
 ('koatapie',
  [('rt', 'koata'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'put in'),
   ('ge', 'accept'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Jisas koatapieri.'),
   ('xp', '???'),
   ('xe', 'Accept Jesus into (your life).'),
   ('ex',
    'Ragai kovoa koatapieava vo voisi oa-ia oa iava uvuiparai ra vearovira vokovo purapa.'),
   ('xp',
    'Mi bin putim tru wok long hat bilong mi olsem na mi save mekim gut wok.'),
   ('xe', '???')]),
 ('koauve',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'snail'),
   ('tkp', 'bosi'),
   ('nt', 'large brown type found in the jungle'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Vo topaiveira vegoaro vurivuri kekevavi oira koauve.'),
   ('xp', 'Enu i save istap long bik bus em i luk braun.'),
   ('xe', '???'),
   ('ex', 'Koauve toupaeveira eisi tura vego.'),
   ('xp', 'Snel i save i stap long bus.'),
   ('xe', "The 'koauve' snail lives in the jungle."),
   ('ex',
    'Koauve vurivuri kekeva iria tuura vegoro sovaraia toupaeveira. Viapau uvuipai ra vovokio oira kekeri vo atoia.'),
   ('xp',
    'Koauve em wanpela kain snel i save stap long bik bus. Em i save luk braun, na tu yu no inap lukim nao long ples tude.'),
   ('xe', '???')]),
 ('koavaato',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'betel.vine'),
   ('tkp', 'daka'),
   ('eng', 'betel pepper vine fruit'),
   ('dt', '14/Nov/2005'),
   ('ex',
    'Koavato ira tovopaiveira akoroa-ia, ra ita vara rutu aiopaive, ra aasiva turuepe.'),
   ('xp', 'Ol i save kaikai daka wantaim kabang, em nau bai meme buai i red.'),
   ('xe',
    'They put betel vine with lime powder and they eat them, and the betel nut is red.'),
   ('ex', 'Koavato ira tapo aasi aiopaveira voeao papua niu gini pairara.'),
   ('xp', 'Daka ol man bilong Papua Niu Gini i save kaikai buai wantaim.'),
   ('xe',
    'Betel nut vine is something the people of Papua New Guina eat with betel nut.')]),
 ('koe',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'spoon out a solid'),
   ('tkp', 'wawan'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Aio koko koepaevora aako spun-ia va aiopaoro.'),
   ('xp', '???'),
   ('xe', 'Mother spooned out the portion of food while eating it.'),
   ('ex', 'Upiriko koko koepae Maria koetoa-ia aiopaoro.'),
   ('xp', 'Maria i kaikai plet kaukau long spun.'),
   ('xe', 'Maria is spooning ??? eating ???.'),
   ('ex', 'Aakova sipuru-ia aio koko koepae va aiopaoro.'),
   ('xp', 'Mama em i spunim pelet kaikai long sipun na kaikaim.'),
   ('xe', 'Mother dishes out a plate of food with a spoon and eats it.')]),
 ('koea',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'swing'),
   ('ge', 'choir'),
   ('tkp', 'ol i singsing'),
   ('cmt', "Ex. sentence doesn't contain entry."),
   ('dt', '04/Nov/2005'),
   ('ex', 'Erapa uvuru oea erapai Pauto vaisiaro kaepiepaoro.'),
   ('xp', 'Grup i sing na litimapim nem bilong God.'),
   ('xe', '???'),
   ('ex', 'Ruruvupa kakaero koa-ia uiriaepa eisi Vakora-ia.'),
   ('xp',
    'Ol pikinini bilong Ruruvu i bin win taim ol i bin sing long Wakunai.'),
   ('xe', 'The Ruruvu children won at singing in Wakunai.')]),
 ('koekoe',
  [('rt', 'koe'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'spoon out'),
   ('tkp', 'spunim'),
   ('rdp', 'full'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '04/Nov/2005'),
   ('cmt', 'Is gloss right?'),
   ('ex', 'Koekoe ukoa koekoeri eira-ia kapu.'),
   ('xp', 'Yu kisikisim wara long dispela kapu.'),
   ('xe', '???'),
   ('ex', 'Pita ukoa koekoereva kapu-ia.'),
   ('xp', 'Pita i bin rausim wara long kap.'),
   ('xe', 'Peter removed the water with a cup.'),
   ('ex', 'Pita, araisi koekoeri eera-ia sipuru ra aiori.'),
   ('xp', 'Pita, yu spunim rais long sipun ia na bai yu kaikaim.'),
   ('xe', 'Peter, dish out the rice with a spoon and eat (it).')]),
 ('koekoeto',
  [('rt', 'koekoe'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'type of ant'),
   ('tkp', 'anis'),
   ('dt', '01/Dec/2004'),
   ('ex',
    'Koekoeto ira oisioa kepa sioparo-ia rasito vuripiereve vosia viapau oiraravai toupaive.'),
   ('xp',
    'Em binatang i save bagarapim graun insait haus, taim ol man i no istap.'),
   ('xe', '???')]),
 ('koepato',
  [('rt', 'koe'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'spoon'),
   ('tkp', 'spun'),
   ('avm', 'pa_required'),
   ('sc', 'INSTRUMENT'),
   ('dt', '14/Nov/2005'),
   ('ex', 'Arua rooviro tapipato koepato.'),
   ('xp', 'Spun bilong bringim ol sup kumu.'),
   ('xe', '???'),
   ('ex',
    'Vosia viapau sipuruaravai ra viapau uvuipavio tauoara iava kasiraopa aioro kaepiepasia tauoara iava.'),
   ('xp',
    'Sapos i nogat sipun i stap na bai yumi no inap kisim hotpela kaikai long ol pelet.'),
   ('xe', "If there are no spoons, we can't ???.")]),
 ('koeta',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'mature'),
   ('ge', 'grow'),
   ('ge', 'ripen'),
   ('tkp', '???'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('nt', 'The word vioroi is now used instead.'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Upiriko kovo koetaei.'),
   ('xp', '???'),
   ('xe', 'The sweet potato garden is mature.'),
   ('ex', 'Upiriko kovo koetai ra vo kovo eripe.'),
   ('xp', 'Gaden kaukau i orait pinis bai yumi digim.'),
   ('xe', 'The sweet potato garden is ripe and can it can be harvested.')]),
 ('koetaova',
  [('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'arrange'),
   ('eng', 'arrange marital exchange'),
   ('tkp', 'stretim marit'),
   ('vx', '???'),
   ('dt', '08/Dec/2006'),
   ('ex', 'Aue koetaovapare aiteto ra oratuutuukoave riakorirei-ia.'),
   ('xp', 'Papa i wok long stretim gut tru bai ol i senis long tupela meri.'),
   ('xe',
    'Father arranged things and they will make an exchange with the two women.')]),
 ('koetapie',
  [('rt', 'koeta'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'investigate'),
   ('ge', 'satisfy curiosity'),
   ('tkp', 'painim aut gut'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '04/Nov/2005'),
   ('ex',
    'Riakova koetapiesa uturoepa ato-ia, uvare rera vo kovoaro-ia aioara.'),
   ('xp', '???'),
   ('xe',
    'The man followed the woman to confirm or satisfy his suspicions that she [???].'),
   ('ex',
    'Rakisi koetapiesia rutu utura atoia uvare utuva erievo ragai oiraaro.'),
   ('xp',
    'Rakisi i bin digim yam bilong mi, na mi painim em aut gut long ples.'),
   ('xe', 'Rakisi ??? because ???.'),
   ('ex',
    'Riakova koetapiesia uturoepa oirato uvare rera vo kovoaro kavirueva.'),
   ('xp',
    'Man i bin bihainim meri i go bilong painim aut gut bikos em i bin stilim gaden kaikai bilong em.'),
   ('xe', 'The man trailed the woman because she stole (from) his garden.')]),
 ('koetava',
  [('alt', 'koeta'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bow'),
   ('tkp', 'banara'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Koeta iria-ia kora ritaapaveira ora aue tapo kokio.'),
   ('xp', 'Bonara ol i save sutim kapul na pisin.'),
   ('xe', 'With a bow they shoot possums and birds.'),
   ('ex', 'Koetava iria-ia orekerovu rutu ritapaiveira.'),
   ('xp', 'Banara ol i save sutim planti samting long em.'),
   ('xe', 'With bows they shoot many things.')]),
 ('koetavira',
  [('alt', 'koetapavira'),
   ('rt', 'koeta'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'absolutely'),
   ('ge', 'satisfying'),
   ('tkp', '???'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Evao rao-ia koetavira pituri.'),
   ('xp', '???'),
   ('xe', 'Hold the branch absolutely securely.'),
   ('ex', 'Rarairi, koetavira itara pitupaoro opita atoupau.'),
   ('xp', 'Rarairi, yu holim strong taim yu daunim kokonas.'),
   ('xe', 'Rarairi, ???.')]),
 ('koeto',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'sand crab'),
   ('tkp', 'kuka'),
   ('dt', '08/Feb/2005'),
   ('ex', 'Koeto ira vo toupareveira rogararo.'),
   ('xp', 'Kuka i save i stap long wet san.'),
   ('xe', 'The crab lives on wet sand.')]),
 ('kogo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'cut'),
   ('ge', 'chop'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Tuariripa irara oisoa evaoara kogopaive vara toepaoro aue-ia.'),
   ('xp', '???'),
   ('xe', 'People of long ago always chopped trees cutting them with stones.'),
   ('ex',
    'Tuariripairara oea oisioa evaoara kogopaive aue-ia avekeva torara.'),
   ('xp', 'Ol man bilong bipo ol i save katim diwai long ston tamiok.'),
   ('xe', '???')]),
 ('kogova',
  [('rt', 'kogo'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'stone axehead'),
   ('tkp', 'akis'),
   ('dt', '01/Dec/2004'),
   ('ex', 'Kogova torara avekeva tuariri pairara iriaia oiso a toepaova.'),
   ('xp', 'NO DATA.'),
   ('xe', '???'),
   ('ex', 'Kogova tuariripairara oiraro torara iria-ia oisioa toepaive.'),
   ('xp',
    'Tamiok ston bilong ol man bilong bipo ol i save yusim long katim diwai.'),
   ('xe', '???')]),
 ('koi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'high pitched sound'),
   ('tkp', 'noisi antap'),
   ('sc', 'EMISSION.SOUND'),
   ('vx', '1'),
   ('nt', 'archaic'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Peri koiivo aue-ia araisi ovusia uusipaoe.'),
   ('xp', 'Ol i bin kaikai rais taim Peni i bin wok long slip.'),
   ('xe', '???')]),
 ('koie',
  [('ps', 'N'),
   ('pt', 'ANIM'),
   ('ge', 'pig'),
   ('tkp', 'pik'),
   ('dt', '26/Sep/2006'),
   ('ex', 'Koie uporevo Pita uva oira aiorevo rera raga aruvea.'),
   ('xp', 'Pita em i kilim pik na em yet i kaikaim asde.'),
   ('xe', 'Peter killed a pig and he himself ate it yesterday.'),
   ('ex', 'Koiea ouavoi va aiosia uvare vurivira rutu ogoeparai.'),
   ('xp', 'Hap pik mi kisim bilong kaikai bikos mi hangere nogut tru.'),
   ('xe',
    'I have gotten the pig in order to eat it because I am terribly hungry.')]),
 ('koie',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'act like a pig'),
   ('tkp', 'wokabaut olsem pik'),
   ('cmt',
    'Is there a verb meaning "get pig to eat/kisim pik", as Firchow\'s dictionary suggests?'),
   ('vx', '1'),
   ('dt', '22/Nov/2005'),
   ('ex',
    'Aretei koiepaoi vavaeareia vokapaoro uvare tuukeru oirare upiapaoe vavaeareia vokapaoro.'),
   ('xp',
    'Aretei em i wokabaut olsem pik long tupela han bikos beksait bilong em i pen.'),
   ('xe', '???')]),
 ('koike',
  [('ps', '???'),
   ('ge', 'left'),
   ('tkp', 'kais'),
   ('tkp', 'lephan'),
   ('dt', '04/Nov/2005'),
   ('cmt',
    "What's the POS? Third example has rutu between vearo and paveira. Is that right?"),
   ('ex', 'Aretua koike eisi-va rera Ruruvu.'),
   ('xp', 'Aretu em i kaisman bilong Ruruvu.'),
   ('xe', 'Aretua is to the left of Ruruvu.'),
   ('ex', 'Koike viia koiketo koikevira toepari.'),
   ('xp', 'Yu kais man yu katim long left side.'),
   ('xe', '???'),
   ('ex', 'Era koike ira vearo rutu paveira vo vavaere-ia tapo kovopasia.'),
   ('xp', 'Dispela han kais man ia em gutpela tru long wok long han wantaem.'),
   ('xe', '???'),
   ('ex',
    'Rapiriria koike ira eisi toupareveira Uriora. Ira koike vavae iava kovopareveira.'),
   ('xp',
    'Rapiriri em i kais man ne em i save stap long uriora. Em i save wok long han kais.'),
   ('xe',
    'Rapiriri is a left-hander who is usually in Uriora. He works using his left hand.')]),
 ('koiketo',
  [('rt', 'koike'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'left-handed person'),
   ('tkp', 'kaisman'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Pita era koiketo.'),
   ('xp', 'Pita em i kaisman.'),
   ('xe', 'Peter is left-handed.'),
   ('ex',
    'Rapiriria koiketo ira uvuipa ra koike vavae-ia vii uporeve ovusia keapau oisio ira ragai upopare toepa vavae-ia.'),
   ('xp',
    'Rapiriri em i han kais man, em inap paitim yu long han kais, taim yu ting bai em i paitim yu long rait han.'),
   ('xe',
    'Rapiriri is a left-hander and he can hit you with his left hand when you think wrongly that he will hit me with his right hand.')]),
 ('koikevira',
  [('alt', 'koikepavira'),
   ('rt', 'koike'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'off.course'),
   ('eng', 'off course'),
   ('eng', 'off base'),
   ('eng', 'off track'),
   ('tkp', 'olsem i no stret'),
   ('tkp', 'abrusim rot'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Koikevira va uvuta.'),
   ('xp', '???'),
   ('xe', "You didn't hear it quite right."),
   ('ex', 'Koikevira koveroe Andru.'),
   ('xp', 'Andru i pundaun long kais sait.'),
   ('xe', 'Andru fell on his left side.'),
   ('ex', 'Koikevira reoreoparoe avukato vosia rasireo-ia orareopaae.'),
   ('xp',
    'Lapun man i toktok olsem i no stret taim ol i mekim toktok bilong graun.'),
   ('xe', 'The old man spoke ??? when they discussed land issues.')]),
 ('koikoi',
  [('rt', 'koi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'groan with pain'),
   ('tkp', 'krai wantaim pein'),
   ('vx', '1'),
   ('dcsv', 'true'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Eake-pa koikoipari?'),
   ('xp', 'Long wanem yu singaut?'),
   ('xe', 'For what reason are you groaning?'),
   ('ex', 'Raki koikoiparevoi upiapaoro.'),
   ('xp', 'Raki i singaut taim em i sik.'),
   ('xe', "Rai groans with pain when he's sick."),
   ('ex', 'Oirato upiapaoro koikoipare.'),
   ('xp', 'Man i sik na em i krai wantaim pein.'),
   ('xe', 'The man is sick and crying out from pain.')]),
 ('koikoipato',
  [('rt', 'koikoi'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('dx', 'Pipipaia'),
   ('ge', 'groaner'),
   ('ge', 'whiner'),
   ('tkp', 'man i wok long krai wantaim pen'),
   ('avm', 'pa_required'),
   ('dt', '04/Nov/2005')]),
 ('koikoipie',
  [('rt', 'koikoi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make groan with pain'),
   ('tkp', 'mekim krai wantaim pen'),
   ('vx', '2'),
   ('dt', '16/Nov/2005'),
   ('ex', 'Jon riakova koikoipierevoi uvare oira uporevoi.'),
   ('xp', 'Jon i mekim meri singaut taim em i paitim em.'),
   ('xe',
    'John is making the woman groan with pain because he is hitting her.'),
   ('ex', 'Upia avuva koikoipieiraopaivo vokiaro.'),
   ('xp', 'Sik i mekim bubu i gron long nait.'),
   ('xe', 'The sickness made grandmother groan with pain during the night.')]),
 ('koikoito',
  [('rt', 'koikoi'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'orphan'),
   ('tkp', 'nogat papamama'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Koikoito kakaeto ro ira arova kopiisi akovatoarei.'),
   ('xp', 'Pikinini em papamama i dai na lusim.'),
   ('xe', 'An orphan boy is one who the parents died without.'),
   ('ex', 'Koikoito ro ira arova kopisi akorivatoa-re ra retoriva.'),
   ('xp', 'Man i no gat papamama tasol ol lukautim.'),
   ('xe', '???')]),
 ('koisi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'honey'),
   ('tkp', 'hani'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Koisi aiopava oira ukoaro-ia ukaiopareira.'),
   ('xp', 'Ol i save dringim wara blong em.'),
   ('xe', '???'),
   ('ex', 'Koisi rovu oa-ia ukaiopaaveira.'),
   ('xp', 'Hani ol i save dringim wara bilong em.'),
   ('xe', 'Honey water, they drink from it.')]),
 ('koisiva',
  [('rt', 'koisi'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'honey bee'),
   ('tkp', 'binen'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Koisiva koisiva ragaiia pituevoi papaei koisiva.'),
   ('xp', 'Bee em i fly n em holim mi..'),
   ('xe', '???'),
   ('ex', 'Koisiva iria papapaeveira.'),
   ('xp', 'Hani bii isave palae nabaut.'),
   ('xe', '???'),
   ('ex',
    'Koisiva iria vearoa purapaeveira uukoa, oa iava uvuipauei ra sipareoara rutu veaveari.'),
   ('xp',
    'Hani bi i save kamapim switpela wara, dispela wara yu nap likim tru ol pinga bilong yu long em.'),
   ('xe',
    'The honey bee makes honey, which you can like from your fingers.')]),
 ('koivira',
  [('alt', 'koipavira'),
   ('rt', 'koi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'soprano'),
   ('ge', 'high pitch'),
   ('tkp', 'nois antap'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Riakora koivira kovapai erapaoro.'),
   ('xp', 'Ol meri ol sing antap taim ol i singsing.'),
   ('xe', 'The women sing soprano while singing.'),
   ('ex', 'Sera koivira kovapaoi.'),
   ('xp', 'Sera em i sing antap.'),
   ('xe', 'Sera sings soprano.')]),
 ('koka',
  [('ps', 'V'),
   ('pt', 'B'),
   ('eng', 'agree'),
   ('ge', 'agree to meet with'),
   ('tkp', 'tok orait long bung'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '01/Aug/2005'),
   ('ex', 'Oirato riakova kokarevo uva rera iare avao vokiaro.'),
   ('xp', 'Man i pasim tok wantaim meri na meri i go long em long nait.'),
   ('xe', '???')]),
 ('kokai',
  [('ps', 'N'),
   ('pt', 'ANIM'),
   ('ge', 'rooster'),
   ('tkp', 'kakaruk'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kokai ira gaupareveira avito kekepaoro.'),
   ('xp', 'Kakaruk man i save krai taim em i lukim tulait.'),
   ('xe', 'The rooster cries as the sun appears.'),
   ('ex', 'kataitoarei kokai vaio'),
   ('xp', 'tupela kakaruk tasol'),
   ('xe', 'two chickens')]),
 ('kokara',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'tempt'),
   ('ge', 'try'),
   ('ge', 'test'),
   ('tkp', 'traim'),
   ('cm', '-ia'),
   ('arg', 'OBL COMP'),
   ('vx', '2'),
   ('cmt', 'Follow up on control structure?'),
   ('dt', '17/Jul/2005'),
   ('ex', 'Urioparoepa oira-ia kokarapaoro oiso-re ra va uvupaeve.'),
   ('xp', '???'),
   ('xe', 'He came tempting her so that she would listen to it (do it).'),
   ('ex',
    'Pita Dorin-ia kokaraparoe oisio ra vearo vaisi puraeve ra vuria purasi.'),
   ('xp',
    'Pita i traim Dorin bai em i tok orait bai tupela i wokim pasin nogut.'),
   ('xe', 'Peter tested Dorin, if she said okay, they would fool around.')]),
 ('kokaraa',
  [('rt', 'kokara'),
   ('ps', '???'),
   ('ge', 'temptation'),
   ('tkp', 'du'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Wesli koveroe kokaraa-ia.'),
   ('xp', 'Wesli i pundaun long traem.'),
   ('xe', 'Wesley fell to temptation.'),
   ('ex', 'Kokoraa -via kokaraa era pita -via kokaraa paoi eira sara.'),
   ('xp', 'Pita eni taram yu/sara eni taram yu.'),
   ('xe', '???')]),
 ('kokarapato',
  [('rt', 'kokara'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'Satan'),
   ('ge', 'evil spirit'),
   ('ge', 'tempter'),
   ('tkp', 'seten duman'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Tugarato kokarapato ira oirara kovepiepareveira vuria iare.'),
   ('xp', 'Evil spirit em i save daunim ol man i go long sin.'),
   ('xe', 'The tempter spirit always makes people fall into evil.'),
   ('ex', 'Satan kokarapato via/tugavato kokavapato.'),
   ('xp', 'Satan em man blong taram yu eni spirit eni taim yu.'),
   ('xe', 'Satan is a ???.')]),
 ('koke',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make rain'),
   ('tkp', 'mekim ren'),
   ('vx', '1'),
   ('cmt', 'Other examples?'),
   ('dt', '04/Nov/2005'),
   ('ex', 'Kokerivere.'),
   ('xp', 'Yu bai mekim ren i pundaun.'),
   ('xe', 'You will make it rain.'),
   ('ex', 'Voari tuariri uva oisio purapaveira Sirovisia koke purapato.'),
   ('xp',
    'Long taim bipo ol i save tok olsem Sirovisi em man bilong mekim ren.'),
   ('xe', 'Long ago they said that Sirovisi was a rain maker.')]),
 ('kokee',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'peek through a blind or crack'),
   ('tkp', '???'),
   ('cm', '-re'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Kakaevure kokeepaiva kepa iava.'),
   ('xp', '???'),
   ('xe', 'The children peeked through the blind of the house.'),
   ('ex', 'Jon kokeeparevoi oirara-re ovusia orareopai.'),
   ('xp', 'Jon i lukluk long hol bilong wol long ol man taim ol i miting.'),
   ('xe', 'John peeked at people while they talked.')]),
 ('kokepato',
  [('rt', 'koke'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'rainy month'),
   ('tkp', 'mun bilong ren'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Kokepatoa utavai aue Jun.'),
   ('xp', 'Mun Jun em i taim bilong rein.'),
   ('xe', '???')]),
 ('kokepato',
  [('rt', 'koke'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'rain-maker'),
   ('tkp', 'man bilong mekim rein'),
   ('ex', 'Juna kokepato ira siovaraia raivara rutu vuriera.'),
   ('xp', 'Jun em mun bilong rein insait long em ol rot i bagarap.'),
   ('xe', '???'),
   ('dt', '04/Nov/2005')]),
 ('kokepie',
  [('rt', 'koke'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'make.rain'),
   ('tkp', 'mekim ren'),
   ('eng', 'make rain'),
   ('vx', '???'),
   ('cmt',
    'What is <osa> doing in the ex. sentence? And where is the direct object?'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Tuariripairara osa kokepiepaive ora voea-re.'),
   ('xp', 'Ol man bilong bipo ol i save mekim ren long ol yet.'),
   ('xe', 'The ancestors made it rain on themselves.')]),
 ('kokerao',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'opossum with black and white hair'),
   ('ge', 'mottled'),
   ('ge', 'spotted'),
   ('tkp', 'kapul i gat mak'),
   ('dt', '12/Feb/2005'),
   ('ex',
    'Kokerao korato kokerao rupa kopikopi pato ara aue popote (pogara).'),
   ('xp', 'Kapur em i gat makmak blak na waitpela garas bilong en.'),
   ('xe', '???'),
   ('ex', 'Koorato kokeraopato, ira-ia toupai aue rupa ora aue popote.'),
   ('xp', 'Kapul i gat makmak bilak na waitpela garas.'),
   ('xe', 'On the kokeraopato possum there is black and white.')]),
 ('kokeriva',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'rain'),
   ('tkp', 'ren'),
   ('dt', '15/Apr/2006'),
   ('cmt', "How does kokeriva relate to kokeva? What's the plural?"),
   ('ex', 'Kokeriva igei-re koveoe osia karepaie vokiaro.'),
   ('xp', 'Ren em i pundaun long mipela long rot taim mipela kam long nait.'),
   ('xe', 'The rain fell on us as we returned at night.')]),
 ('kokeu',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'shellfish'),
   ('tkp', 'kina kramsei'),
   ('nt',
    'It has a blue shell inside and out and is found in saltwater along the coast. This name preferred to asiraue which also means blue shellfish.'),
   ('cmt', 'Is this a sentence?'),
   ('dt', '04/Nov/2005'),
   ('ex',
    'Kokeu iria toupaeveira vo ukoviro vituaro-ia avakava-ia uva veauaro toupaiveira.'),
   ('xp',
    'Sel i save stap long as bilong wara long solwara em i save stap long arere bilong wara.'),
   ('xe', '???')]),
 ('kokeva',
  [('rt', 'koke'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'rain'),
   ('tkp', 'ren'),
   ('dt', '12/Jan/2006'),
   ('ex', 'Kokeva vokiara rutu-ia kovepaoi.'),
   ('xp', '???'),
   ('xe', 'It rains every day.'),
   ('ex', 'Kapi, uriou kokeva koveoi.'),
   ('xp', 'Kapi, yu kam, ren i pundaun nau.'),
   ('xe', 'Kapi, come, the rain is falling.'),
   ('ex', 'Vaiterei siovaraia kekira vaioa kokeva kovepaora.'),
   ('xp', 'Insait long tupela mun ren i bin wok long pundaon.'),
   ('xe', 'For two months it has rained.')]),
 ('koki',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'chisel out'),
   ('ge', 'chip away'),
   ('tkp', 'boaim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Rivutevira raga vo eva kokipareve.'),
   ('xp', '???'),
   ('xe', 'He would be chipping away at the tree just like a wood borer.'),
   ('ex', 'Timoti ora Jon topiruu kokipasivoi.'),
   ('xp', 'Timoti na Jon i digim hol long kaku.'),
   ('xe', 'Timothy and John chiselled away ???.')]),
 ('kokio',
  [('ps', 'N'),
   ('pt', 'ANIM'),
   ('ge', 'bird'),
   ('tkp', 'pisin'),
   ('nt', 'generic term'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '27/Sep/2006'),
   ('ex', 'Kokiova gaupaevoi.'),
   ('xp', 'Pisin i krai.'),
   ('xe', 'The bird is crying.'),
   ('ex', 'Kokioto iarevoi aue Visiai.'),
   ('xp', 'Visai i sutim pisin.'),
   ('xe', 'Visiai shot a bird.'),
   ('ex', 'Ragai kokio taraparai okarei ritaravo karei orisia.'),
   ('xp', 'Mi painim pisin na bai mi sutim sampela na kukim kaikai.'),
   ('xe', '???')]),
 ('Kokipaia',
  [('rt', 'kokipaia'),
   ('ps', '???'),
   ('ge', 'name of village'),
   ('tkp', 'ples nem'),
   ('dt', '18/Feb/2004')]),
 ('kokito',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'ear canal'),
   ('tkp', 'yau'),
   ('tkp', 'ia'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kokito ira iava reoara uvuupapeira.'),
   ('xp', 'Yumi save harim tok long ia.'),
   ('xe', 'Through the ear, words are heard.')]),
 ('kokivira',
  [('alt', 'kokipavira'),
   ('rt', 'koki'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'unobstructed hole'),
   ('tkp', 'go insait stret'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Evaova kokivira toupaivoi.'),
   ('xp', 'Diwai i gat hol.'),
   ('xe', 'The tree has holes.'),
   ('ex',
    'Vavaviri kokivira araisi koko aiorevo uva vuruvuru-ia rikua toupaivo.'),
   ('xp',
    'Vavaviri em i kaikaim pleit rais long namel. Na namel bilong rais i gat hul.'),
   ('xe', 'Vavaviri ??? and there is a hole in ???.')]),
 ('koko',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'pour'),
   ('eng', 'pour'),
   ('eng', 'serve'),
   ('eng', 'dish out'),
   ('eng', 'portion out'),
   ('tkp', 'kapsaitim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '18/Nov/2005'),
   ('ex', 'Aio kuvuro kokopaivere oirara tauoara-ia vara.'),
   ('xp', '???'),
   ('xe', 'The people will dish out the bundles of food onto limbum plates.'),
   ('ex', 'Vutevi aio kokoro purae tauoara-ia.'),
   ('xp', 'Vutevi i wokim kaikai long ol pelet.'),
   ('xe', 'Vutevi made a plate of food.')]),
 ('koko',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'flower'),
   ('tkp', 'plaua'),
   ('cmt', 'Translated by Firchow as "open flower". Need to check.'),
   ('dt', '10/Nov/2005'),
   ('ex', 'Kokoa rovopai karupaoro.'),
   ('xp', 'Palaua i stat long op.'),
   ('xe', 'The flower is starting to open up.')]),
 ('kokoi',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'type of banana'),
   ('tkp', 'liklik han taro'),
   ('nt', 'eating variety, short thick type'),
   ('pt', 'itoo'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokoi vioropava.'),
   ('xp', 'Banan i save mao.'),
   ('xe', '???'),
   ('ex',
    'Kokoi itoova vaisiaro otekupava iria vioropiepaiveira oira aiopasia.'),
   ('xp',
    'Kokoi em nem bilong wanpela banana ol i save mekim mao na kaikaim.'),
   ('xe', '???')]),
 ('kokoisi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'sweat'),
   ('ge', 'perspire'),
   ('tkp', 'swet'),
   ('vx', '1'),
   ('sc', 'BODILY.PROCESS'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokoisi aisia ruvira toupai.'),
   ('xp', 'Palaoan karamap yet.'),
   ('xe', '???'),
   ('ex',
    'Aretei ikaupaoro voreoe eisi-va sikurua. Uva rirovira rutu kokoisioe vova kagave.'),
   ('xp',
    'Aretei em i ran bek i kam long skul. Na tuat i kamap long pes bilong em.'),
   ('xe', 'Aretei ran back from school. And her face was very sweaty.')]),
 ('kokokoru',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'flower bud'),
   ('tkp', 'plaua i pas'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Ragai kokokoru vurapaavo orua karupaivoi rara ravireo kaeroviro.'),
   ('xp', 'Mi lukim karamap plaua, bai open taim san i kam antap.'),
   ('xe', 'I looked at the flower bud ???.')]),
 ('kokoo',
  [('ps', 'CLASS'),
   ('ge', 'plateful'),
   ('tkp', 'kaikai long plait'),
   ('eng', 'food on a plate'),
   ('dt', '16/Mar/2005'),
   ('ex', 'Ragai aio kokoo aiopaavoi tauo iava.'),
   ('xp', 'Mi kaikaiim kaikai stap long plait.'),
   ('xe', "I'm eating the food from a plate."),
   ('ex', 'Sera aio kokoo taepae tauo-ia.'),
   ('xp', 'Sera em i karim kaikai long plet.'),
   ('xe', 'Sera is ??? a plate of food.'),
   ('ex', 'Dokas aio kokoo kaepaoro uriopaoi.'),
   ('xp', 'Dokas i karim pelet kaikai i kam.'),
   ('xe', 'Dokas is coming carrying a plate of food.')]),
 ('kokooko',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', '???'),
   ('tkp', '???'),
   ('cmt',
    'Check vowel length. Some type of plant, I think--see custom medicine story'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Purupuru kerari ro iava kokookoto oupaiveira sisiupara purapasia.'),
   ('xp',
    'Ol plaua bilong ol kain kain plant ol i save kisim bilong mekim marasin bilong was was.'),
   ('xe', '???')]),
 ('kokookoa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'open flower'),
   ('tkp', '???'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Vearoa rutu vavao kokookoa.'),
   ('xp', 'Naispela palaua tru.'),
   ('xe', '???'),
   ('ex', 'Kokookoa vearopie kekepapeira vosia karuvira toupaive.'),
   ('xp', 'Plaua i save luk nais tru taim i stap open.'),
   ('xe', '???')]),
 ('kokoote',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bird'),
   ('tkp', '???'),
   ('eng', "Baillon's Crake (Porzana pusilla)"),
   ('nt', 'Disinctive feature is its strange night cry'),
   ('sa', 'ekarako'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokoote iria vokiaro gaupaeveira.'),
   ('xp', 'Pisin i save krai long nait.'),
   ('xe', "The Baillon's Crake cries at night."),
   ('ex',
    'Kokoote kokiova iria rasiuaro raga vokapaeveira. Viapau uvuipaoi ra papapaeve uvare etekuvira putaearei toupaiveira.'),
   ('xp',
    'Kokoote i wanpela pisin i save wokabaut tasol long graun.Em i no inap plai bikos tupela wung bilong em i luk olsem sot.'),
   ('xe', '???')]),
 ('kokootu',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'chicken'),
   ('eng', 'chicken'),
   ('eng', 'hen'),
   ('tkp', 'kakaruk'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokootu iria takura koupaoveira.'),
   ('xp', 'Kakaruk meri i save bonim kiau.'),
   ('xe', 'Chikens lay eggs.'),
   ('ex', 'Kokootu ruipapaveira oirara rutu uvare vearopiea rutua varua.'),
   ('xp', 'Kakaruk olgeta man i save laikim tru bikos em i gutpela abus tru.'),
   ('xe', 'Everyone wants chicken because it is good meat.')]),
 ('kokopa',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'mudskippers'),
   ('tkp', '???'),
   ('nt', 'live in mangrove swamps'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokopa ira tuvuuara-ia toupareveira.'),
   ('xp', 'Em wanpela kain liklik palai i save istap long tais.'),
   ('xe', 'The mudskipper lives in swamps.'),
   ('ex',
    'Kokopa ira vaguru kouro-ia tuvuara-ia toupareveira. Ira ora vikiparoveira oisio rutu putaearava toupare.'),
   ('xp',
    'Kokopa i wanpela kain liklik plai i save stap insait long tais bilong mangur. I save kalap olsem em i gat wing.'),
   ('xe', '???')]),
 ('kokopakou',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'overgrown bush area'),
   ('tkp', 'bus'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokopakou oa upiriko kovo ruuvoi.'),
   ('xp', 'Liklik bus i karamapim gaden kaukau.'),
   ('xe', 'Growth covers sweet potato gardens.'),
   ('ex',
    'Vosia kaukau kovo purari, oire kasivarivira kokopa touro toetoepari. Teapi aio kovo vuripe rara kokopakou orapurape.'),
   ('xp',
    'Sapos yu mekim gaden kaukau, orait yu mas klinim olgeta bus olgeta taim. nogut bus i bagarapim gaden kaikai.'),
   ('xe', '???')]),
 ('kokopeko',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'unconscious'),
   ('ge', 'in a stupor'),
   ('tkp', 'i laik dai'),
   ('vx', '1'),
   ('dt', '22/Nov/2005'),
   ('sc', '???'),
   ('ex', 'Kokopeko ro oirato kokepekoparoi kopiragapaoro.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Raki kokopekoparoi oisio ra kopiiro.'),
   ('xp', 'Raki i wok long hap dai.'),
   ('xe', 'Raki is unconscious as if about to die.')]),
 ('kokopekovira',
  [('alt', 'kokopekopavira'),
   ('rt', 'kokopeko'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'unconsciously'),
   ('tkp', 'olsem i dai'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Raki kokopekovira pieparevoi ra kopiiro.'),
   ('xp', 'Raki i wok long hap dai, tasol bai em i dai yet.'),
   ('xe', 'Raki ???.')]),
 ('kokopeoto',
  [('rt', 'kokopeo'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of plant'),
   ('tkp', '???'),
   ('sf', 'FLORA'),
   ('nt', 'grass-like with green and white striped leaves'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Kokopeoto rerapauiveira kokovara guruvato ora popote ta raivaro.'),
   ('xp', 'Ol i save plaint?? Eni green pelaleares na eni white .'),
   ('xe', '???'),
   ('ex', 'Kokopeoto ira vegoaro toupareveira, ra popotevira kokookopareve.'),
   ('xp', 'Em wanpela kain plant bilong bus na i save gat waitpela palaua.'),
   ('xe', '???')]),
 ('kokopuoto',
  [('rt', 'kokopuo'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'butterfly'),
   ('ge', 'moth'),
   ('tkp', 'batapiai'),
   ('nt', "Is the 'ave' in the second ex sentence really just 'aue'?"),
   ('dt', '13/Nov/2005'),
   ('ex', 'Kokopuoto ira papaparevoi.'),
   ('xp', 'Batapalai i palai i go.'),
   ('xe', 'The butterfly flies.'),
   ('ex',
    'Kokopuoto kekeavo ave vearopieto ovusia kokoa iava uukovi ivuparevo.'),
   ('xp', 'Mi lukim naispela bataplai taim em i pulim wara bilong plaua.'),
   ('xe',
    'I saw a pretty butterfly while he pulled water from an open flower.'),
   ('ex', 'Kokopuo kare uuko rupipai purupuru kokokoro-ia.'),
   ('xp', 'Ol bata plai ol i pulim wara long ol plaua.'),
   ('xe', '???')]),
 ('kokopuovira',
  [('alt', 'kokopuopavira'),
   ('rt', 'kokopuo'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', '???'),
   ('tkp', 'olsem bata plai'),
   ('cmt',
    "What about kokopuvira? Are we dealing with two words? Meanings don't seem related."),
   ('dt', '14/Nov/2005'),
   ('ex', 'Kokopuovira rutu Tokii vurapaavoi.'),
   ('xp', 'Mi ken lukim Mt. Bagana longwe tru.'),
   ('xe', 'I look at Mt. Bagana ???.'),
   ('ex', 'Kokopuvira pukuiara vurapaavo vavoisio avakava-ia.'),
   ('xp', 'Mi lukim ol maunten longwe tru long nambis i luk liklik tru.'),
   ('xe', 'I looked at the mountains ???'),
   ('ex', 'Kokopuovira rutu papapava kekepaoviroi vavoisio virauaaro.'),
   ('xp', 'Balus i luk olsem stret bata plai taim em i wok long pilai antap.'),
   ('xe', '???')]),
 ('Kokopuru',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('dx', 'Central'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('ig', 'yes'),
   ('dt', '26/Jul/2005')]),
 ('kokopuvira',
  [('rt', 'kokopu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'hazy'),
   ('ge', 'bluish in appearace at a distance'),
   ('tkp', '???'),
   ('nt', 'As mountains appear with a blue hazy appearace in the distance.'),
   ('dt', '18/Feb/2004'),
   ('ex',
    'Vosa torepau pukuia kukuearoi ra kokopuovira vuropari tauai ruture ra kuravira kekepapiro vegoa.'),
   ('xp',
    'Taim yu sanap lonas antap long wanepela mauntan bai yu lukluk igo longwe tru bai bush bai lukbtue.'),
   ('xe', '???')]),
 ('kokora',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'last-born son'),
   ('tkp', 'las pikinini meri'),
   ('dt', '12/Feb/2005'),
   ('cmt', 'Is this used only for males?'),
   ('ex', 'Rearia kokora ovoio kavauto.'),
   ('xp', 'Reari em i lasbon boi.'),
   ('xe', 'Reari is the youngest boy, the last-born.')]),
 ('kokorai',
  [('rt', 'kokora'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'last-born daughter'),
   ('tkp', '???'),
   ('cmt', 'Is this term gender-specific?'),
   ('sf', 'KIN'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Ovoio kavauva rutu riakova.'),
   ('xp', 'Last pela pikinin meri.'),
   ('xe', '???'),
   ('ex', 'Betia kokorai, ovoio kavauva.'),
   ('xp', 'Betty em i las bon meri.'),
   ('xe', 'Betty is youngest, the last-born girl.')]),
 ('kokorato',
  [('rt', 'kokora'),
   ('ps', 'N'),
   ('pt', 'NT???'),
   ('ge', 'last born son'),
   ('tkp', 'las pikinini man'),
   ('cmt', 'Can the term kokorava be used?'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Kokorato ovoi kavauto rutu.'),
   ('xp', 'Laspela pikinin nom em born bihain tru.'),
   ('xe', 'The youngest is truly the last-born.'),
   ('ex', 'Rearia kokorato.'),
   ('xp', 'Reari em i las bon man.'),
   ('xe', 'Reari is the last-born.')]),
 ('kokori',
  [('ps', 'V'),
   ('pt', '???'),
   ('ge', 'spiralled'),
   ('tkp', 'bosi tantan'),
   ('cmt', '???'),
   ('vx', '???'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kokori purari oira kirooro.'),
   ('xp', 'Yu raitim o.'),
   ('xe', 'You write it in a spiral.')]),
 ('kokorivira',
  [('rt', 'kokori'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'spiralling'),
   ('tkp', 'tantan'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Kokorivira va purava kirooro.'),
   ('xp', 'Yu raitim samting i raunpela.'),
   ('xe', 'Write something in a spiral.')]),
 ('kokoro',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'crazy'),
   ('tkp', 'longlong'),
   ('eng', 'crazy'),
   ('eng', 'insane'),
   ('eng', 'foolish'),
   ('eng', 'stupid'),
   ('am', 'true'),
   ('vx', '1'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Sioro ira oisioa kokoroparo vokiaro vokapaoro ora kovapaoro.'),
   ('xp', 'Sioro em i save rongrong na raun long nait na singsing.'),
   ('xe', 'A crazy peron is one who runs around at night singing.'),
   ('ex', 'Kokoropaoi rirovira upiapaoro.'),
   ('xp', '???'),
   ('xe', 'She is out of her head being very sick.')]),
 ('kokoroki',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'earring worn in the ear lobe'),
   ('ge', 'ring for finger or nose'),
   ('tkp', 'ring'),
   ('dt', '13/Feb/2005'),
   ('ex',
    'Kokoroki tuariri pariako vo rotokasaia oisi kokoroki tivopave vo kokiavaia ora vo iruvaoto.'),
   ('xp',
    'Ol meri blong Rotokas ol i save putim kokoroki long ear blong ol na long nare blong ol.'),
   ('xe', '???'),
   ('ex', 'Rotokasipa riako oisioa kokorokiara tovopaive vo kokiara-ia.'),
   ('xp', 'Bipo ol meri Rotokasi ol i save putim ia rig long ia bilong ol.'),
   ('xe', 'Rotokas woman always used to put earings on their ears.')]),
 ('kokoroku',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'crow'),
   ('tkp', '???'),
   ('dt', '03/Nov/2005'),
   ('cmt',
    "Is Rarova a person's name? There seems to be disagreement over classification? Is it V.A or V.B? Also, no translation for second half of one ex sentence!"),
   ('vx', '1'),
   ('dcsv', 'false'),
   ('ex', 'Kokai kokorokupare.'),
   ('xp', '???'),
   ('xe', 'The rooster is crowing.'),
   ('ex', 'Raku ira kokoropavira rutu vokapare.'),
   ('xp', 'Raku i wokabaut olsem rongrong man.'),
   ('xe', 'Raku is walking around crowing.'),
   ('ex', 'Kokai kokorokupareva voisio Rarova.'),
   ('xp', 'Kakaruk em i krai long Rarova.'),
   ('xe', 'The chicken is crowing ???'),
   ('ex',
    'Riroto kokai kokorokuparoi Rarova urui-ia. Oira kare kokai kare kokorokupaveira.'),
   ('xp', 'Bikpela kakaruk em krai long ples Rarova. ???.')]),
 ('kokorokupie',
  [('rt', 'kokoroku'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make.crow'),
   ('tkp', 'mekim kakaruk i krai'),
   ('eng', 'make crow'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kokai kokorokupievo aviua.'),
   ('xp', 'Tulait i laik bruk taim kakaruk i krai.'),
   ('xe', 'Dawn made the rooster cry.')]),
 ('kokoropato',
  [('rt', 'kokoro'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'crazy person'),
   ('tkp', 'longlong man'),
   ('eng', 'insane person'),
   ('eng', 'fool'),
   ('eng', 'crazy person'),
   ('eng', 'lunatic'),
   ('eng', 'nutcase'),
   ('avm', 'pa_required'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Avoeao kokoropatoa rutu ro oirato.'),
   ('xp', 'Ol man, dispela man i rongrong man tru.'),
   ('xe', '???, this man is truly crazy.')]),
 ('kokorosi',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'cockroach'),
   ('tkp', 'kakalak'),
   ('tkp', 'kokros'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokorosi iria vokapae.'),
   ('xp', 'Kokoros em i wokabaut.'),
   ('xe', 'Cockaroaches walk around.'),
   ('ex',
    'Kokorusi vuriva rutu iria-pa reasiparaveira uvare aioara vuripiepaeveira ora varoara.'),
   ('xp',
    'Kokros i nogut tru, mi save les tru long em bikos em i save bagarapim ol kaikai wantaim ol laplap.'),
   ('xe',
    "Cockroaches, they are really bad and I don't like them because they always ruin food and clothing.")]),
 ('kokorovira',
  [('alt', 'kokoropavira'),
   ('rt', 'kokoro'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'foolishly'),
   ('tkp', 'olsem longlong'),
   ('dt', '03/Nov/2005'),
   ('ex',
    'Ragai riakova vurapaavo kokoropavira ovusia vokapaevo raivaro uva keaparae oisio kokopava uvare upiapaoe.'),
   ('xp',
    'Mi lukim meri taim em i wokabaut olsem long long meri long rot bikos em i sik.'),
   ('xe',
    'I watched the woman being crazy while she walked on the road and I mistook her for a crazy person because she was sick.')]),
 ('kokoru',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'flower bud'),
   ('tkp', 'piaua i pas'),
   ('cmt', 'Check vowel length. Is kokoru related to kokoruuto?'),
   ('dt', '14/Nov/2005'),
   ('ex', 'Kokoru vosia kokoa viapa karupaivi.'),
   ('xp', 'Taim flower ino open iyet.'),
   ('xe', '???'),
   ('ex', 'Kokoru kare arua kovo vuripieivo.'),
   ('xp', 'Ol binatang i bagarapim gaden kumu.'),
   ('xe', 'The insects ruined the vegetable garden.'),
   ('ex',
    'Kararuoa-ia kokoru kekeavo oa touvira karuvere rara ravireo kaeroviro.'),
   ('xp',
    'Mi lukim plaua bilong aibiskus i no op yet. Bai i open taim san i lait.'),
   ('xe',
    'I saw the flower bud on the ??? which will open once the sun ???.')]),
 ('kokoruu',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'insect-infested'),
   ('tkp', 'i gat snek o binatang'),
   ('cmt', 'Check vowel length'),
   ('vx', '1'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Ruve kovo kokoruei uvare viapau vo kovo iava kovopaiveira.'),
   ('xp', 'Gaden aibika i gat snek o binatang bikos ol i no save klimin.')]),
 ('kokoruu',
  [('rt', 'kokoruu'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'worm'),
   ('tkp', 'binatang olsem snek'),
   ('cmt', 'Check relationship between kokoruu and kokoruuto!'),
   ('dt', '10/Nov/2005'),
   ('ex', 'Kokoruuto ira vokaparevoi kakau rao-ia.'),
   ('xp', 'Binatang i wokabaut long han bilong kakau.'),
   ('xe', 'A ??? walks around on cocoa leaves.')]),
 ('kokosi',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'stinging nettle'),
   ('tkp', 'salat paitim skin'),
   ('dt', '03/Nov/2005'),
   ('ex', '')]),
 ('kokosi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'itch'),
   ('eng', 'itch'),
   ('eng', 'sting'),
   ('tkp', 'skirap'),
   ('vx', '1'),
   ('cmt',
    'I take it that the -ia arg is optional and licensed by general semantics---i.e., not an argument of the verb.'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kokosirai opoa-ia kopupa oa viapau vearovira oripiro.'),
   ('xp', 'Mi skirap long taro i no tan gut.'),
   ('xe', "I itch from the taro ??? that wasn't well cooked.")]),
 ('Kokosiria',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name of village'),
   ('tkp', 'ples nem'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Kokosiria eisi Sisivi-ia ruvaraia.'),
   ('xp', 'Kokosiria i stap klostu long Sisivi.'),
   ('xe', 'Kokosiria is close to Sisivi.')]),
 ('kokosito',
  [('rt', 'kokosi'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'sore-covered.man'),
   ('eng', 'person with sore-covered body'),
   ('tkp', 'man i gat sua oltaim'),
   ('dt', '03/Nov/2005'),
   ('cmt', "Tok Pisin translation doesn't match gloss? What's up?"),
   ('ex', 'Kokosito viaa kokosito kapuapato.'),
   ('xp', 'Yu gat sare.'),
   ('xe', '???'),
   ('ex', 'Kokosito era oirato.'),
   ('xp', 'Man ia lek nogut.'),
   ('xe', 'That man is a sore-covered man.'),
   ('ex',
    'Kokosito ro ira-ia opesiasia kapua toupaiveira oa iava eisi voea vaisipaiveira oisio kokosito.'),
   ('xp', 'Kokosito em man i gat soa bilong bipo, soa i no save pinis.'),
   ('xe', '??? and that\'s why they call them a "kokosito".')]),
 ('kokosiva',
  [('rt', 'kokosi'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'large sore'),
   ('ge', 'infection'),
   ('tkp', 'bikpela sua'),
   ('cmt', "Gloss and translation of entry in example don't dovetail!"),
   ('dt', '13/Feb/2005'),
   ('ex', 'Kokosiva via kokosiva ugoaia kapua toupa.'),
   ('xp', 'Yu save meri.'),
   ('xe', '???'),
   ('ex', 'Viviura kokosiva.'),
   ('xp', 'Viviura lek nogut.'),
   ('xe', 'Viviura is a ???.')]),
 ('kokotagoe',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of banana'),
   ('tkp', '???'),
   ('nt', 'has a strong smell when ripe'),
   ('pt', 'itoo'),
   ('sa', 'kovato'),
   ('sa', 'vatauvore'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokotagoe itova vioropava.'),
   ('xp', 'Em i nem bilong bananana i save mao.'),
   ('xe', '???'),
   ('ex', 'Kokotagoe itoova vaisiaro iria vioropiepaiveira oira aiopasia.'),
   ('xp', 'Kiokotagoe nem bilong banana em ol save mekim mao kaikai.'),
   ('xe', '???')]),
 ('kokote',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bird'),
   ('tkp', 'wanpela kain pisin'),
   ('nt',
    'has small black body, lives on the ground, feathers are not nicely [MISSING???]'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '03/Nov/2005'),
   ('ex',
    'Kokote iria vo gauaro uvuoro uvuipauei ra uririu uvare osivuraga gaupaeveira aue tapo putaearei viapau osio riro kaekaeareia oa iava viapau papapaeveira.'),
   ('xp',
    'Kokote krai bilong em i narapela kain yu nap poret sapos yu harim krai bilong em. na tu ol wuing bilong em i sot olsem na em i no inap plai.'),
   ('xe', "??? and that's why it doesn't fly.")]),
 ('kokoto',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'leg'),
   ('tkp', 'lek'),
   ('cmt',
    "It looks like <kokoto> can also be M or F when it's used as a predicative noun. Not sure how to register this in the dictionary..."),
   ('dt', '17/Nov/2006'),
   ('ex', 'Kokotoa raga iava upipai uvare koevrae aveke iare.'),
   ('xp', 'Lek bilong mi i pen bikos mi pundaun antap long ston.'),
   ('xe', '???'),
   ('ex',
    'Riro kaekae kokototoa Periaviri. Viapau uvuipauei rare tapo ikauri. Arivi vuripa arova ikaurevere.'),
   ('xp',
    'Periaviri i gat ol longpela lek. Nogat yu nap ran wantaim em tasol yu trangu bai em i ran lusim yu.'),
   ('xe', "Periaviri has really long legs. You can't run with him. ??.")]),
 ('kokotu',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'sucker of taro plant'),
   ('tkp', 'stik taro'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Ragai kokotuara pausia avaparai.'),
   ('xp', 'Mi laik go planim ol stik taro.'),
   ('xe', 'I am going to plant taro suckers.')]),
 ('kokotua',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'heel of foot'),
   ('tkp', 'as bilong lek'),
   ('dt', '10/Jan/2007'),
   ('cmt', 'example sentence needed')]),
 ('kokotuo',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'heel of foot'),
   ('tkp', 'nupela stik taro'),
   ('dt', '20/Nov/2006'),
   ('cmt',
    'How does this relate to kokotu or kokotua? Clean up second ex sentence.'),
   ('ex', 'Kokotuo vii iava kokotuo vo toupae vo kokotoia.'),
   ('xp', 'Em i stap long leg bilong yu.'),
   ('xe', '???'),
   ('ex',
    'Kokotoa-iva kokotuo iria kokotoa kaepiepaeveira ra vearovira vokapape.'),
   ('xp', 'Beksait bilong lek i save antapim lek taim yumi save wokabaut.'),
   ('xe', '???')]),
 ('kokovae',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'sing'),
   ('tkp', 'singsing'),
   ('sc', 'EMISSION.SOUND'),
   ('vx', '1'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Rirovira kokovaepaave.'),
   ('xp', '???'),
   ('xe', 'They would be singing a lot.'),
   ('ex', 'Kakae riako kokovaepaai.'),
   ('xp', 'Ol liklik meri ol i singsing istap.'),
   ('xe', 'Young women sing a lot.'),
   ('ex', 'Ragai oisio ruipaparaveira ra vovokiro rutu-ia kokovaepara.'),
   ('xp', 'Mi save laik singsing olgeta dei.'),
   ('xe', 'I like to sing every day.')]),
 ('kokovaeva',
  [('rt', 'kokovae'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'song'),
   ('tkp', '???'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokovaeva oiakovara.'),
   ('xp', 'Dispela eni song.'),
   ('xe', '???'),
   ('ex', 'Kokovaeva vaisiaro Vatevu.'),
   ('xp', 'Nem bilong singsing em i Vatevu.'),
   ('xe', 'The name of the song is Vatevu.'),
   ('ex',
    'Iriavu rutu-pa ruipaparaveira erava oisio ra oira raga-ia kokovaepara.'),
   ('xp',
    'Mi save laikim wanpela song tasol olsem bai mi save singim dispela song tasol olgeta dei.'),
   ('xe', '???')]),
 ('kokovara',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'green'),
   ('eng', 'green'),
   ('eng', 'unripe'),
   ('tkp', 'grin'),
   ('tkp', 'no mau'),
   ('sc', '???'),
   ('cmt', 'Check exs.'),
   ('vx', '???'),
   ('dt', '03/Oct/2006'),
   ('ex', 'Kokovara viapau vioroisi vaisi kokovaravira toupa.'),
   ('xp', 'Em i no luk mao yet.'),
   ('xe', '???')]),
 ('kokovara',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('cl', 'isi'),
   ('ge', 'unripe coconut'),
   ('tkp', 'kulau'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Kokovara isi opita isi viapau erakopape.'),
   ('xp', 'Kokonas i grin yet na i no drai.'),
   ('xe', "The unripe coconut is a coconut, which isn't dry."),
   ('ex', 'Kokovara isi opita isi.'),
   ('xp', 'Kokonas em i green.'),
   ('xe', 'The unripe coconut is a coconut.')]),
 ('kokovaravira',
  [('rt', 'kokovara'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'unripe'),
   ('tkp', 'grin no mau'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Itova kokovaravira raga toupae.'),
   ('xp', 'Banana i no mao yet.'),
   ('xe', 'The banana is just unripe.')]),
 ('kokovu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('dx', 'Central'),
   ('ge', 'shave head'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '23/Aug/2005'),
   ('ex', 'Oirato orui kokovurevoi uvare oira rutu siirevoi.'),
   ('xp', 'Man em i lusim liklik hap gras tasol taim em i katim olgeta gras.'),
   ('xe', '???')]),
 ('kokovua',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('sn', '1'),
   ('ge', 'small hill'),
   ('tkp', 'liklik maunten'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Kokovua aareavi vaesia.'),
   ('xp', 'Em liklik mounta.'),
   ('xe', '???'),
   ('ex', 'Savere kokovua purare orui-ia.'),
   ('xp', 'Savere i katim stret garas bilong en.'),
   ('xe', 'Savere is making ??? of his hair.'),
   ('ex', 'Vaesi kokovu-ia ikaupaoro iparae uvare etekua.'),
   ('xp', 'Mi ran i go antap long liklik maunten bikos i sotpela tasol.'),
   ('xe', "I went running up the steep hill because it's short.")]),
 ('kokovua',
  [('rt', 'kokovu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('sn', '2'),
   ('ge', 'shave the hair line'),
   ('tkp', 'sepim arere bilong gras'),
   ('arg', '???'),
   ('vx', '???'),
   ('cmt', 'Need another example'),
   ('dt', '03/Nov/2005'),
   ('ex', 'Savere kokovua purare orui-ia.'),
   ('xp', 'Savere i katim stret garas bilong en.'),
   ('xe', 'Savere shaved the hair on his head.'),
   ('ex', 'Savere oui rutu siirevoi uva gareavi kavure kokovua kukue-ia.'),
   ('xp',
    'Savere em i katim olgeta gras ne em i lusim liklik ap garas long het bilong em.'),
   ('xe', 'Savere cut all of his hair and a little ??? on his head.')]),
 ('kokovua',
  [('rt', 'kokovu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('dx', 'Central'),
   ('ge', 'head-shaving'),
   ('tkp', 'kat antap long gras bilong het'),
   ('dt', '23/Aug/2005'),
   ('ex', 'Oirato orui-ia kokovua purarevoi orasiioro.'),
   ('xp', 'Man em i wokim kat antap long gras bilong het bilong em.'),
   ('xe', '???')]),
 ('kokovupaparie',
  [('rt', 'kokovupa'),
   ('ps', '???'),
   ('ge', 'bamboo razor blade'),
   ('tkp', 'hap mambu bilong sepim resa'),
   ('dt', '03/Nov/2005'),
   ('ex',
    'Voari tuariri uva oirara oisioa givgivu kouro kokovuive aue-ia kokovupaparie.'),
   ('xp',
    'Long taim bipo ol man i save sevim maus garas bilong ol long ol liklik hap mambu.'),
   ('xe', '???')]),
 ('kokovurito',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'inside'),
   ('tkp', 'insait bilong kaikai'),
   ('eng', 'inside of anything edible'),
   ('cmt', 'What\'s going on with "orioripaiavaiava"?'),
   ('dt', '03/Nov/2005'),
   ('ex',
    'Seva oisio ruipapaoi ra orioripaiavaiava upoa kokovurito aioeve uvare riro gorupai rakaria.'),
   ('xp',
    'Seva em i laik olsem bai em i kaikaim insait bilong taro ol i skirapim bikos skin i strong.'),
   ('xe', 'Seva wants ??? because the skin is really strong.')]),
 ('koku',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'break off at base'),
   ('ge', 'snap off at base'),
   ('tkp', 'brukim skru bilong het'),
   ('arg', 'O'),
   ('vx', '2'),
   ('nt',
    'Refers to breaking off anything standing, and doing it without a knife'),
   ('dt', '14/Nov/2005'),
   ('ex', 'Pita sipoiua tuu kokureva kovoa-ia.'),
   ('xp', '???'),
   ('xe',
    'Peter broke off the standing sugar cane at the base in the garden.'),
   ('ex', 'Itova kokuoviroi.'),
   ('xp', 'Banana i bruk namel.'),
   ('xe', 'The banana is broken at the base.')]),
 ('kokuoku',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'plant'),
   ('tkp', 'plant olsem wail taro'),
   ('nt',
    'like wild taro but with edible young leaves. It has a distinct [MISSING???]'),
   ('sf', 'FLORA'),
   ('dt', '13/Jan/2006'),
   ('ex',
    'Oiso toupaiveira osa aue uriko uva vegoa rutu toupaiveira kouoko oiso toupa osa aue arua.'),
   ('xp', 'Em i save i stap long bus no oisave kaikaim olsem kumu tasol.'),
   ('xe', '???'),
   ('ex', 'Kokuokua arua oa ukoviro vatuaro-ia toupaiveira.'),
   ('xp', 'Wanpela kain kumu taro i sae istap arere long wara.'),
   ('xe', '???'),
   ('ex',
    'Kokuokua vo siarao iava urikova iava airepa guru vaaro aiopaiveira.'),
   ('xp',
    'Kokuoku i kam long ol lain bilong wail taro. Ol i save kaikaim yangpela lip bilong em.'),
   ('xe', '???')]),
 ('kokureko',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'umbrella'),
   ('ge', 'taro leaf'),
   ('tkp', 'ambrela'),
   ('tkp', 'lip bilong bikpela taro'),
   ('sf', 'FLORA'),
   ('dt', '13/Jan/2006'),
   ('ex',
    'Kokurekoara rirovira toupaiveira vegoaro, oara-ia oraraipipaveira kokeva-ia.'),
   ('xp',
    'Em i wanpela bikpela wail taro, na ol i yusim olsem amberela long taim bilong ren.'),
   ('xe',
    'Taro leaves are big in the jungloe, and you can cover yourself with it in the rain.'),
   ('ex',
    'Koevare vataupas or ora rakurakupas. Tuaripairava airaio??umballa. Koureko veagoaro toupai rirovi. Koureko oiso toupai osa umballa. Ora vigei vovokioiavo.'),
   ('xp',
    'Em i stap long bik bush of isave usim tam rain?? I podaw bai yum karamp long em em umbrell blong yum no ol man blong before.'),
   ('xe', '???')]),
 ('kokuuto',
  [('rt', 'kokuu'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'cheeky person'),
   ('tkp', '???'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Pita, vii riro kokuuto kuvuto.'),
   ('xp', 'Pita, yu man bilong giaman.'),
   ('xe', "Peter, you're a big ???."),
   ('ex', 'Rereki-ia riro kokuuto.'),
   ('xp', 'Rereki em i man bilong giaman.'),
   ('xe', 'Rereki is a big ???.')]),
 ('kooe',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'swing'),
   ('eng', 'swing on something'),
   ('tkp', '???'),
   ('vx', '1'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Kakaevure kooepaai.'),
   ('xp', '???'),
   ('xe', 'The children are swinging on something.'),
   ('ex', 'Peraisi kooeparoi garoava-ia.'),
   ('xp', 'Peraisi em i sisoo long kanda.'),
   ('xe', 'Peraisi is swinging on a vine.')]),
 ('kookaa',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'grasshopper'),
   ('tkp', '???'),
   ('sf', 'FAUNA'),
   ('nt',
    'Has small brown body. Alternate name of siikoto. Also referred to as the female of the siikoto'),
   ('dt', '29/Jul/2005'),
   ('ex', 'Koka sikoto ira aiopaiveira avuka riako.'),
   ('xp', 'Ol lapun meri ol i save kaikai dispela kain grasopa.'),
   ('xe', '???'),
   ('ex', 'Kooka sikoto votoupa veriva vegoaro aropato koora.'),
   ('xp', 'Grasshopper i save i stap long bush ol i save kaikaim.'),
   ('xe', '???'),
   ('ex', 'Kookaa sikoto votoupareveira egoaro aiopato.'),
   ('xp', 'Lokas i save i stap long bus na ol i save kaikai.'),
   ('xe', '???')]),
 ('kookai',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'rooster'),
   ('tkp', '???'),
   ('dt', '13/Feb/2005'),
   ('ex',
    'Kookai ira gaupare avi kekepaoro ora kookai aiopai sispea meat visei varo.'),
   ('xp',
    'Rooster em i save lukim tonight na yumi save kaikam roosten em meat.'),
   ('xe', '???')]),
 ('kookoo',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'doll'),
   ('ge', 'toy'),
   ('ge', 'infant'),
   ('tkp', '???'),
   ('nt', 'baby talk'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Kookoo u kakaeto aakovo kakaeto kookoo piap reva rovopiepaoro.'),
   ('xp', 'Mamo em givem susu long lilik baby.'),
   ('xe', '???'),
   ('ex', 'Kookoo kakaevure voreoaro, ora vosia akova kakaeto roropaeve.'),
   ('xp',
    'Tokto bilong ol pikinini, na tu sapos mama i givim susu long liklik pikinini.'),
   ('xe', '???')]),
 ('kookooia',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'mourn'),
   ('ge', 'singsing-cry'),
   ('tkp', '???'),
   ('vx', '1'),
   ('sc', 'EMISSION.SOUND'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Kookooiaaepa koovapaoro.'),
   ('xp', '???'),
   ('xe', 'They sang mournfully.'),
   ('ex', 'Oirara ora riakora kookooiapai gau kova roia kopiitoa-pa oirato.'),
   ('xp', 'Ol manmeri ol i singsing krai long man i dai.'),
   ('xe', 'Men and women ???.')]),
 ('kookoopeko',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'faint'),
   ('tkp', '???'),
   ('sc', '???'),
   ('vx', '1'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Kookoopekoroi.'),
   ('xp', '???'),
   ('xe', 'He fainted.'),
   ('ex', 'Raki kookoopekoparoi oisio ra kopiiro.'),
   ('xp', 'Raki i wok long hap dai.'),
   ('xe', 'Raki is faint and about to die.')]),
 ('kookoopi',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'tail'),
   ('tkp', '???'),
   ('nt', 'of e.g. lizard, dog, pig'),
   ('dt', '02/Dec/2004'),
   ('ex',
    'Kookoopi kaakau kookoopiaro kookoopi va toupare vo ira voresuraia vokapare aue kokoti.'),
   ('xp',
    'Tail blong dog wanem?? Kain animol igat fourpela legs em igat tail.'),
   ('xe', '???')]),
 ('kooku',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'lie'),
   ('tkp', 'giaman'),
   ('vx', '1'),
   ('arg', '???'),
   ('cmt', 'Is the person lied to really marked by iava?'),
   ('cm', 'iava'),
   ('dt', '19/Mar/2006'),
   ('ex', 'Koiku kookupaue, vii kuvupauei.'),
   ('xp', 'Yu giaman, yu no tok tru.'),
   ('xe', "You're lying. You're not telling the truth."),
   ('ex', 'Meri, kookupauei ragai iava.'),
   ('xp', 'Meri, yu giaman long mi.'),
   ('xe', "Mary, you're lying to me.")]),
 ('kookuto',
  [('rt', 'kooku'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'liar'),
   ('tkp', 'giaman'),
   ('avm', 'pa_barred'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Pita, viia kookuto.'),
   ('xp', 'Pita, yu man bilong giaman.'),
   ('xe', "Peter, you're a liar.")]),
 ('kookuvira',
  [('rt', 'kooku'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'lying'),
   ('tkp', 'olsem giaman'),
   ('dt', '13/Feb/2005'),
   ('ex',
    'Sera iava osireitorei vuri Sera uva kookuvira raga vurapaevo osa oirato varaoa kavirurevo.'),
   ('xp',
    'Tupela ai bilong Sera i bagarap na em i lukluk giaman tasol taim man i stilim laplap.'),
   ('xe', "Sera's eyes ??? and ???.")]),
 ('koopi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'coffee'),
   ('tkp', 'kopi'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Koopi paupaiveira turueirara oira-ia moni oupasia.'),
   ('xp', 'Ol retskin ol i save palanim kopii bilong kisim moni.'),
   ('xe',
    'The redskins (people from mainland PNG) plant coffee in order to make money from it.')]),
 ('koopipi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'red-leafed plant'),
   ('tkp', 'retpela purpur'),
   ('nt', 'grows low to ground'),
   ('sf', 'FLORA'),
   ('dt', '15/Feb/2005'),
   ('ex', 'Koopipi vo toupai vegaaro koopipi.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Revasipavira guru vapaivera ora vatapo vararo.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Koopipi kouro votoupaiveira vegoaro.'),
   ('xp', 'Em i wanpela kain plant i save istap long bus.'),
   ('xe', '???')]),
 ('koora',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'opossum'),
   ('tkp', 'kapul'),
   ('nt', 'generic term'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Koorato siopea oaive rotokasipairara.'),
   ('xp', 'Kapul em i mit bilong ol Rotokasi.'),
   ('xe', '???')]),
 ('kooroo',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'have hampered speech'),
   ('ge', 'be hoarse'),
   ('tkp', '???'),
   ('vx', '1'),
   ('cmt', 'Check vowel length'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Kooroopape.'),
   ('xp', '???'),
   ('xe',
    'It is hoarse (said, for example, when a radio speaker works poorly).'),
   ('ex', 'Koorooiraorai koovapaoro.'),
   ('xp', 'Nek bilong mi bagarap taim mi singsing.'),
   ('xe', 'I was really hoarse while singing.')]),
 ('koorooto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('sn', '1'),
   ('ge', 'type of lizard'),
   ('tkp', '???'),
   ('nt',
    "Makes a sound like a hoarse throat. Hold it in your hand and you will also become hoarse: sirie `lizard'."),
   ('sf', 'FAUNA'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Koorooto ira vegoaro toupareveira tatagara.'),
   ('xp',
    'Em i wanpela liklik palai i save istap aninit long ol sting diwai.'),
   ('xe', '???')]),
 ('koorooto',
  [('rt', 'kooroo'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('sn', '2'),
   ('ge', 'speechless man because of handicap'),
   ('tkp', 'man i gat nek i pas'),
   ('cmt', 'Check vowel length'),
   ('dt', '01/Mar/2005'),
   ('ex', 'Ivatoa korooto ira viapau vearovira reoreoparo.'),
   ('xp', 'Ivato em i nek nogut em i no save toktok gut.'),
   ('xe', "The mute Ivato can't talk right."),
   ('ex', 'Viia korooto koroovira reopauei.'),
   ('xp', '???'),
   ('xe', '???')]),
 ('kooroovira',
  [('rt', 'kooroo'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'hoarsely'),
   ('ge', '???'),
   ('tkp', 'olsem ???'),
   ('dt', '01/Mar/2005'),
   ('ex', 'Ruke koroovira reoparoveira.'),
   ('xp', 'Luk i no save toktok klia.'),
   ('xe', 'Luke always speaks with an impediment.'),
   ('ex', 'Koroovira toupaavoi viapau uvuipara reopasia roroiovira.'),
   ('xp', 'Mi no i nap toktok gut bikos nek blong i pas.'),
   ('xe', "I can't speak well because my neck is ???.")]),
 ('kooru',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'unripe.coconut'),
   ('eng', 'unripe coconut'),
   ('tkp', 'kulau'),
   ('nt', 'Full form is opita kooru.'),
   ('dt', '01/Oct/2006'),
   ('ex', 'Kooru ukaio ruipaparai.'),
   ('xp', 'Mi laik dring kulau.'),
   ('xe', 'I want to drink unripe coconut.')]),
 ('koota',
  [('ps', 'CLASS'),
   ('ge', 'rope-like'),
   ('tkp', '???'),
   ('eng', 'group of rope-like objects'),
   ('dt', '04/Sep/2005'),
   ('ex', 'Riro koota rutu aue iro evaova-ia.'),
   ('xp', 'Plenti rop tru long diwai.'),
   ('xe', '???'),
   ('ex', 'Iro kootaro kekeavo evaova-ia.'),
   ('xp', 'Mi lukim pinis plenti rop i stap long diwai.'),
   ('xe', '???')]),
 ('kootopa',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'corn'),
   ('tkp', 'mais'),
   ('dt', '01/Mar/2005'),
   ('ex', 'Kootopa kukara ovaisiuvu kukara.'),
   ('xp', 'Rotokas igat twopela long corn .'),
   ('xe', '???'),
   ('ex', 'Kootopa aioparai oara ouavo eisi uua.'),
   ('xp', 'Mi kaikai kon. Mi kisim long bung.'),
   ('xe', 'I am eating corn that I go at the meeting.')]),
 ('kootutu',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'stocky fat woman'),
   ('tkp', '???'),
   ('dt', '28/Aug/2005'),
   ('ex', 'Kootutu oo riako iri kovaevira riropao/toupaeve.'),
   ('xp', 'Dispela meri em I fatpela meri.'),
   ('xe', '???'),
   ('ex', 'Ivia kootutu, viapau uvuipaoi ikauvira vokasia.'),
   ('xp', 'Ivi em i patpela meri, na em i no inap wokabaut kuik.'),
   ('xe', "Ivi is fat, she can't walk quickly.")]),
 ('koou',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'incest between parent and child or siblings'),
   ('tkp', 'slip wantaim bisnis'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Kuvapea koou ira papari-va oureva.'),
   ('xp', 'Kuvape i bin kisim kandere bilong en o wanpisin bilong en.'),
   ('xe', 'Kuvape is a committer of incest who married a ???.')]),
 ('kooupato',
  [('rt', 'koou'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'incestuous person'),
   ('tkp', '???'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Kooupato oirato ira siara riakova ootoparoveira.'),
   ('xp', 'Man i save pamukim ol wanpisin meri.'),
   ('xe',
    'An incestuous person is one who has sex with woman of the same clan.')]),
 ('koova',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'sing'),
   ('tkp', 'singsing'),
   ('sc', '???'),
   ('nt', 'involves dancing while singing'),
   ('vx', '1'),
   ('dt', '08/Sep/2006'),
   ('ex', 'Voea rutu koova rovopaavere pupipaoro.'),
   ('xp', '???'),
   ('xe', 'All of them will start dancing, blowing the pipes and singsing.'),
   ('ex', 'Uriora vata koovapai pupipaoro.'),
   ('xp', 'Ol uriora ol i singsing long kaul.'),
   ('xe', 'The people of Uriora are having a singsing blowing pipes.')]),
 ('koova',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'song'),
   ('tkp', 'singsing'),
   ('vx', '1'),
   ('dt', '28/Sep/2006'),
   ('ex', 'Voea rutu koova rovopaavere pupipaoro.'),
   ('xp', '???'),
   ('xe', 'All of them will start dancing, blowing the pipes and singsing.'),
   ('ex', 'Uriora vata koovapai pupipaoro.'),
   ('xp', 'Ol uriora ol i singsing long kaul.'),
   ('xe', 'The people of Uriora are having a singsing.')]),
 ('koovoto',
  [('rt', 'koovo'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'cheeky person'),
   ('ge', 'joker'),
   ('tkp', 'siki man'),
   ('tkp', 'giaman'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Reviria riro koovoto vo riako-re varo verapaoro.'),
   ('xp',
    'Reviri i man bilong pilai na em i save rausim laplap long ol meri.'),
   ('xe', "Reviri is a big joker who removes the women's clothes."),
   ('ex',
    'Ro oirato ri koovoto viapau osio oiropa reorovaia respa rovere. Oira oiraparoia reoparovere.'),
   ('xp', 'Man i tok pilai tasol sampela taim bai mekim ol trupela stori.'),
   ('xe', '???')]),
 ('koovotova',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'posts used to hold up rafters'),
   ('tkp', '???'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Koovotova iri kepaa pitupaeva gorupiepaoro.'),
   ('xp', 'Post em save holim house long stongim.'),
   ('xe', '???'),
   ('ex', 'Koovotova iria ivaraia kaaroara tovotovopaiveira kepa-ia.'),
   ('xp', 'Longpela diwa i ol i save putim antap long pos na putim ol rapta.'),
   ('xe', '???')]),
 ('kopa',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'swallow'),
   ('eng', 'swallow'),
   ('eng', 'gulp down'),
   ('eng', 'ingest'),
   ('tkp', 'daunim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '28/Sep/2006'),
   ('ex', 'Kakau aioa kopaevoi ikauvira rutu.'),
   ('xp', 'Dok i daunim kwik hap kaukau.'),
   ('xe', 'The dog quickly swallowed the food.'),
   ('ex', 'Kopa aioa kopa va aiovo.'),
   ('xp', 'Yu kaikai na dawnim.'),
   ('xe', '???')]),
 ('kopakai',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'sugar cane'),
   ('tkp', '???'),
   ('sf', 'FLORA'),
   ('nt',
    'Striped with brown inside. It was formerly taboo for children to eat it since it stunted their ???'),
   ('dt', '14/Feb/2005'),
   ('ex',
    'Sipoiva kopakai, iria-ia oisioa kakaevure-re korukorupave voari tuariri.'),
   ('xp',
    'Suka ken i baraun insait, na ol man bilong bipo i save tambuim ol pikinini.'),
   ('xe', '???')]),
 ('kopakava',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'baby'),
   ('ge', 'black female'),
   ('tkp', '???'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Riakova kopakave rupa varava.'),
   ('xp', 'Black meri.'),
   ('xe', '???'),
   ('ex', 'Riakova kopakava eisi-va oira Buin.'),
   ('xp', 'Buin meri i skin bilak.'),
   ('xe', '???')]),
 ('kopakopa',
  [('rt', 'kopa'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'gulp.down'),
   ('tkp', '???'),
   ('eng', 'swallow quickly'),
   ('eng', 'qulp down'),
   ('am', 'false'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '30/Aug/2005'),
   ('ex', 'Aioara kopakopapari.'),
   ('xp', '???'),
   ('xe', 'You are gulping down the food.'),
   ('ex', 'Kaakau iria kaukau kopakopapaoi ikaupavira rutu.'),
   ('xp', 'Dok i wok long daunim kwik tru ol kaukau.'),
   ('xe', 'The dog is quickly devouring the sweet potatos.')]),
 ('kopakovira',
  [('alt', 'kopakopavira'),
   ('rt', 'kopakopa'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'gulping-like'),
   ('tkp', '???'),
   ('dt', '30/Aug/2005'),
   ('ex', 'Kopakopavira aiopare.'),
   ('xp', '???'),
   ('xe', 'He is eating quickly gulping down his food.'),
   ('ex', 'Kopakopavira aiopaevoi koie.'),
   ('xp', 'Pik i wok long kaikai hariap tru.'),
   ('xe', 'The pig is eating ???.')]),
 ('kopato',
  [('rt', 'kopa'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'big eater'),
   ('ge', 'voracious eater'),
   ('tkp', '???'),
   ('cmt', 'Make sure that form kopato, kopava, kopairara all exist.'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Riro aio kopava koie.'),
   ('xp', 'Pik i save kaikai tumas.'),
   ('xe', 'Pigs are big eaters.'),
   ('ex', 'Riro aio kopava koie kovoara-ia.'),
   ('xp', '???'),
   ('xe', 'The pig is a big eater in the gardens (destructive pest).')]),
 ('kopii',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'die'),
   ('eng', 'die'),
   ('eng', 'very ill'),
   ('tkp', 'dai'),
   ('vx', '1'),
   ('cmt', 'Check aspectual difference, dying/very ill vs. dead.'),
   ('dcsv', 'true'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Viapau oiso taraipasi oiso osa kopiipavioveira.'),
   ('xp', '???'),
   ('xe', "They both didn't know how we always die."),
   ('ex', 'Kopi apesia kopipauei eke?'),
   ('xp', '???'),
   ('xe', '???')]),
 ('kopiia',
  [('rt', 'kopii'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'death'),
   ('ge', 'sickness'),
   ('tkp', '???'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Riroa kopiia toupaivoi vovokio-ia.'),
   ('xp', 'Tudei i gat bikpela sik istap.'),
   ('xe', 'Serious illness exists today.'),
   ('ex', 'Kopia kopia-ia viei vouokio.'),
   ('xp', 'Bai dai nam tassol.'),
   ('xe', '???'),
   ('ex', 'Kopiia viepao.'),
   ('xp', 'Yo istap long dai.'),
   ('xe', '???')]),
 ('kopiipato',
  [('rt', 'kopii'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'sick person'),
   ('tkp', 'man i gat sik'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Riroirara kopiipairara toupaivoi eisi ruvarupa kepa-ia.'),
   ('xp', 'Plenti sik manmeri ol i stap long haus marasin.'),
   ('xe', 'Many sick people are in the medical station.')]),
 ('kopiipie',
  [('rt', 'kopii'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'kill'),
   ('eng', 'murder'),
   ('eng', 'kill'),
   ('tkp', 'mekim dai'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cmt',
    "Last example is a bit weird. If they killed him, he's dead, no? How could he be weak?"),
   ('dt', '18/Nov/2005'),
   ('ex', 'Tomas Jon kopiipierevo rera upooro.'),
   ('xp', 'Tomas i kilim dai Jon, taim em i paitim em.'),
   ('xe', 'Tomas killed John by hitting him.'),
   ('ex', 'Kopipie oirato upari uva kopiovoira.'),
   ('xp', 'Yo kilim man man i dai olgeta.'),
   ('xe', '???'),
   ('ex', 'Rera kopiipieiva. Oire raveraveroepa.'),
   ('xp', '???'),
   ('xe', 'They were killing him. And he became very weak.')]),
 ('kopiito',
  [('rt', 'kopii'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'dead person'),
   ('ge', 'corpse'),
   ('ge', 'seriously sick man'),
   ('ge', 'spirit of dead man'),
   ('tkp', '???'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Kopiito roia oirato kopiito urauraroia.'),
   ('xp', 'Dead man/spirit of dead man.'),
   ('xe', '???'),
   ('ex', 'Kopiitoava avai rera rovasia eisi tova urui.'),
   ('xp', 'Diaman ol i go palanim long matmat.'),
   ('xe', '???')]),
 ('kopikao',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bird'),
   ('tkp', 'pisin'),
   ('eng', 'Yellow Bittern (Ixobryuchus sinensis)'),
   ('sa', 'vaakira'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Kopikao riro kaekae kokotova.'),
   ('xp', 'Em pisin i gat longpela lek.'),
   ('xe', 'The Yellow Bittern has long legs.'),
   ('ex', 'Vokiaro gaupaevaira koikas kokiova.'),
   ('xp', 'Long night i save singautim narapela pisin.'),
   ('xe', '???')]),
 ('kopikopi',
  [('rt', 'kopi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'baptize'),
   ('ge', 'sprinkle'),
   ('tkp', 'i gat planti mak'),
   ('nt', 'Used mainly in the United Church.'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Minista kakaevure kopikopiparevoi toupa vokia.'),
   ('xp', 'Long sande minista bai baptaisim ol pikinini.'),
   ('xe', 'The minister will baptize the children on sunday.'),
   ('ex', 'Oirara kopikopipaivora pastairara vo lotua iare voea.'),
   ('xp', '???'),
   ('xe', 'The pastors baptized the people, marking them into the church.')]),
 ('kopikopiara',
  [('rt', 'kopikopi'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('rdp', 'full'),
   ('ge', 'spotted'),
   ('ge', 'dots'),
   ('ge', 'blotches'),
   ('tkp', 'ol makmak'),
   ('dt', '07/Sep/2005'),
   ('ex', 'Varoa-ia kopikopiara toupaivoi.'),
   ('xp', 'Laplap i gat ol makmak.'),
   ('xe', 'There are stains on the clothing.')]),
 ('kopipi',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of plant'),
   ('tkp', 'raunpela haus'),
   ('nt', 'Has read leaves and is used as a tall border plant.'),
   ('sf', 'FLORA'),
   ('dt', '02/Dec/2004')]),
 ('kopirovu',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'rope'),
   ('ge', 'hawswer'),
   ('tkp', 'paklain'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Virera kopirovu aue tukepato.'),
   ('xp', 'Em rop blong yu blong pasim samting.'),
   ('xe', '???'),
   ('ex', 'Reka kopirovu-ia kotokotoara tukerevoi.'),
   ('xp', 'Reka i pasim ol kago long rop.'),
   ('xe', 'Reka tied up the cargo with rope.')]),
 ('kopu',
  [('ps', '???'),
   ('dx', 'Central'),
   ('ge', '???'),
   ('tkp', 'i no tan'),
   ('eng', '???'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Sira kopu pitoka veraevoi upiriko pitoka ovusia viapau oripiro.'),
   ('xp', 'Sira em i rausim sospen kaukau taim em i no tan.'),
   ('xe', '???')]),
 ('kopua',
  [('ps', '???'),
   ('ge', 'green'),
   ('ge', 'unripe'),
   ('ge', 'uncooked'),
   ('tkp', 'i no mau'),
   ('cmt', "Example doesn't look like a verb."),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kopua raga eva koie kuvu.'),
   ('xp', 'Mambu pik i no tan gut.'),
   ('xe', '???')]),
 ('kopuasi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('vx', '1'),
   ('dx', 'Central'),
   ('ge', 'restored'),
   ('eng', 'restored'),
   ('eng', 'rejuvenated'),
   ('tkp', 'pilim gut gen'),
   ('dt', '24/Aug/2005'),
   ('ex', 'Ragai kopuasirai uvare sisiurai.'),
   ('xp', 'Mi pilim gut gen bikos mi waswas.'),
   ('xe', '???')]),
 ('kopuasipie',
  [('rt', 'kopuasi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'restore'),
   ('ge', 'rejuvenate'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Leo kepa kopuasipierevo va kipuoro, uva vearo kekepai.'),
   ('xp', 'Leo i penim haus na em i luk nais tru.'),
   ('xe', 'Leo restored his house by painting it, and it looks good.'),
   ('ex', 'Va kopuasi pierivo va kipuoro vorerivira.'),
   ('xp', '???'),
   ('xe', 'You restored it while painting it again.')]),
 ('kopuasito',
  [('rt', 'kopuasi'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'young'),
   ('tkp', 'yangpela'),
   ('cmt',
    'Double-check translation. It says young but seems to get translated as smart.'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Kopuasito viapau upiavai revaia riro goruto.'),
   ('xp', 'I no gat sik em stong pela man.'),
   ('xe', '???'),
   ('ex', 'Kopuasito ro kakaeto, viapau upiavai rera-ia.'),
   ('xp', 'Smatpela boi tru nogat sik long em.'),
   ('xe', '???'),
   ('ex', 'Vii kopuasitoa rutu viovokoto.'),
   ('xp', 'Yu smatpela yangpela boi.'),
   ('xe', "You're a smart boy.")]),
 ('kopuasitovira',
  [('rt', 'kopuasi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'youthful'),
   ('tkp', 'olsem yangpela'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Latu kopuasivira raga touparevoi, uva viapau ikauvira uusiparoi.'),
   ('xp', 'Latu i skin kirap yet na em i no slip kwik.'),
   ('xe', "Latu is ???, and he can't fall asleep quickly."),
   ('ex', 'Vii kopuasitovira touparivo viovokovira.'),
   ('xp', 'Yu luk yangpela man.'),
   ('xe', '???')]),
 ('kopuasivira',
  [('rt', 'kopuasi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'young and fresh'),
   ('ge', 'powerful'),
   ('ge', 'tirelessly'),
   ('tkp', '???'),
   ('dt', '19/Feb/2004'),
   ('ex', 'Kopuasivira kovopare.'),
   ('xp', '???'),
   ('xe', 'He works tirelessly.')]),
 ('Kopuisi',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'Kopuisi'),
   ('eng', 'Kopuisi'),
   ('nt', 'proper name'),
   ('dt', '26/Jan/2005')]),
 ('kopukopu',
  [('rt', 'kopu'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('rdp', 'full'),
   ('ge', 'frontside'),
   ('tkp', 'poret bilong haus'),
   ('nt', 'front of anything (e.g., shirt, blouse, body, house)'),
   ('dt', '22/Nov/2005'),
   ('cmt', 'How can this be a verb? Something is confused here.'),
   ('ex', 'Kepa oiso ragavira kopukopupai.'),
   ('xp', '???'),
   ('xe', '???')]),
 ('kopukopua',
  [('rt', 'kopukopu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'front side of house'),
   ('tkp', '???'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kepa iava kopukopua oisio ragavira toupaivoi.'),
   ('xp', 'Pran bilong haus istap olsem.'),
   ('xe', 'The front of the house is just like this.'),
   ('ex', 'Kuvupato tovori ra kopukopua-ia avaka isiro pitupitupieri.'),
   ('xp', 'Yu putim siot, na bai yu pasim ol baton.'),
   ('xe', '???')]),
 ('kopupa',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'unripe'),
   ('ge', 'green yet'),
   ('tkp', '???'),
   ('dt', '01/Jun/2005'),
   ('am', 'false'),
   ('ex', 'Kopupa vao vaviokoa viapau viorio/melon.'),
   ('xp', 'Pawpaw is bao yet water melon i no bao yet.'),
   ('xe', '???'),
   ('ex', 'Kopupa isi raga eva isi aue vavioko.'),
   ('xp', 'Popo i grin yet na i no mau.'),
   ('xe', 'The pawpaw is unripe.')]),
 ('kopupira',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'red clay-like ground'),
   ('tkp', '???'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Rasiva revasi kekevira dupae rasiva rasiaiwa.'),
   ('xp', 'NO DATA.'),
   ('xe', '???')]),
 ('kopuro',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'fly'),
   ('tkp', '???'),
   ('nt', 'Small type that lives in jungle.'),
   ('dt', '20/Feb/2005'),
   ('ex', 'Kopuro korikoripa va vegotoupaeveira oisi kekevavi.'),
   ('xp', 'Tausito em luk olsem fly nn lizard.'),
   ('xe', '???'),
   ('ex', 'Kopuro korikoripava.'),
   ('xp', 'Liklik palai i gat ol makmak.'),
   ('xe', 'The ??? is ???.')]),
 ('Kopuru',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Kopurua ro vurapato oirato.'),
   ('xp', 'Em i masol na strongpela man.'),
   ('xe', 'Kopuru is a strong man.')]),
 ('kopuvioro',
  [('rt', 'kopu'),
   ('ps', 'N'),
   ('pt', '???'),
   ('cl', 'isi'),
   ('ge', 'unripe banana'),
   ('tkp', '???'),
   ('pt', 'itoo'),
   ('dt', '12/Aug/2005'),
   ('ex', 'Kopu isi raga vaisio ito isi.'),
   ('xp', 'Banana i no mau.'),
   ('xe', '???')]),
 ('kopuvira',
  [('alt', 'kopupavira'),
   ('rt', 'kopu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'uncooked'),
   ('ge', 'unripe'),
   ('tkp', '???'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Kopupavira toupaivoi upiriko pitoka.'),
   ('xp', 'Sospen kaukau i no tan.'),
   ('xe', 'The saucepan of sweet potatos is uncooked.'),
   ('ex', 'Viapau vearovira vao opoa oripiro o kopupavira toupaevoi.'),
   ('xp', '???'),
   ('xe', "The taro isn't well cooked, it is still uncooked."),
   ('ex',
    'Kopuvira raga araisi orievo rara igei-pa uva oisi raga oira aioiovo.'),
   ('xp', 'Rara em i kukim rais i no tan gut na mipela kaikaim olsem tasol.'),
   ('xe', '???')]),
 ('kora',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'vent'),
   ('eng', 'vent anger or frustration on an object'),
   ('tkp', '???'),
   ('vx', '1'),
   ('cmt', 'Is there casemarking? Is there a second argument?'),
   ('dt', '26/Jun/2005'),
   ('ex', 'Terita koveroi uva totoroe. Oire koraai oirara, evaova ritapaoro.'),
   ('xp',
    'Terita i pundaun long diwai na liklik taim em i dai na ol man i sutim diwai.'),
   ('xe', 'Terita fell down and ???.'),
   ('ex', 'Oiratoa-ia koraaepa oirara, uvare koveroepa opita atopaoro.'),
   ('xp', '???'),
   ('xe',
    'The people vented their frustration and sorrow for the man because ???.')]),
 ('korapato',
  [('rt', 'kora'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'punishment tree'),
   ('tkp', '???'),
   ('nt', 'tree designated to receive punishment otherwise vented on a ???'),
   ('sa', 'kasiava'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Eva oa-ia korapato purai oisio ra va ritaive oirara.'),
   ('xp', 'Diwai ol i sanapim bai ol man i kam sutim.'),
   ('xe', '???')]),
 ('korara',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'spin top in play'),
   ('tkp', '???'),
   ('sc', '???'),
   ('vx', '1'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Viovokopairara oea korarapai aue-ia atope visikopaoro.'),
   ('xp', 'Ol boi i pilai long het bilong sel kokonas.'),
   ('xe', '???'),
   ('ex', 'Korarapaai viovokopairara atopeara-ia.'),
   ('xp', '???'),
   ('xe', 'The boys are playing a spinning game with coconut shell tops.')]),
 ('koraraoko',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'type of frog'),
   ('tkp', '???'),
   ('nt',
    "Small yellow to brown body, large feet. It was pictured on Papua New Guinea's 20 toea postage stamp."),
   ('dt', '14/Feb/2005'),
   ('ex', 'Koraraoko iria tatagaara reroara toupaeveira.'),
   ('xp', 'Em wanpela kain rokrok i save istap aninit long ol diwai.'),
   ('xe', 'The ??? frog lives underneath ??? trees.')]),
 ('korau',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'clear'),
   ('ge', 'unobstructed'),
   ('eng', 'clear or unobstructed'),
   ('tkp', 'klia'),
   ('vx', '1'),
   ('cmt', 'example needed'),
   ('dt', '28/Apr/2007')]),
 ('koraua',
  [('rt', 'korau'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'clearing'),
   ('tkp', 'hap i klia'),
   ('dt', '26/Jan/2005'),
   ('ex', 'Vavo toreu koraua ra vavoiso vurari avakava.'),
   ('xp', 'Yu sanap long peles kilia na yu lukluk i go long solwava.'),
   ('xe', 'You stand up at a clear spot and you can see the ocean.'),
   ('ex', 'Vavo vurari koraua-ia.'),
   ('xp', 'Yu sanap long klia peles.'),
   ('xe', 'You stand up on a cleared spot.'),
   ('ex', 'Jon koraua-ia vokapare rogaraua-ia, avakava sirova.'),
   ('xp', 'Jon i wokabaut ples klia long wetsan arere long solwara.'),
   ('xe', '???')]),
 ('korauru',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of lizard'),
   ('tkp', '???'),
   ('nt', 'has small brown body and lives in the treetops'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Korauru ira vavo isio toupareveira vegoaro.'),
   ('xp', 'Palai i save istap long bus.'),
   ('xe', 'The ??? lizard lives in the bush.')]),
 ('korauvira',
  [('rt', 'korau'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'unobstructed'),
   ('tkp', 'olsem klia'),
   ('dt', '29/May/2005'),
   ('cmt', 'What is aue doing in the first example?'),
   ('ex', 'Kovauvira vurapi vavoiso uva viaei riapu rukutavai.'),
   ('xp', 'Em luk klia lang nogat kaut.'),
   ('xe', 'It looks clear and ???.'),
   ('ex', 'Toki korauvira rutu kareke.'),
   ('xp', 'Bagana i klia tru.'),
   ('xe', 'Mt. Balbi can be clearly seen.'),
   ('ex', 'Korauvira toupai aue evaoa.'),
   ('xp', 'Diwais em i stap long ples klia.'),
   ('xe', 'The trees are in the clearing.'),
   ('ex', 'Ezra korauvira rutu toreparoi.'),
   ('xp', 'Ezra i sanap long ples klia.'),
   ('xe', 'Ezra is standing up in the clearing.')]),
 ('korea',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'waddle of rooster or hen'),
   ('tkp', '???'),
   ('dt', '29/May/2005'),
   ('ex', 'Korea oa kokai-ia toupaiveira.'),
   ('xp', 'Buk we ol kaikai i save istap long kakaruk.'),
   ('xe', '???')]),
 ('korekare',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'animals'),
   ('tkp', 'ol animal'),
   ('cmt',
    'Does <kore> appear by itself, without the free pluralizer <kare>? This appears to be the generic term for all fauna. Double-check what is applies to.'),
   ('dt', '22/Jan/2007')]),
 ('Korere',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'Korean'),
   ('tkp', 'bilong Korea'),
   ('dt', '02/Dec/2004'),
   ('ex', 'korerepairara'),
   ('xp', 'pipol bilong Korea'),
   ('xe', 'Koreans')]),
 ('korereto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of lizard'),
   ('tkp', '???'),
   ('nt', 'small with ridge down the back, lives in sago palms and coconut'),
   ('sf', 'FAUNA'),
   ('dt', '02/Dec/2004'),
   ('ex',
    'Korereto koriava rera vararoia vo tou pareveira tetevu guruvaro ova vo opita raoroia.'),
   ('xp',
    'Korereto em i got ol makmak long body blong en em i save i stap long han blong sago na coconut lip.'),
   ('xe',
    'The koreto lizard has got a mark on his body and he is usually on sago palms or coconut leaves.'),
   ('ex',
    'Korereto koripato ira vo toupareveira tetevu guruva roia, tavaravira kou koupaoro.'),
   ('xp',
    'Palai i gat makmak na i save istap long ol lip saksak, na i save karim kiau dabol.'),
   ('xe', '???')]),
 ('kori',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'mark'),
   ('eng', 'mark'),
   ('eng', 'weave with design'),
   ('eng', 'write'),
   ('tkp', 'rait makim'),
   ('arg', '???'),
   ('vx', '???'),
   ('cmt',
    "Need more examples, probably can be found in corpus. What's with last example? Can <kori> also mean `cut'?"),
   ('dt', '10/Jan/2007'),
   ('ex', 'Takekoriiva.'),
   ('xp', '???'),
   ('xe', 'They wove the bamboo with a design.'),
   ('ex', 'Avaisisi koripava tururevoi takei kepa-ia.'),
   ('xp', '???'),
   ('xe', 'Avaisisi sews a patterned design on the house walls.'),
   ('ex', 'Koue korivo oira siarearo verasia uva oira orivo.'),
   ('xp',
    'Ol i bin katim pik na ol i rausim bel bilong em na bihain ol i kukim pik.'),
   ('xe', 'They ??? the pig, removing its guts, and they cooked it.')]),
 ('koria',
  [('rt', 'kori'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'writing'),
   ('ge', 'markings'),
   ('ge', 'design'),
   ('tkp', 'makmak raft'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Avaisisi koria purarevoi kepa-ia.'),
   ('xp', 'Avaisisi i wokim ol makmak long haus.'),
   ('xe', 'Avaisisi is putting markings on the house.')]),
 ('Koribori',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'village name'),
   ('tkp', 'ples nem'),
   ('dx', 'Aita'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Uruia vaisiaro vao Korvore eisi Siteamasi-ia ruvaraia.'),
   ('xp', 'Nem bilong ples korobore klostu long siteamasi.'),
   ('xe', 'The village named Koribori is close to Siteamasi.')]),
 ('korikori',
  [('rt', 'kori'),
   ('ps', '???'),
   ('ge', 'marking'),
   ('tkp', 'makmak'),
   ('eng', 'writing'),
   ('eng', 'markings'),
   ('eng', 'design'),
   ('dt', '14/May/2005')]),
 ('korikoripava',
  [('rt', 'korikori'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'snake with marks on its back'),
   ('ge', 'Upe hat with designs'),
   ('tkp', 'snek i gat makmak'),
   ('tkp', 'upe hat'),
   ('cmt', 'Double-check translation'),
   ('dt', '14/Feb/2005'),
   ('ex', 'Upeva korikoripava.'),
   ('xp', 'Upe i gat ol makmak.'),
   ('xe', 'The Upe is patterned.')]),
 ('korita',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'carve'),
   ('tkp', '???'),
   ('ge', 'carve'),
   ('ge', 'dissect'),
   ('ge', 'cut up'),
   ('nt', 'applies to meat'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '03/Jul/2005'),
   ('ex', 'Oire voava rera koritaive rera aiosia.'),
   ('xp', '???'),
   ('xe', 'Then they would cut him up to eat him.')]),
 ('korita',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'cutlet?'),
   ('tkp', '???'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Jonatan koie korita kuvupare aue-ia veeta.'),
   ('xp', 'Jonatan i pulamapim pik ol i katim pinis long ol mambu.'),
   ('xe', 'Jonathan filled the bamboo with pig cutlets.')]),
 ('koriteira',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'container'),
   ('tkp', 'kontaina'),
   ('nt', 'Tok Pisin loan'),
   ('dt', '23/Jul/2004')]),
 ('koro',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'blinded'),
   ('ge', 'go inside the eye'),
   ('tkp', 'sut i go long ai rop'),
   ('cmt',
    'Check semantics. How do you specify what went in the eye? Can you say I was blinded by an insect in my eye.'),
   ('dt', '22/Nov/2005'),
   ('vx', '???'),
   ('cmt', 'Check original -- koroai or kororai?'),
   ('ex', 'Viapau vearoviva vurapaa uvare korotoae ragai.'),
   ('xp', 'Mi no lukluk gut bikos mi gat samting i stap long ei.'),
   ('xe', "I can't see well because I've got something in my eye."),
   ('ex', 'Viapau vearovira vurapaa uvare koroai ragai osireiaro-ia.'),
   ('xp', 'Mi no lukluk gut bikos i gat samting long ai bilong mi.'),
   ('xe', "I can't see well because I'm blind.")]),
 ('kororo',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'bird'),
   ('tkp', 'pisin'),
   ('eng', 'Papuan Frogmouth (Podargus papuensis)'),
   ('dt', '12/Jul/2004')]),
 ('kororo',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('cl', 'isi'),
   ('ge', 'ball'),
   ('ge', 'round'),
   ('ge', 'sphere'),
   ('tkp', 'bol'),
   ('tkp', 'raunpela ston'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Ipura kororoisi rutu-ia avekeisi kokioto ritarevoi porokoa-ia.'),
   ('xp', 'Ipura em i sutim pisin long raunpela ston stret long katapila.'),
   ('xe', '???')]),
 ('kororoisivira',
  [('rt', 'kororoisi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'round'),
   ('ge', 'spherical'),
   ('ge', 'ball-like'),
   ('tkp', 'bol'),
   ('tkp', 'raunpela'),
   ('dt', '07/Feb/2005'),
   ('ex', 'Opo kuio kororoisivira toupaivo.'),
   ('xp', 'Taro em i raunpela.'),
   ('xe', 'Taro is round.')]),
 ('kororovi',
  [('rt', 'kororo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'circle'),
   ('tkp', 'raunpela lain'),
   ('dt', '13/Feb/2005')]),
 ('kororovivira',
  [('rt', 'kororovi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'circular'),
   ('tkp', 'olsem raunpela'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Kororovivira raga opo purapae riakora.'),
   ('xp', 'Ol meri i wok long wokim ol raunpela sikon.'),
   ('xe', 'The women make only scones (literally, round taro/bread).'),
   ('ex', 'Kororovivira opo isiro puraevo Vitera.'),
   ('xp', 'Vitera i wokim ol raunpela sikon.'),
   ('xe', 'Vitera made a scone (literally, round taro/bread) .')]),
 ('kororu',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'endangered'),
   ('ge', 'near death'),
   ('tkp', 'i laik dai'),
   ('dcsv', 'true'),
   ('vx', '???'),
   ('cmt', 'Does this mean close or close to death? Huh?'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Kororu Sera ei kororu avia ra kopia.'),
   ('xp', 'Liklik taim tasol na Sera bai dai.'),
   ('xe', 'Sera will die soon.'),
   ('ex', 'Kororu pita ei kororu avia ra kopiro.'),
   ('xp', 'Liklik taim tasol na Pita bai dai.'),
   ('xe', 'Peter will die soon.'),
   ('ex', 'Pita epao kororua via ra kopiiro.'),
   ('xp', 'Pita i stap long liklik hap tasol na bai em i dai.'),
   ('xe', '???')]),
 ('kororupie',
  [('rt', 'kororu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'stranded by flood'),
   ('tkp', 'wara i pasim em'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Sera kororupievo pirutuva uva viapau kareoe.'),
   ('xp', 'Wara i tait na i pasim Sera long sait na em i no kam.'),
   ('xe', "The flood stranded Sera and she didn't come."),
   ('ex', 'Kororupie pirutuva igei/ragai/visii/vei kororupie.'),
   ('xp', 'Yu mi napa go long peles because taiet em pasim yu.'),
   ('xe', '???'),
   ('ex', 'Voi kareapa eisi wakunai voea kororupie pirutuva.'),
   ('xp', '???'),
   ('xe', '???')]),
 ('koroto',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'meet together'),
   ('tkp', 'bung'),
   ('cm', '-re'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'John, Pita-re korotosia ira karepa eisi-va Wakunai.'),
   ('xp', 'John yu go na bungim Pita em i kam long Wakunai.'),
   ('xe', "John, you go meet Peter, he's coming from Wakunai."),
   ('ex', 'Vii korotopari Sera-re.'),
   ('xp', 'Yu laik bungim Sera.'),
   ('xe', "You're going to meet with Sera."),
   ('ex', 'Ragai avaparai Mak-re korotosia eisi raivaro.'),
   ('xp', 'Mi go bungim Mak long rot.'),
   ('xe', "I'm going to meet Mark on the road.")]),
 ('koroviri',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'braid'),
   ('ge', 'plait'),
   ('ge', 'twist together'),
   ('tkp', 'pas wantaim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('cmt', 'Double-check second sentence.'),
   ('ex', 'Irorio koroviripasivoi.'),
   ('xp', '???'),
   ('xe', 'The two men are braiding rope.'),
   ('ex', 'Mata koroviri purapae kuvera ua purasia.'),
   ('xp', 'Mata i pasim tupela rop bilong wokim ubian.'),
   ('xe', 'Mata ???.')]),
 ('korovo',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'oil'),
   ('ge', 'lubricate'),
   ('tkp', 'welim'),
   ('cmt', "What's the gender?"),
   ('dt', '07/Sep/2006'),
   ('ex', 'Korovo opita korovo isipava varaua isipasia.'),
   ('xp', 'Oil blong kokonas blong napim long body.'),
   ('xe', 'Coconut oil ???.'),
   ('ex', 'Korovo varaua isipava aue iava opita.'),
   ('xp', 'Oil bilong rapim long skin, na ol i ave wokim long kokonas.'),
   ('xe', '???')]),
 ('koru',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make return'),
   ('tkp', 'bekim'),
   ('dcsv', 'true'),
   ('cmt', 'Not clear on meaning--double check vowel length'),
   ('dt', '19/Mar/2006'),
   ('ex',
    'Jon-re korusia opeita eva oupare/pitare korusa opeita avaparo skulsa.'),
   ('xp', 'Tokim John no kan go/tokim pita em/noken go long skul.'),
   ('xe', '???'),
   ('ex',
    'Kokusia kokoto rekuo kokusa uriou vorio avivikepaoro kokusia uriou.'),
   ('xp', 'Bukim sikul na yu bowim head blong yu.'),
   ('xe', '???'),
   ('ex', 'Sipito igei-re koru kovoarapa.'),
   ('xp', 'Chief i tokim mipela long noken go wok.'),
   ('xe', '???'),
   ('ex', 'Fidelis korupare Ata teapi avaro eisi-re uukova uvare pirutuoe.'),
   ('xp', 'Fidelis Aata i stopim nogut em i go long wara i tait.'),
   ('xe',
    'Aata stopped Fidelis from going to the water because it was high tide.')]),
 ('Koruko',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'village name'),
   ('tkp', 'ples nem'),
   ('sa', 'Sirioripaia'),
   ('dx', 'Pipipaia'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Oropeara eisi-va urioroi Koruko.'),
   ('xp', 'Oropeara kam long Koruko.'),
   ('xe', 'Oropeara comes from Koruko.')]),
 ('korukoru',
  [('rt', 'koru'),
   ('rdp', 'full'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'block'),
   ('ge', 'obstruct'),
   ('eng', 'block'),
   ('eng', 'obstruct'),
   ('eng', 'hinder'),
   ('eng', 'deter'),
   ('tkp', '???'),
   ('vx', '2'),
   ('arg', 'OBL'),
   ('cm', '-re'),
   ('dt', '17/Jul/2005'),
   ('ex', 'Uva ragai-re korukoruragaparoepa.'),
   ('xp', '???'),
   ('xe', 'He tried to hinder me, but in vain.'),
   ('ex', 'Jemis-re korukoruparoe aiteto, ovusia uturoe.'),
   ('xp', 'Papa i rausim Jemis, tasol em i go bihain.'),
   ('xe', 'Dad blocked Jemis but he followed.')]),
 ('korukorupato',
  [('rt', 'korukoru'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'interloper'),
   ('eng', 'interloper'),
   ('eng', 'intermediary'),
   ('tkp', 'man bilong stopim'),
   ('avm', 'pa_required'),
   ('dt', '02/Sep/2005')]),
 ('koruo',
  [('ps', '???'),
   ('ge', 'young'),
   ('tkp', 'yangpela'),
   ('cmt', 'No examples'),
   ('dt', '01/Jun/2005')]),
 ('koruoto',
  [('rt', 'koruo'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'young person'),
   ('tkp', 'yangpela man'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Koueto koruoto.'),
   ('xp', 'Yangpela pig.'),
   ('xe', '???'),
   ('ex', 'Viia koruoto viovokoto.'),
   ('xp', 'Yu yangpela man.'),
   ('xe', "You're a young man."),
   ('ex', 'Koruoto roia viovokoto oirato.'),
   ('xp', 'Boi em i yangpela tasol.'),
   ('xe', '???')]),
 ('koruou',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'sacrifice'),
   ('tkp', 'ofa'),
   ('cm', '-pa'),
   ('vx', '???'),
   ('arg', 'OBL'),
   ('cmt', 'Unclear whether the non-sub args are subcategorized...'),
   ('dt', '04/Dec/2006'),
   ('ex', 'Tuariripa irara oisoa tugara kare-pa koruoupaave aue-ia koie.'),
   ('xp', '???'),
   ('xe',
    'People of long ago would always sacrifice to the bush spirits using.'),
   ('ex', 'Tuariripairara oisioa koruoupave aue-ia koie tugara kare-pa.'),
   ('xp',
    'Ol man bilong bipo ol i save givim pik long ol masalai bilong bus.'),
   ('xe', 'The ancestors always sacrified pigs to the spirits.')]),
 ('koruoua',
  [('rt', 'koruou'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'sacrifice'),
   ('tkp', 'ofa'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Koruoua purasia avapaviei vavo tugaratoa-pa.'),
   ('xp', 'Bai yumi go givim kaikai long masalai.'),
   ('xe', "We're going to make a sacrifice to the spirits."),
   ('ex', 'Koruoua purapai vavo pugara.'),
   ('xp', 'Sacrifice long ol bush spirit.'),
   ('xe', '???')]),
 ('koruovira',
  [('rt', 'koruo'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'young'),
   ('tkp', 'yangpela'),
   ('dt', '13/Feb/2005'),
   ('ex', 'Koruovira toupai koie kare.'),
   ('xp', 'Ol pik ol i stap yangpela.'),
   ('xe', 'The pigs are young.')]),
 ('korupie',
  [('rt', 'koru'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'take outside'),
   ('tkp', 'i kam ausait'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Sigoa korupie kepa siovara iava.'),
   ('xp', 'Yu kisim naip kam arasait long haus.'),
   ('xe', 'Take the knife outside from the house.')]),
 ('korupievira',
  [('rt', 'koru'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'whispered-like'),
   ('tkp', '???'),
   ('cmt', "Check vowel length. Gloss and example don't match."),
   ('dt', '13/Feb/2005'),
   ('ex', 'Pitokava korupievira toupae.'),
   ('xp', 'Sospen i stap arasait.'),
   ('xe', 'The saucepan is ???.')]),
 ('kosi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'exit'),
   ('eng', 'go out'),
   ('eng', 'exit'),
   ('eng', 'come out'),
   ('tkp', 'go autsait'),
   ('tkp', 'kamaut'),
   ('dcsv', 'true'),
   ('sc', 'MOTION'),
   ('vx', '1'),
   ('cmt', 'Sentence fragment?'),
   ('dt', '20/Nov/2005'),
   ('ex', 'Kakau kosia vao vii vaaro.'),
   ('xp', 'Hap bek kakau bilong yu.'),
   ('xe', '???')]),
 ('kosikosi',
  [('rt', 'kosi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'cut off sago palm leaves'),
   ('tkp', 'katkatim lip bilong saksak'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '10/Aug/2005'),
   ('ex', 'Sipoia kosikosi.'),
   ('xp', 'Katimkatim suger cane.'),
   ('xe', 'Kat the sugarcane.'),
   ('ex', 'Kokora, tetevu rao kosikosi.'),
   ('xp', 'Kokora, yu kakatim han bilong saksak.'),
   ('xe', 'Kokora, cut the sago branch.'),
   ('ex',
    'Ragai uviara rava opoara kosikosipavoi oisio ra vara kukuearo paua.'),
   ('xp',
    'Mi wok long katikatim ol taro long het bilong ol olsem bai mi pranim ol het bilong taro.'),
   ('xe', '???')]),
 ('kosikosi',
  [('rt', 'kosi'),
   ('rdp', 'full'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'exit'),
   ('tkp', 'go autsait'),
   ('tkp', 'kamaut'),
   ('eng', 'come out'),
   ('eng', 'exit'),
   ('ge', 'gush out'),
   ('tkp', 'kamkamaut strong'),
   ('dcpv', 'true'),
   ('vx', '1'),
   ('sc', 'MOTION'),
   ('cmt', 'Is this a full word or really a stem?'),
   ('dt', '14/Nov/2005'),
   ('ex', 'Kosikosipapiro ukoa vo tage tou.'),
   ('xp', 'Kamkam out long dispela tager wara.'),
   ('xe', '???'),
   ('ex', 'Vukoa kosikosipapiro vavova pukuia.'),
   ('xp', 'Wara em i kamkam out long dispela mauntem.'),
   ('xe', 'Water is gushing out of this mountain.'),
   ('ex', 'Ukovi vavo-va kosikosipapiroveira pukuia vituaro.'),
   ('xp', 'Wara i save kam aut long as bilong dispela maunten.'),
   ('xe', 'The river gushes out from the base of the mountain.'),
   ('ex', 'Kakae vure kosikosipaviroi kepa sovara iava.'),
   ('xp', 'Ol pikinini i kam autsait long haus na i go.'),
   ('xe', 'The children have come outside of the house.')]),
 ('kosipa',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', '???'),
   ('tkp', 'katim het bilong taro'),
   ('vx', '2'),
   ('arg', 'O'),
   ('cmt',
    'The meaning of "earn money/painim moni" is not recognized by informants. Can you have kosipapaavoi?'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Varao rutu kosipaa vo raga.'),
   ('xp', 'Bai mi hipim olgeta samting long hia.'),
   ('xe', '???'),
   ('ex', 'Ragai opo kosiparai opo kosipa-ia.'),
   ('xp',
    'Mi katim het bilong taro long samtin gbilong katim het bilong taro.'),
   ('xe', '???')]),
 ('kosipato',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', '???'),
   ('tkp', '???'),
   ('dt', '23/Nov/2004'),
   ('ex', 'Opo kosipatoa akavi avakava iava rera akoroto.'),
   ('xp', 'Sel bilong sol wara em bilong katim taro.'),
   ('xe', '???')]),
 ('kosipie',
  [('rt', 'kosi'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make exit'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Oire avipe ra voava voea kosipieive ita aia voeava pupisia.'),
   ('xp', '???'),
   ('xe',
    'Then when it was light they would bring them out again to singsing with.'),
   ('ex', 'Aviipai ra voea kosipiepe voea-va pupisia.'),
   ('xp',
    'Taim i tulait bai yumi bringim ol i kam arasait, bai yumi singsing wantaim ol.'),
   ('xe', "When it's ??? we'll bring you ??? in order to sing with them.")]),
 ('kosivago',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'decorative shell'),
   ('tkp', 'sel bilong bilas'),
   ('nt', 'large flat type worn on neck as decoration'),
   ('dt', '26/Jan/2005'),
   ('ex',
    'Kosivago asipava tuariripaira oisoa asipave kosivagoi avakavaio oiso kosiva puropaive oira kuripaori asipasa.'),
   ('xp',
    'Shel ol man blong bipo ol save putim long nek blong ol na biras long em ol i save kisim long sol wara.'),
   ('xe',
    'A shell that in the past people put on their neck and which ??? usually find in the ocean.'),
   ('ex', 'Kosivaio pekeri oa oisioa kuripaive va-ia asiipasia.'),
   ('xp', 'Sel ol i save skirapim bilong hangamapim long nek.'),
   ('xe', '???')]),
 ('kosiviro',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'go out'),
   ('ge', 'exit'),
   ('tkp', 'aut'),
   ('vx', '1'),
   ('sc', 'MOTION'),
   ('cmt', 'Is the viro necessary?'),
   ('dt', '14/Nov/2005'),
   ('ex', 'Vokiaro kosivirorae uva ovato kekeavo osa vokaparevo.'),
   ('xp',
    'Long nait mi kam ausait na mi lukim sotpela wailman taim em wokabaut.'),
   ('xe', 'At night I went out and I saw a dwarf man as he walked around.'),
   ('ex', 'Josep kosiviro kepa iava.'),
   ('xp', 'Joseph em i go arasait long haus.'),
   ('xe', '???')]),
 ('koto',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'hang'),
   ('tkp', 'hangamap'),
   ('vx', '1'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Aatu kotivira kotopaoi uusipaoro.'),
   ('xp', '???'),
   ('xe', 'The flying fox hangs in a hanging position while sleeping.'),
   ('ex', 'Aatu iria kotovira uusipaoveira.'),
   ('xp', 'Bilak bokis i sae hangamap na slip.'),
   ('xe', '???')]),
 ('kotokoto',
  [('rt', 'koto'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'cargo'),
   ('ge', 'supplies'),
   ('tkp', 'kago'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Varao voriri kotokotoara.'),
   ('xp', 'Yu buy ol dispela cargo.'),
   ('xe', 'You buy these supplies.'),
   ('ex', 'Rari kotokoto ousia avaroe eisi Buka, aio kitupa kepa iare.'),
   ('xp', 'Rari em i go kisim kago long Buka i kam long sitoa.'),
   ('xe', '???')]),
 ('kotokotoara',
  [('rt', 'kotokoto'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'cargo'),
   ('ge', 'supplies'),
   ('tkp', 'kago'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Varao voriri kotokotoara.'),
   ('xp', 'Yu baiim ol dispela cago.'),
   ('xe', 'You buy these supplies.'),
   ('ex', 'Kotokotoara uriopai uvuoa-ia.'),
   ('xp', 'Ol kago bai i kam long sip.'),
   ('xe', '???')]),
 ('kotopa',
  [('rt', 'koto'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'fishhook'),
   ('ge', 'hook'),
   ('ge', 'hanger'),
   ('tkp', 'huk'),
   ('cmt', 'Sentence fragment?'),
   ('dt', '02/Dec/2004'),
   ('ex', 'Pokokoa eva pitoka kotopa.'),
   ('xp', 'Huk bilong hangamapim sospen.'),
   ('xe', '???')]),
 ('kotopa',
  [('rt', 'koto'),
   ('ps', 'N'),
   ('pt', 'LOC'),
   ('ge', 'hanging from'),
   ('tkp', '???'),
   ('dt', '07/Feb/2005')]),
 ('kotovira',
  [('rt', 'koto'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'hanging'),
   ('tkp', '???'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Atuu kotovira uusipaoi.'),
   ('xp', 'Fly bokis em i slip hangamap.'),
   ('xe', 'The flying fox sleeps hanging.'),
   ('ex', 'Kotovira uusiu.'),
   ('xp', 'Yu slip hangamap.'),
   ('xe', 'You sleep hanging up.'),
   ('ex', 'Kotovira uusipaoi aatu.'),
   ('xp', 'Bilak bokis i hangamap na slip.'),
   ('xe', '???')]),
 ('kotu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'bite'),
   ('tkp', 'kaikai'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Aakova aioa kotupaevoi.'),
   ('xp', 'Mama em i baitim kaikai.'),
   ('xe', 'The mother is biting the food.'),
   ('ex', 'Sera opoa kotupaevoi kakaetoa-pa.'),
   ('xp', 'Sera i wok long kaikai na givim ol liklik hap long pikinini.'),
   ('xe', 'Sera is biting the food for the child.')]),
 ('kotukotu',
  [('rt', 'kotu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'gnash teeth'),
   ('ge', 'grind teeth together'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('rdp', 'full'),
   ('ex', 'Reuriara kuripare va kotukotu para???.'),
   ('xp', 'Em mekim ol tit bilong em.'),
   ('xe', '???'),
   ('ex', 'Rupeiri ora reuriaro kotukotuparevoi.'),
   ('xp', 'Rupeiri i wok long kaikaim ol titi bilong en.'),
   ('xe', '???')]),
 ('kotupiua',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'knee'),
   ('tkp', 'skru bilong lek'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Sara iava kotupiua upiapae.'),
   ('xp', 'Skuru bilong Sara i wok long pen.'),
   ('xe', "Sera's knee hurts.")]),
 ('koturu',
  [('wf', 'koturua'),
   ('rt', 'koturu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'bark'),
   ('tkp', '???'),
   ('eng', 'tree bark'),
   ('dt', '21/May/2005'),
   ('ex', 'Evao koturu v??? oupaiveira kepa kovovopasia.'),
   ('xp', 'Skili blong diwai ol i save kisim blong banisim house.'),
   ('xe', '???'),
   ('ex', 'Koturua siovaraia Kauasi tovareva Raupeto.'),
   ('xp', 'Raupeto i bin palanim Kauasi long skin bilong diwai.'),
   ('xe', 'Raupeto buried Kauasi inside tree bark.')]),
 ('kou',
  [('ps', 'CLASS'),
   ('ge', 'heap'),
   ('tkp', 'hip'),
   ('dt', '04/Sep/2005'),
   ('ex', 'Tesi kokopa kouro tosievoi kepa iava.'),
   ('xp', 'Tesi em i katim plenti bus long haus.'),
   ('xe', '???')]),
 ('kou',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'lay egg'),
   ('ge', 'defecate'),
   ('tkp', '???'),
   ('dcsv', '???'),
   ('am', '???'),
   ('arg', 'O'),
   ('sc', 'EXCRETION'),
   ('vx', '2'),
   ('nt', 'defecate reading considered vulgar'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Kokotu takura koupaoi.'),
   ('xp', 'Kakaruk meri em i mekim ol kiau.'),
   ('xe', 'Hens produce eggs.'),
   ('ex', 'Ragai oiraro kokotu takura kou oi pekuri-ia.'),
   ('xp', 'Kakaruk bilong mi i putim kiau long basket.'),
   ('xe', '???')]),
 ('koue',
  [('ps', 'N'),
   ('pt', 'ANIM'),
   ('ge', 'pig'),
   ('tkp', 'pik'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Ben oiraaro koue karuvera kovo aioevo.'),
   ('xp', 'Pik bilong Ben i kaikaim gaden singapo.'),
   ('xe', "Ben's pig ate the ??? garden."),
   ('ex', 'Koue tarasia avaae eisi vegoaro aiosia.'),
   ('xp', 'Ol i go painim pik long bilong kaikai.'),
   ('xe', '???')]),
 ('koue',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'collect pig to eat'),
   ('tkp', 'kisim pik'),
   ('vx', '???'),
   ('arg', '???'),
   ('cmt', 'Examples needed'),
   ('dt', '08/Jun/2005')]),
 ('koui',
  [('ps', '???'),
   ('ge', 'selfish with food'),
   ('tkp', 'no laik givim kaikai'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Rakea koui aue-ia aio.'),
   ('xp', 'Rake em i makau man long kaikai.'),
   ('xe', '???')]),
 ('koukou',
  [('ps', '???'),
   ('ge', 'Chinese'),
   ('tkp', 'saina'),
   ('tkp', 'hongkong'),
   ('dt', '26/Jan/2005'),
   ('ex', 'Koukou oea katai pua-ia toupaiveira.'),
   ('xp', 'Ol saina ol i save luk wankain.'),
   ('xe', 'All Chinese look alike.')]),
 ('koukouo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'laugh heartily at'),
   ('tkp', 'lap long'),
   ('nt', 'not the generic term'),
   ('cmt', 'Check spelling.'),
   ('sa', 'agesi'),
   ('sc', '???'),
   ('vx', '2'),
   ('arg', 'O'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Rapi, e ragai koukooparivoi?'),
   ('xp', 'Rapi, yu lap long mi?'),
   ('xe', 'Rapi, are you laughing at me?')]),
 ('koukouo',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'laugh heartily'),
   ('tkp', 'lap'),
   ('nt', 'not the generic term'),
   ('cmt', 'Check spelling.'),
   ('sa', 'agesi'),
   ('sc', '???'),
   ('vx', '1'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Viovokoto koukouoparoi.'),
   ('xp', '???'),
   ('xe', 'The boy is laughing heartily.')]),
 ('kova',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'grow'),
   ('eng', 'grow'),
   ('eng', 'mature'),
   ('tkp', 'kamap bikpela'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '10/Nov/2005'),
   ('ex', 'Evaova kovae uuko ovi ruvara-ia.'),
   ('xp', 'Diwai i bin kamap bikpela klostu long wara.'),
   ('xe', 'The tree grew near the stream.'),
   ('ex', 'Evaova kovae uukovi ruvaraia.'),
   ('xp', 'Diwai i kru klostu long wara.'),
   ('xe', '???'),
   ('ex', 'Kasivari kovova rutu oia Ovoteivi.'),
   ('xp', 'Ovoteivi em i meri bilong wok hat tru.'),
   ('xe', 'Ovoteivi is a hard worker.')]),
 ('kovaaro',
  [('rt', 'kova'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'growth of something'),
   ('tkp', '???'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kakau isiro iava kovaara kaepiroi.'),
   ('xp', 'Ol kru bilong kaukau i kamap.'),
   ('xe', '???')]),
 ('kovaeto',
  [('rt', 'kovae'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'stocky fat man'),
   ('tkp', '???'),
   ('dt', '05/Feb/2005'),
   ('ex', 'Kovaeto oirato ira viapau uvuipa ra kovopareve.'),
   ('xp', 'Bikpela na patpela man i no inap long wok.'),
   ('xe', "A fat man, he can't work.")]),
 ('kovaii',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'taro with long root'),
   ('tkp', 'longpela taro'),
   ('pt', 'opo'),
   ('cmt', 'Sentence fragment for ex.'),
   ('dt', '12/Aug/2005'),
   ('ex', 'Kovaii kuio aue opo.'),
   ('xp', 'Longpela taro.'),
   ('xe', '???')]),
 ('kovakovara',
  [('rt', 'kova'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'shoots of plant'),
   ('tkp', '???'),
   ('cmt', 'Why is it plural with singular agreement?'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Uutu kovo iava kovakovaara kaepiroi.'),
   ('xp', 'Ol kru bilong gaden yam i sut i kam antap.'),
   ('xe', '???')]),
 ('kovapato',
  [('rt', 'kova'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'stomach'),
   ('tkp', 'bek bilong bel'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Koue iava kovapato, uva aioara toupaiveira.'),
   ('xp', 'Bel bilong pik we ol kaikai i save istap.'),
   ('xe', '???')]),
 ('kovarato',
  [('rt', 'kova'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'shoots of plant or tree coming from where cut was made'),
   ('ge', 'sprouts from cut'),
   ('tkp', '???'),
   ('nt', 'Plural is kovakovara.'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kasiava iava kovarato putepare eisi Ruruu.'),
   ('xp', 'Han bilong diwai kanu i antap tru long Ruruu.'),
   ('xe', '???')]),
 ('kovarua',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'bushes'),
   ('ge', 'foliage'),
   ('tkp', 'hap bus'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kovaruara upiriko kovo ruuvo.'),
   ('xp', 'Bus i karamapim gaden kaukau.'),
   ('xe', '???')]),
 ('kovasi',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'pregnant'),
   ('tkp', '???'),
   ('nt', 'dog or pig only'),
   ('sc', '???'),
   ('vx', '1'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Koie kovasioi iria kakae kavaupaoi rara.'),
   ('xp', '???'),
   ('xe', 'The pig is pregnant and she will farrow later.'),
   ('ex', 'Koue kovasioi ra kakaekarevai kavaueve rara.'),
   ('xp', 'Pik i gat bel nau, na bai em i bonim sampela pikinini pik.'),
   ('xe',
    'The pig is pregnant and will eventually give birth to some piglets.')]),
 ('kovata',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'thrilled'),
   ('ge', 'happy'),
   ('tkp', 'lap hepi'),
   ('dt', '01/Jun/2005'),
   ('cmt', 'Can you specify what one is happy about?'),
   ('sc', 'EMOTION'),
   ('vx', '1'),
   ('ex', 'Kovatapau.'),
   ('xp', 'Yu amamas tru.'),
   ('xe', 'You are smiling.'),
   ('ex', 'Pita kovatairaoparoi agesipaoro.'),
   ('xp', 'Pita i amamas tru na em i lap.'),
   ('xe', 'Pita is very happy and is laughing.'),
   ('ex', 'Oirara kovatae rutu ovusia voea agesipieavo.'),
   ('xp', 'Ol man i amamas tru taim mi mekim ol i lap.'),
   ('xe', 'Everyone was thrilled because I made them laugh.')]),
 ('kovatavira',
  [('rt', 'kovata'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'thrilled'),
   ('ge', 'happy'),
   ('tkp', 'amamas'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Riakova kovatavira vurapae.'),
   ('xp', '???'),
   ('xe', 'The woman is looking on with a smile.'),
   ('ex', 'Jois kovatavira rutu vurapae ragai-va.'),
   ('xp', 'Jois i amamas tru na em i lukluk long mi.'),
   ('xe', '???')]),
 ('kovato',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of banana'),
   ('tkp', '???'),
   ('nt', 'strong-smelling when ripe'),
   ('pt', 'itoo'),
   ('dt', '12/Aug/2005'),
   ('ex', 'Kovato itoto ira vioroparoveira ra rera aiopaive.'),
   ('xp', 'Banana i save mao na ol i sae kaikai.'),
   ('xe', '???')]),
 ('kovauke',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'type of banana'),
   ('tkp', '???'),
   ('nt', 'short stubby eating type'),
   ('pt', 'itoo'),
   ('dt', '12/Aug/2005'),
   ('ex', 'Kovauke iria vioropaoveira ra oira aiopaive.'),
   ('xp', 'Banana i save mao na ol i save kaikai.'),
   ('xe', '???')]),
 ('Kovava',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'cave near Ibu'),
   ('tkp', 'wanpela hol ston'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kovava kepa oa eisi toupaiveira Ibu-ia ruvaraia.'),
   ('xp', 'Hol Kovava istap klostu long Ibu.'),
   ('xe', '???')]),
 ('kove',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'fall'),
   ('ge', 'drop'),
   ('tkp', 'pundaun'),
   ('vx', '1'),
   ('sc', '???'),
   ('nt', 'also used to describe laying eggs'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Teapi ipau ra ora kepioro ita koveuvere vorevira.'),
   ('xp', '???'),
   ('xe', "It wouldn't be good if you climbed up and fell down again."),
   ('ex', 'Avaisisi koveroe rikui-ia vokiaro ravapaoro.'),
   ('xp', 'Avaisisi em i pundaun long hul long nait taim em lukaut kapul.'),
   ('xe', 'Avaisisi fell into the hole at night while searching for possums.'),
   ('ex', 'Tavuteua viorooro kove.'),
   ('xp', 'Mango i mau na em i pundaun.'),
   ('xe', 'Fell down, being mature. (???)'),
   ('ex', 'Erik aitearo koveroe kepa iava.'),
   ('xp', 'Papa bilong Erik em i pundaun long haus.'),
   ('xe', "Eric's father fell down off the house.")]),
 ('kove',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'fell'),
   ('tkp', 'pundaunim'),
   ('vx', '2'),
   ('arg', 'O'),
   ('cmt', "What's the difference between transitive kove and kovepie?"),
   ('dt', '17/Oct/2005')]),
 ('kovea',
  [('rt', 'kove'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'pocket'),
   ('tkp', 'bak'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kovea-ia toupaivoi monia.'),
   ('xp', 'Moni istap long poket.'),
   ('xe', '???')]),
 ('kovekove',
  [('rt', 'kove'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'drip repeatedly'),
   ('tkp', 'pundaun planti'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '07/Sep/2005'),
   ('rdp', 'full'),
   ('ex', 'Uva gauoviro oisiri kovekovepaepa kakate sovara-ia.'),
   ('xp', '???'),
   ('xe', 'The tears dripped into the bamboo tube.')]),
 ('koveoapa',
  [('rt', 'kove'),
   ('ps', '???'),
   ('ge', 'kit bag'),
   ('tkp', 'kikbek'),
   ('cmt', 'Sentence fragment for ex. sentence.d'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Koeoapa Susan varo.'),
   ('xp', 'Bilum bilong Susan.'),
   ('xe', '???')]),
 ('koveva',
  [('rt', 'kove'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'sack'),
   ('ge', 'string bag'),
   ('tkp', 'bilum'),
   ('tkp', 'bek'),
   ('dt', '03/Dec/2004')]),
 ('Kovia',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'Kovia'),
   ('tkp', 'Kovia'),
   ('dt', '26/Jan/2005'),
   ('ex', 'Oirato vaisiaro Kovia.'),
   ('xp', 'Nem bilong man em long Kovia.'),
   ('xe', "The man's name is Kovia.")]),
 ('kovikoro',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'type of bush frog'),
   ('tkp', '???'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kovikoro iria gaupaeveira ovusia vokipape.'),
   ('xp', 'Em i wanpela kain rokrok i save krai taim i laik tudak.'),
   ('xe', '???')]),
 ('kovire',
  [('ps', '???'),
   ('ge', 'whitewash'),
   ('tkp', 'kambang'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kovirea rasia oa popotevira toupaiveira.'),
   ('xp', 'Dispela em i waitpela giraun.'),
   ('xe', '???')]),
 ('kovirea',
  [('rt', 'kovire'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'whitewash'),
   ('ge', 'white-colored earth'),
   ('tkp', '???'),
   ('nt',
    "White colored earth found near old Togarao one half hour's walk from ???"),
   ('dt', '26/Jan/2005')]),
 ('kovo',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'garden'),
   ('ge', 'work'),
   ('tkp', 'gaden'),
   ('tkp', 'wok'),
   ('eng', 'garden'),
   ('eng', 'work'),
   ('dt', '19/May/2005'),
   ('ex', 'Ragai kovopaavoi vo araisi kovo-ia.'),
   ('xp', 'Mi wok long gaden rais.'),
   ('xe', '???')]),
 ('kovo',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'work'),
   ('tkp', 'wok'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('cmt', 'Can this take an object? Need to check.'),
   ('dt', '10/Nov/2005'),
   ('ex', 'Ragai vo reoaro uvupaoro kovopareve.'),
   ('xp', '???'),
   ('xe', 'Hearing my talk (being obedient), he would be working.')]),
 ('kovo',
  [('ps', 'CLASS'),
   ('ge', 'garden'),
   ('tkp', 'gaden'),
   ('dt', '04/Sep/2005'),
   ('ex', 'Aio kovoro tapo oisoa tekiive vosia kopiiro oirato.'),
   ('xp', 'Taim man em i dai, ol i save bagarapim ol gaden kaikai bilong em.'),
   ('xe',
    'If a man would died, they would always destroy his gardens also.')]),
 ('kovoa',
  [('rt', 'kovo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'garden'),
   ('ge', 'work'),
   ('ge', 'week'),
   ('ge', 'day'),
   ('tkp', 'gaden'),
   ('tkp', 'wok'),
   ('tkp', 'wik'),
   ('tkp', 'de'),
   ('dt', '04/Sep/2005'),
   ('ex', 'Kovoa eva Justin vo kovoaaro.'),
   ('xp', 'Em gaden o wok bilong Justin.'),
   ('xe', "That's Justin's garden.")]),
 ('kovokovo',
  [('rt', 'kovo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'fence off'),
   ('ge', 'surround'),
   ('tkp', 'banisim'),
   ('tkp', 'susap'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Oirato kepa kovokovoreva kasura-ia koie kare asavira.'),
   ('xp', '???'),
   ('xe', 'The man surrounded his house with a fence against (to keep out).'),
   ('ex', 'Kokaipa icepa kovokovosia ra voa toupaive.'),
   ('xp', 'Banisim house kakaruk bai ol i stap insait.'),
   ('xe', '???'),
   ('ex', 'Vao kepa kovokovosia uriou ari uteoparai.'),
   ('xp', 'Yo kam banis haus em i kolor tumas.'),
   ('xe', '???'),
   ('ex', 'Oirara rutu kepa kovokovopaivoi sipito vo kepaaro.'),
   ('xp', 'Olgeta man bai banisim haus bilong chief.'),
   ('xe', 'Everyone is going to fence the house of the chief.')]),
 ('kovokovo',
  [('rt', 'kovo'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'play'),
   ('eng', "play Jew's Harp"),
   ('tkp', 'pilai chusap'),
   ('vx', '1'),
   ('dt', '22/Feb/2006'),
   ('ex', 'Kovokovo-ia puraiparoe avukato.'),
   ('xp', 'Lapun man i bin wok long pilai long mambu paip.'),
   ('xe', '???')]),
 ('kovokovo',
  [('rt', 'kovo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', "Jew's.harp"),
   ('eng', "Jew's Harp"),
   ('tkp', 'chusap'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Kovokovo-ia piraiparoe avukato.'),
   ('xp', 'Lapun man i bin wok long pilai long mambu paip.'),
   ('xe', "The old man played the Jew's harp.")]),
 ('kovokovoa',
  [('rt', 'kovokovo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'fence'),
   ('tkp', 'banis'),
   ('cmt', 'Check original for subject of 3rd sentence.'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Kepaia kovovao tovori va kakeoro.'),
   ('xp', 'Yu putim banis long house.'),
   ('xe', 'You are putting up a fence around the house.'),
   ('ex', 'Kovovoa kepa iava kovovoa.'),
   ('xp', 'Dispela em banis blong house.'),
   ('xe', 'This is a house fence.'),
   ('ex', 'Kovokovoa purapai ??? koue kare-re.'),
   ('xp', 'Ol youth ol i wokim banis bilong ol pik.'),
   ('xe', '???')]),
 ('kovopaa',
  [('rt', 'kovo'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'tool'),
   ('tkp', 'tul'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Oaravu rutu kovopaara kepa siovaraia toupai.'),
   ('xp', 'Ol tuls bilong wok istap insait long haus.'),
   ('xe', 'All of the tools are in the house.')]),
 ('kovopato',
  [('rt', 'kovo'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'laborer'),
   ('ge', 'worker'),
   ('tkp', 'wokman'),
   ('avm', 'pa_required'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Pita kovopato vo kepaio.'),
   ('xp', 'Pita man blong work long house.'),
   ('xe', 'Peter is a house-builder.'),
   ('ex', 'Pol a ragai reraro kovopato.'),
   ('xp', 'Pol em i wokman bilong mi.'),
   ('xe', 'Paul is my workman.')]),
 ('kovopie',
  [('rt', 'kovo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make work'),
   ('eng', 'make work'),
   ('eng', 'use to make work'),
   ('tkp', 'yusim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '16/Nov/2005'),
   ('cmt', "What's the locative noun doing in the 2nd sentence?"),
   ('ex', 'Monia kovopie kakau vorioro.'),
   ('xp', 'Yu mekim wok dispela moni long baiim kokoa.'),
   ('xe', 'You make money by buying cocoa.'),
   ('ex', 'Pol monia kovopiepare kakau voripaoro va-ia.'),
   ('xp', 'Pol i mekim wok moni taim em i baim kakau.'),
   ('xe', 'Paul is making money buying cocoa with it.'),
   ('ex', 'Sera vaio ora Rarasori ragai kovopiesivo.'),
   ('xp', 'Sera wantaim Rarason i givim wok long mi.'),
   ('xe', 'Sera and Robinson gave me work.')]),
 ('kovoruko',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'white-bark tree'),
   ('tkp', '???'),
   ('dt', '30/Aug/2005'),
   ('ex', 'Evaova popotevira rakaripapeira kovurukoua.'),
   ('xp', 'Em wampela diwai skin blong em i save white.'),
   ('xe', 'That is a tree whose skin is white.')]),
 ('kovoto',
  [('rt', 'kovo'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('dx', 'Central'),
   ('ge', 'big.garden'),
   ('ge', 'big garden'),
   ('tkp', 'bikpela gaden'),
   ('avm', 'pa_barred'),
   ('dt', '02/Sep/2005')]),
 ('kovovo',
  [('rt', 'kovo'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'fence'),
   ('ge', 'protect'),
   ('tkp', 'banisim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Kepa kovovo va utoro.'),
   ('xp', 'Yo banisim house bikol?? Em kolor tumas.'),
   ('xe', '???'),
   ('ex', 'Koue karen kovovo sapi kovoa aioive.'),
   ('xp', 'You mekim fence long gut of ikaikaim garden.'),
   ('xe', '???'),
   ('ex', 'Koue kare kovovori ikauvira teapi kosiaviro kovoara aiosia.'),
   ('xp',
    'Yu banisim kwik ol pik, nogut ol i kam arasait na kaikaim ol gaden.'),
   ('xe', 'Fence the pigs quickly lest they escape and eat the gardens.')]),
 ('kovuaka',
  [('alt', 'kovoaka'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'roof'),
   ('tkp', 'rup'),
   ('dt', '05/Feb/2005'),
   ('ex',
    'Kepa-ia konoaka avo kovuako puropai aue iava tetevu guruvaro kepa iare.'),
   ('xp', 'Roof blong house/ol I woekim blong house long ol sago levevis?? .'),
   ('xe', '???')]),
 ('kovuaro',
  [('rt', 'kovu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('eng', 'middle section'),
   ('eng', 'center section'),
   ('ge', 'middle'),
   ('tkp', 'namel'),
   ('dt', '10/May/2005'),
   ('ex', 'Pita rakoru tokorevo kovuaro-va.'),
   ('xp', 'Pita i katim snek namel.'),
   ('xe', 'Pita cut the snake from the middle.')]),
 ('kovukovu',
  [('rt', 'kovu'),
   ('ps', '???'),
   ('ge', 'false'),
   ('ge', 'fake'),
   ('ge', 'untrue'),
   ('tkp', 'giaman'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Riro kovukovu reotoa rutu vii.'),
   ('xp', 'Yu man bilong toktok nating.'),
   ('xe', '???')]),
 ('kovukovuto',
  [('rt', 'kovukovu'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'tree'),
   ('tkp', 'wanpela rain diwai'),
   ('sf', 'FLORA'),
   ('nt', 'tall fast-growing type with pithy center'),
   ('dt', '14/Nov/2005'),
   ('ex',
    'Kovukovuto riro kaekaepavira riparoveira. Vosia evakoro ra rivivuroviro sovara iava.'),
   ('xp',
    'I em wanpela kan diwai em i save gro hariap. Taim em i dai nau bai i gat hol insait.'),
   ('xe', 'The ??? grows very long. When it ???, it is ??? inside.'),
   ('ex', 'Kovukovuto riro ikaupa riroto.'),
   ('xp', 'Plant i save gro kwik tru.'),
   ('xe', 'The ??? plant is a really fast grower.')]),
 ('kovukovuvira',
  [('rt', 'kovukovu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'falsely'),
   ('tkp', 'olsem giaman'),
   ('dt', '03/Dec/2004'),
   ('ex', 'Kovukovuvira reoreopau, Leo.'),
   ('xp', 'Leo, toktok bilong yu i no tru.'),
   ('xe', '???')]),
 ('kovuru',
  [('ps', '???'),
   ('ge', '???'),
   ('eng', '???'),
   ('tkp', '???'),
   ('dt', '13/Nov/2005'),
   ('ex',
    'Veeta tou kovururi eva evoa uva vo raga urio vasipae kovoa tasiasipaoro.'),
   ('xp',
    'Yu putim akros mambu ia bikos ol i wok long kam olsem nabaut na kurukutim gaden.'),
   ('xe', '???')]),
 ('kovurui',
  [('alt', 'kovuru'),
   ('alt', 'kovuruva'),
   ('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'rafter'),
   ('eng', 'rafter'),
   ('eng', 'eave of roof'),
   ('tkp', 'sparen'),
   ('dt', '25/Jul/2005'),
   ('ex', 'Kovurup kepai kovurui tovori kovuakare.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Kovurui ivaraia kovuakaua tovota.'),
   ('xp', 'Putim ridge kap antap long rapta.'),
   ('xe', '???')]),
 ('kovuruko',
  [('alt', 'kuvuruko'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'carved decorative marks on arrows'),
   ('ge', 'decoration on arrow shaft'),
   ('tkp', '???'),
   ('am', 'true'),
   ('dt', '13/May/2005'),
   ('cmt', 'Are kovuruko and kuvuruko both acceptable pronunciations?'),
   ('ex', 'Kovurukopava rakoru kekeavo.'),
   ('xp', 'Mi lukim sinek i gat makmak.'),
   ('xe', 'I am looking at the decorated snake.'),
   ('ex', 'Vao kovuruko evaoa va vearo kekepe.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Kearitou kovuruko eva vauruvare.'),
   ('xp', 'Yu decoraturem dispela blong wokim spear.'),
   ('xe', '???'),
   ('ex', 'Kovuruko ova evaova popote rakari ova.'),
   ('xp', 'Diwai em sking bilong en i wait.'),
   ('xe', '???'),
   ('ex', 'Kuvuruko-pa tou keari tou.'),
   ('xp', 'Pitpit i gat makmak long en.'),
   ('xe', '???')]),
 ('kovurukovira',
  [('alt', 'kovurukopavira'),
   ('rt', 'kovuruko'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'striped'),
   ('tkp', '???'),
   ('nt', 'The meaning, many stripes, is marked by the reduplicated form'),
   ('dt', '30/Aug/2005'),
   ('ex', 'Auero rutu kovurukopavira toupaivo.'),
   ('xp', 'Olgeta samting i gat makmak istap.'),
   ('xe', '???')]),
 ('kovuruvira',
  [('rt', 'kovuru'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'at a right angle'),
   ('ge', 'cross-wise'),
   ('ge', 'horizontal'),
   ('tkp', '???'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Kovuruvira kaeu.'),
   ('xp', 'Karim dispela a cross.'),
   ('xe', '???'),
   ('ex', 'Kovuruvira vao tovo.'),
   ('xp', 'Putim dispela across.'),
   ('xe', '???'),
   ('ex', 'Jon kovuruvira vokaparevoi raiva-ia.'),
   ('xp', 'Jon wokabaut krosim rot i go.'),
   ('xe', 'John walked crossing the road.')]),
 ('kovuto',
  [('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'abdomen'),
   ('ge', 'stomach'),
   ('ge', 'diarrhea'),
   ('tkp', 'bel'),
   ('tkp', 'pekpek wara'),
   ('dt', '26/Jan/2005'),
   ('ex', 'Oirato iava kovuto.'),
   ('xp', 'Bel blong man.'),
   ('xe', '???'),
   ('ex', 'Rakova iava kovuto.'),
   ('xp', 'Bel blong meri.'),
   ('xe', '???'),
   ('ex', 'Vii iava kovuto.'),
   ('xp', 'Bel blong yu.'),
   ('xe', '???'),
   ('ex', 'Varesia riro kovuto.'),
   ('xp', 'Varsia em i bikbel.'),
   ('xe', '???')]),
 ('kuara',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'yell at'),
   ('tkp', '???'),
   ('cm', '-va'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Rera-va kuarapaeva.'),
   ('xp', '???'),
   ('xe', 'She yelled at him.'),
   ('ex', 'Potaki-va kuarapaivo ovusia taviparoe.'),
   ('xp', 'Ol i wok long singaut bikmaus long Potaki taim em i toksave.'),
   ('xe', 'They are yelling at Potaki while he talks.')]),
 ('kue',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'bear.fruit'),
   ('tkp', 'karim pikinini bilong diwai'),
   ('eng', 'reproduce'),
   ('eng', 'bear fruit'),
   ('cmt', 'In the second example, is it vo or va?'),
   ('vx', '1'),
   ('dt', '07/Jun/2005'),
   ('ex', 'Opitato kueroi.'),
   ('xp', 'Kokonas i karim nau.'),
   ('xe', 'The coconut tree is bearing fruit.'),
   ('ex', 'Vosa kakao kueo, ra apeisi vo kuepiea.'),
   ('xp', '???'),
   ('xe', 'When the cocoa bears fruit, then what do I do with the fruit?')]),
 ('kuea',
  [('alt', 'kue'),
   ('rt', 'kue'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'fruit'),
   ('tkp', 'prut pikinini'),
   ('dt', '10/May/2005'),
   ('ex', 'Opita kuea vaoi vosa evaova kuepe.'),
   ('xp', 'Sapos diwai em isat pa fruit.'),
   ('xe', '???'),
   ('ex', 'Opitatoa-ia kuea vavo.'),
   ('xp', 'Kokonas i karim pinis nau.'),
   ('xe', '???')]),
 ('kuga',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'bump into'),
   ('ge', 'nudge'),
   ('tkp', '???'),
   ('cm', '-ia'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Vii-ia kugapaavoi.'),
   ('xp', '???'),
   ('xe', 'I am bumping into you.'),
   ('ex', 'Siraovira vii-ia kugaavoi.'),
   ('xp', 'Sori tru, mi bam long yu.'),
   ('xe', '???')]),
 ('kui',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'semen'),
   ('tkp', 'melek'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kui oiratoa-ia toupai ora riakova.'),
   ('xp', 'Semen em istap long man na meri.'),
   ('xe', '???')]),
 ('kuio',
  [('ps', 'CLASS'),
   ('ge', 'round'),
   ('tkp', '???'),
   ('eng', 'group of round objects'),
   ('dt', '02/Jun/2005'),
   ('ex', 'Kuiorei.'),
   ('xp', '???'),
   ('xe', 'Two round objects.'),
   ('ex', 'Riro kuio rutu vao opo kuio.'),
   ('xp', 'Triapela raunpela taro.'),
   ('xe', '???')]),
 ('kuioi',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'taro with round root'),
   ('tkp', 'raunpela taro'),
   ('pt', 'opo'),
   ('dt', '12/Aug/2005'),
   ('ex', 'Opo kuioi ragaipa oo kuioi aue opo.'),
   ('xp', 'Yu givim taro.'),
   ('xe', '???'),
   ('ex', 'Ragai-pa okuioi opo.'),
   ('xp', 'Yu givim mi wanpela taro.'),
   ('xe', '???')]),
 ('kuiopesi',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'frog'),
   ('tkp', '???'),
   ('nt', 'small yellow body, pictured on PNG postage stamp, lives on leaves'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kuiopesi iria toupaeveira eisi vegoaro.'),
   ('xp', 'Liklik rokrok i save istap long bus.'),
   ('xe', '???')]),
 ('kuiopetu',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of frog'),
   ('tkp', '???'),
   ('nt', 'Small variety. This was pictured on the 10t PNG postage stamp.'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kuiopetu iria toupae 10 toia stamp-ia vao PNG.'),
   ('xp', 'Rokrok poto bilong en istap long 10 toia PNG stamp.'),
   ('xe', '???')]),
 ('kuito',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'type of ant'),
   ('tkp', '???'),
   ('nt', 'body is blue-black, like army ant'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Iravivu ita avarato rera vaisiaro kuito.'),
   ('xp', 'Wanpela kain anis kolim long kuito.'),
   ('xe', '???')]),
 ('kuka',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'shackle'),
   ('ge', 'handcuffs'),
   ('tkp', 'hankap sagoi'),
   ('dt', '04/Dec/2004'),
   ('ex',
    'Polis kuka tovoiva ira vavaearoi vosia vuria purareve kuka tovoivo John vare rivo kaureotoa.'),
   ('xp', 'Polis i putim hankuf long han bilong man sapos em i bikhet.'),
   ('xe', '???'),
   ('ex', 'Kuka tovopaiveira oirara-ia kaureoirara.'),
   ('xp', 'Polis i save putim ankap long ol bikhet man.'),
   ('xe', '???')]),
 ('kukara',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'corn'),
   ('ge', 'cane'),
   ('tkp', 'maispitpit'),
   ('nt', 'wilde type'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Saimon riro kukara aioto.'),
   ('xp', 'Saimon man bilong kaikai kon.'),
   ('xe', '???')]),
 ('kukauviro',
  [('rt', '--'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'deteriorate'),
   ('tkp', 'i laik bruk'),
   ('tkp', 'pundaun nating'),
   ('vx', '1'),
   ('cmt', "Is viro really part of the stem? If so, it's been lexicalized."),
   ('dt', '22/Nov/2005'),
   ('ex',
    'Kukauviro opeita evoa torepau vlaraim sapi kukauviro vo-re etokasi.'),
   ('xp', 'Nogut yu pundaun i go long paia.'),
   ('xe', '???'),
   ('ex', 'Teapi kukauviro eisi-re vagapa.'),
   ('xp', 'Nogut yu pundaun i go long watpal.'),
   ('xe', "Don't fall down the waterfall."),
   ('ex', 'Raka teapi kukauviro evoa-re rikui.'),
   ('xp', 'Raka, nogut yu pundaun i go long hol.'),
   ('xe', "Raka, don't fall down into the hole."),
   ('ex',
    'Avukato kukauviroragaparoi kopiioro uvare rera viapau goruavai oa iava kukarovi vovokio.'),
   ('xp',
    'Lapun man bai pundaun nating na bai em dai bikos em nogat strong olsem na em i pundaun nating tude.'),
   ('xe',
    "The old man is ??? dying because he has no strength and that's why he ??? today.")]),
 ('kukiuki',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'shake something'),
   ('ge', 'rattle something'),
   ('tkp', 'meknoisim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Moniara kukiukiri bokis sovara-ia.'),
   ('xp', '???'),
   ('xe', 'Shake the money inside the box.'),
   ('ex', 'Moniara kukiukiri raka-pa tou siovaraia.'),
   ('xp', 'Yu sekim ol moni insait long tin.'),
   ('xe', '???')]),
 ('kukiukia',
  [('rt', 'kukiuki'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'rattles for play'),
   ('tkp', 'salamon bilong pilai'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Pita kukiukia vo evaova-ia, osia voa touparevo.'),
   ('xp', 'Pita istap long diwai na mi sekim em.'),
   ('xe', '???')]),
 ('kuku',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'spoonfeed'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('nt',
    'To feed a person incapable of feeding himself. Used describing a pastor who feeds the people from the Word of God.'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Rusa kakaeto kukupaevo gisipo-ia.'),
   ('xp', 'Rusa i givim kaikai long pikinini long maus.'),
   ('xe', '???')]),
 ('kukua',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'galip nut'),
   ('tkp', 'galip'),
   ('tkp', 'kasang'),
   ('eng', 'Java nut'),
   ('eng', 'galip nut'),
   ('eng', 'ngali nut (Canarium indicum)'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Vutara-ia kuepapeira aue kukua vo ivaraia.'),
   ('xp', 'Garip i save karim wanpela taim insait long yia.'),
   ('xe', '???')]),
 ('kukuaua',
  [('rt', 'kukua'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'candle'),
   ('ge', 'light made from sap of Tahitian chestnut'),
   ('tkp', 'kandel bilong wara bilong galip'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kukuaua oa aviavipaiveira rara va rukuepaive.'),
   ('xp', 'Kendol i save lait, sapos ol i laitim.'),
   ('xe', '???')]),
 ('kukue',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'head'),
   ('eng', 'head'),
   ('eng', 'skull'),
   ('tkp', 'het'),
   ('dt', '17/Nov/2005'),
   ('ex',
    'Riko siopaipaavo ovusia Rarasiori tapo pauparoe uvare kukuvaipaa tovoparevo vo kukue.'),
   ('xp',
    'Mi no wok long luksave long Riko taim em i wok long sindaun wantaim Rarason bikos em iputim kep long het bilong em.'),
   ('xe',
    "I didn't recognize Riko when he was seated with Robinson because he put a hat on his head.")]),
 ('kukue pute',
  [('rt', 'akotei'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'type of ant'),
   ('tkp', 'anis'),
   ('nt', 'nickname'),
   ('sa', 'akotei'),
   ('dt', '04/Feb/2004')]),
 ('kukuepaa',
  [('rt', 'kukue'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'hat'),
   ('tkp', 'hat'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kukuepa/rageipa kukuepa vate ataria.'),
   ('xp', 'Yu givim hap hat blong fish.'),
   ('xe', '???'),
   ('ex', 'Ragai-pa kukuepaa vateri sipoia.'),
   ('xp', 'Yu givim mi het bilong suka.'),
   ('xe', '???')]),
 ('kukuku',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'headwater of river'),
   ('tkp', 'het bilong wara'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Uko kukuku.'),
   ('xp', 'Head blong wara.'),
   ('xe', '???'),
   ('ex', 'Vouva ukovi parupaivo kukuku.'),
   ('xp', 'Long hap wara em stoter?? igi.'),
   ('xe', '???'),
   ('ex', 'Ivitu kukuku iava parupae eisi pukuia.'),
   ('xp', 'Het bilong Ivitu istap long maunten.'),
   ('xe', '???')]),
 ('kukupira',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'type of ground'),
   ('tkp', '???'),
   ('nt', 'Loamy clay ground that is hard when dry but very ???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kukupira rasi vearopai, atari-pa rovu purasia.'),
   ('xp', 'Hap retpela giraun i gutpela long wokim pis pon.'),
   ('xe', '???')]),
 ('kukuriko',
  [('ps', '???'),
   ('ge', 'head'),
   ('ge', 'most important part'),
   ('tkp', '???'),
   ('vx', '???'),
   ('dt', '19/Mar/2006'),
   ('ex', 'Oara rutu-pa kukurikopai.'),
   ('xp', '???'),
   ('xe', 'It is the most important part of all.'),
   ('ex', 'Kokioto era evoa kukuriko-ia.'),
   ('xp', 'Pisin i stap long het bilong diwai.'),
   ('xe', '???')]),
 ('kukurikoto',
  [('rt', 'kukuriko'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'head of tree, talk, or thought'),
   ('ge', 'crown of tree'),
   ('tkp', '???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kukurikoto evaovaiava.'),
   ('xp', 'Head blong diwai.'),
   ('xe', '???'),
   ('ex', 'Korato varo, eisi kukurikoto.'),
   ('xp', 'Kapul istap long het bilong diwai.'),
   ('xe', '???')]),
 ('kukusiri',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'midrib of sago frond'),
   ('tkp', 'nok'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Te?? tevu guruvaiava kukusiri.'),
   ('xp', 'Brum blong saksak.'),
   ('xe', '???'),
   ('ex', 'Kukusiri tetevu guruva iava.'),
   ('xp', 'Bun bilong lip saksak.'),
   ('xe', '???')]),
 ('kukutauvu',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'parrot'),
   ('tkp', '???'),
   ('eng', 'Superb Fruit Dove (Ptilinopus superbus)'),
   ('eng', 'Claret-breasted Fruit Dove (Ptilinopus viridis)'),
   ('nt', 'female of visooto'),
   ('cmt', 'Check spelling'),
   ('sa', 'uvuutu'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kukutavu oisio raga oira osia uruvau.'),
   ('xp', 'Pisin i olsem tasol barus.'),
   ('xe', '???')]),
 ('kukutu',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'coconut'),
   ('tkp', 'drip'),
   ('nt', 'immature green without juice'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Opita kukutu ukoara viapau opitatai aiosa.'),
   ('xp', '?? Kokos em nara tasol no gat meat blong kaikai.'),
   ('xe', '???'),
   ('ex', 'Kura kopukupuva osa kokovaraviratoupaevo oira oruparo.'),
   ('xp', 'Grass blong em em blue na em istap green.'),
   ('xe', '???'),
   ('ex', 'Kukutuisi vaisio aisia viapau aue uuko.'),
   ('xp', 'Liklik kokonas i nogat wara istap.'),
   ('xe', '???')]),
 ('kukuuku',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make footfall'),
   ('tkp', 'krungutim'),
   ('cmt', 'Rethink translation'),
   ('dt', '16/Nov/2005'),
   ('vx', '1'),
   ('ex', 'Oirato kukuukupaoro torirevo oa iava kaakaukare rera-va tarioae.'),
   ('xp', 'Man i mekim nois olsem na ol dok ronim em.'),
   ('xe', '???'),
   ('ex', 'Kikisi kukuukupavoi.'),
   ('xp', 'Bal em i bam long graun.'),
   ('xe', '???'),
   ('ex', 'Oirato era ira kukuukupare ira uvupavoi.'),
   ('xp', 'Mi harim man i wokabaut i kam.'),
   ('xe', '???'),
   ('ex', 'Pita kukuukupieoro puterevo kepa-ia.'),
   ('xp', 'Pita i mekim nois taim em i wokabaut klostu long haus.'),
   ('xe', '???')]),
 ('kukuukua',
  [('rt', 'kukuuku'),
   ('ps', '???'),
   ('ge', 'sound of footsteps'),
   ('tkp', 'nois bilong iek'),
   ('dt', '04/Dec/2004'),
   ('ex',
    'Vosa vokapareve rairavu vuri osa kukuukua piepare vokapaoro oirato kukuu piepare.'),
   ('xp', 'Taim yu harim mom em woik baut.'),
   ('xe', '???')]),
 ('kukuukupie',
  [('rt', 'kukuuku'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make footfall'),
   ('tkp', 'krungutim'),
   ('vx', '???'),
   ('cmt', 'Can it take an object?'),
   ('dt', '22/Nov/2005'),
   ('ex', 'Pita kukuukupieoro puterevo kepa-ia.'),
   ('xp', 'Pita i mekim nois taim em i wokabaut klostu long haus.'),
   ('xe', '???')]),
 ('kukuuvua',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'roof'),
   ('tkp', '???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kepa kukuuvua-ro.'),
   ('xp', 'Roof blong hause.'),
   ('xe', '???'),
   ('ex', 'Kepa-ia kukuuvua tovota.'),
   ('xp', 'Yupela putim rup antap long haus.'),
   ('xe', '???')]),
 ('kukuvai',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'shelter head'),
   ('ge', 'cover the head from rain or sun'),
   ('tkp', 'karamapim het'),
   ('cmt', 'Check classification? Class A or B? Is it labile?'),
   ('vx', '???'),
   ('arg', '???'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Kukue kukuvairi ambrela-ia.'),
   ('xp', '???'),
   ('xe', 'Shelter you head with the umbrella.'),
   ('ex', 'Pita, kukuvaiu ari kokeva koveoi.'),
   ('xp', 'Pita, yu karamap nau ren i pundaun.'),
   ('xe', "Pita, cover up because it's raining.")]),
 ('kukuvaipaa',
  [('rt', 'kukuvai'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'hat'),
   ('tkp', 'hat'),
   ('cmt', 'Sentence fragment'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Porisito vaaro kukuvaipa.'),
   ('xp', 'Hat bilong polis.'),
   ('xe', 'Police hat.')]),
 ('kukuvita',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('cl', 'isi'),
   ('ge', 'tree fern'),
   ('tkp', '???'),
   ('nt', 'small with edible new shoots'),
   ('sa', 'kepikepiisi'),
   ('sf', 'FAUNA'),
   ('dt', '12/Feb/2005'),
   ('cmt', 'Is vegaropa isi really a word? Come on, really?'),
   ('ex', 'Kukuvita isi vegoaropa isi aisia-ia kupeiara toupaiveira.'),
   ('xp', 'Em wanpela kain plant i save gat ol miupela sut.'),
   ('xe', '???')]),
 ('kukuvua',
  [('rt', 'kukuvu'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'roof cap'),
   ('tkp', 'rup rabun'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kepa iava kukuvua.'),
   ('xp', 'Rup bilong haus.'),
   ('xe', '???')]),
 ('Kuokuo',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'name'),
   ('tkp', 'nem'),
   ('cmt', 'Sentence fragment'),
   ('dt', '26/Jan/2005'),
   ('ex', 'Kuokuo riakova tugarava vegoaropava.'),
   ('xp', 'Masalai meri bilong bus.'),
   ('xe', '???')]),
 ('kupare',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'smoke'),
   ('eng', 'smoke'),
   ('eng', 'produce smoke'),
   ('tkp', 'mekim smok'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '27/Sep/2006'),
   ('ex', 'Etokasi kupareepa.'),
   ('xp', '???'),
   ('xe', 'The fire smoked.'),
   ('ex', 'Toki kupareparoveira vokiara rutu-ia.'),
   ('xp', 'Bagana i save smuk olgeta dei.'),
   ('xe', '???')]),
 ('kupareto',
  [('rt', 'kupare'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'smoke'),
   ('tkp', 'smok'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kupareto ira vusivusipareveira etokasi iava.'),
   ('xp', 'Simuk i save kam aut long paia.'),
   ('xe', '???')]),
 ('kupekupe',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'fan'),
   ('tkp', 'givim win'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Ragai kakaeto kupekupepavoi uvare kasiraoparoi.'),
   ('xp', 'Mi givim win long pikinini bikos em i hot.'),
   ('xe', "I'm fanning the child because it's hot.")]),
 ('kupekupepa',
  [('rt', 'kupekupe'),
   ('ps', 'N'),
   ('pt', '???'),
   ('ge', 'fan'),
   ('tkp', 'samting bilong givim win'),
   ('dt', '13/Jul/2004'),
   ('ex', 'Ragai orakupekupeparai kupekupepaia.'),
   ('xp', 'Mi givim win long mi yet long fan.'),
   ('xe', "I'm fanning myself with the fan.")]),
 ('kupero',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'shoulder joint'),
   ('tkp', 'sol'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kupero koue iava.'),
   ('xp', 'Bun skuru bilong pik.'),
   ('xe', '???')]),
 ('kuperoo',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'arrow with bone tip'),
   ('tkp', 'supsup i gat bun'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Siova kuperoo paurevora keari tou-ia.'),
   ('xp', 'Siova i bin putim bun bilong pik long pitpit.'),
   ('xe', '???')]),
 ('kuperovira',
  [('rt', 'kupero'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'dull flat-like'),
   ('tkp', '???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kuperovira raga vokapavoi.'),
   ('xp', 'Mi wokabaut nating na mi no save.'),
   ('xe', '???')]),
 ('kupii',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'pupa of beetle'),
   ('tkp', 'pikinini bilong binatang'),
   ('dt', '07/Mar/2005'),
   ('ex', 'Kupito era kokopuoto.'),
   ('xp', 'Batapalai i kamap pupa.'),
   ('xe', '???')]),
 ('kupukupu',
  [('rt', 'kupu'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('rdp', 'full'),
   ('ge', 'excited'),
   ('ge', 'anxious'),
   ('tkp', 'kirapim bel'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '07/Sep/2005'),
   ('rdp', 'full'),
   ('ex', 'Ragai iava vovou isi kupukupuiraopaivo rorupaoro.'),
   ('xp', 'Bel bilong mi seksek tru taim mi amamas.'),
   ('xe', 'My heart shakes when I am truly happy.')]),
 ('kurae',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'frog'),
   ('tkp', '???'),
   ('nt',
    'green, rubbery appearance, cries just before the rain to get the ???. Pictured on the 15t PNG postage stamp.'),
   ('sf', 'FAUNA'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kurae iria gaupaeveira vosia kokeva koveo.'),
   ('xp', 'Rokrok bilong bus i save krai long taim bilong ren.'),
   ('xe', '???')]),
 ('kurasia',
  [('ps', '???'),
   ('ge', 'wrist'),
   ('tkp', 'skru bilong han'),
   ('dt', '26/Jan/2005')]),
 ('kurasia',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'lower bones of limbs'),
   ('tkp', '???'),
   ('sf', 'BP'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kurasia keru upiapai ragai-re.'),
   ('xp', 'Bun bilong mi i pen.'),
   ('xe', 'My wrist bone hurts.')]),
 ('kurei',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'lizard'),
   ('tkp', '???'),
   ('nt', 'black smooth skin, small, lives near the beach'),
   ('sf', 'FAUNA'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kurei iria tetevu guruvaro-ia toupaeveira.'),
   ('xp', 'Geko i save istap long ol lip saksak.'),
   ('xe', '???')]),
 ('kuri',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'scrape'),
   ('tkp', 'skrapim'),
   ('eng', 'scrape'),
   ('eng', 'scratch'),
   ('eng', 'gnashing'),
   ('eng', 'gritting'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Visii iava reuri oauisii kuripaoro siraovira toupataua voa.'),
   ('xp', '???'),
   ('xe', 'You were sorrowfully there gritting/grashing your teeth.'),
   ('ex', 'Robin, opita kuriu.'),
   ('xp', 'Robin, yu skirapim kokonas.'),
   ('xe', 'Robin, scrape some coconut.')]),
 ('kuriato',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'frog'),
   ('tkp', '???'),
   ('nt', 'very small black body'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kuriatoa garepavavi, iria gaupaeveira.'),
   ('xp', 'Em wanpela liklik rokrok i save krai ol taim long bus.'),
   ('xe', '???')]),
 ('kurikaakaakuto',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'weevil'),
   ('tkp', '???'),
   ('nt', 'insect with long proboscis, looks like boll-weevil'),
   ('sf', 'FAUNA'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kurkaakaakuto kokoruto riropa oruto.'),
   ('xp', 'Binatang i gat bikpela garas.'),
   ('xe', '???')]),
 ('kurikasi',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'urge along'),
   ('ge', 'prod along'),
   ('tkp', 'hariapim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Koie kurikasireva oirato.'),
   ('xp', '???'),
   ('xe', 'The man urged the pig along.'),
   ('ex', 'Pita koue kurikasirevoi kasiura iare.'),
   ('xp', 'Pita i hariapaim tru pik i go long banis.'),
   ('xe', 'Peter urged the pigs into the fence.')]),
 ('kurikasivira',
  [('rt', 'kurikasi'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'urgently'),
   ('tkp', 'olsem hariap'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kurikasivira rutu kareai riakora eisi uupa tapi.'),
   ('xp', 'Ol meri i kam bek kwik tru long bung.'),
   ('xe', '???')]),
 ('kurikoko',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'orchid'),
   ('tkp', 'wanpela kain piaua'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kurikoko oa vo toupaiveira evaoara-ia.'),
   ('xp', 'Em palaua i save istap long ol diwai.'),
   ('xe', '???')]),
 ('kurikuri',
  [('rt', 'kuri'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'scratch repeatedly'),
   ('tkp', 'skrapim'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('rdp', 'full'),
   ('ex', 'David kavorava kurikruipare.'),
   ('xp', 'David i skirapim girire.'),
   ('xe', 'David is scratching his grille.'),
   ('ex', 'Kurikuripavoi varaua iava.'),
   ('xp', 'Skin bilong mi skirap.'),
   ('xe', '???')]),
 ('kuripaa',
  [('rt', 'kuri'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'drawknife'),
   ('ge', 'scraper'),
   ('tkp', 'naip samting'),
   ('dt', '07/Mar/2005'),
   ('ex', 'Raki, uriou sigo kuripaa-va.'),
   ('xp', 'Raki, yu kam wantaim pail.'),
   ('xe', 'Raki, come here with the scraper.')]),
 ('kuritava',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'octopus'),
   ('ge', 'squid'),
   ('tkp', 'tauka'),
   ('dt', '26/Jan/2005'),
   ('ex',
    'Kuritava iria avakava siovaraia toupaeveira, uvui ra uvuoavai rokopieve.'),
   ('xp',
    'Urita i save istap insait long solwara, em inap daunim sip i go insait long solwara.'),
   ('xe', '???')]),
 ('kuroa',
  [('rt', 'kuro'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'half'),
   ('tkp', 'hap'),
   ('dt', '07/Dec/2004'),
   ('ex', 'Kuroa raga vao-ia kepa, viapau opesiera.'),
   ('xp', 'Haus em i hap tasol i no pinis iet.'),
   ('xe', "The haus is half-done, it wasn't finished.")]),
 ('kuroea',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'vine with leathery appearance'),
   ('ge', 'shoe'),
   ('tkp', 'wanpela kain rop'),
   ('tkp', 'su'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Iroiro oa vaisipai oisio kuroea, ora tasipara.'),
   ('xp', 'Em i rop, tu ol Rotoaksi i save kolim su long dispela frut.'),
   ('xe', '???')]),
 ('kurokuro',
  [('rt', 'kuro'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'arthritic'),
   ('ge', 'paralyzed'),
   ('tkp', 'haphapim'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '01/Jun/2005'),
   ('rdp', 'full'),
   ('ex', 'Vavaea korukorupai ragaire.'),
   ('xp', '???'),
   ('xe', 'My hand is paralyzed.'),
   ('ex', 'Ragai-re vavaea kurokuroepa.'),
   ('xp', 'Han bilong mi i bin nogut.'),
   ('xe', 'My hand is paralyzed.')]),
 ('kurokuroto',
  [('alt', 'kurokuropato'),
   ('rt', 'kurokuro'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'arthritic person'),
   ('tkp', '???'),
   ('dt', '15/Nov/2005'),
   ('ex',
    'Ro oirato ira viapau vearovira vokaparevo uva kokotoa ei vuruvira kurukuroto.'),
   ('xp', 'Man i no wokabaut gut bikos ol lek bilong em mo em nogut.'),
   ('xe', '???')]),
 ('kuroo',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', "child's penis"),
   ('tkp', 'kok bilong pikinini'),
   ('nt', 'acceptable in mixed company'),
   ('cmt', 'Check spelling: kuro or kuroo? Sentence fragment.'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kakaetoa iava kuro.'),
   ('xp', 'Bol bilong liklik boi.'),
   ('xe', '???')]),
 ('kurooro',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'crumbs'),
   ('ge', 'small particles'),
   ('tkp', 'ol hap kaikai'),
   ('cmt', 'Check spelling kuroro or kurooro?'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Pita vaio ora Jon aio kurooro aiosivoi.'),
   ('xp', 'Pita na Jon i kaikaim ol hap kaikai.'),
   ('xe', '???'),
   ('ex', 'Aio kuroro tovori evara vearovira.'),
   ('xp', 'Yu putim gut ol hap kaikai.'),
   ('xe', '???')]),
 ('kurovira',
  [('rt', 'kuro'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'incompletely'),
   ('tkp', 'hapim'),
   ('dt', '13/Nov/2005'),
   ('ex', 'Kurovira toupaivo rari vuku.'),
   ('xp', '???'),
   ('xe', 'The trust is incomplete.'),
   ('ex', 'Kurovira kepa purarivo.'),
   ('xp', 'Yu hapim haus.'),
   ('xe', '???'),
   ('ex', 'Kurovira aio koko kekeavo ragai vaaro oa aioivo.'),
   ('xp',
    'Mi lukim pelet kaikai bilong mi olsem ol i kaikai long em na ol i hapim.'),
   ('xe', '???')]),
 ('kuru',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'strip off branches'),
   ('tkp', '???'),
   ('dt', '08/Jun/2005'),
   ('arg', 'O'),
   ('vx', '2'),
   ('ex', 'Evaova kururi.'),
   ('xp', '???'),
   ('xe', 'Strip the tree of all the branches.'),
   ('ex', 'Kasiva kuru kekepari vavoisio Ruru.'),
   ('xp', 'Yu lukim wanpela han het bilong diwai kanu long Ruru.'),
   ('xe', '???')]),
 ('kurue',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'pigeon'),
   ('tkp', 'balus'),
   ('eng', 'Spice Imperial Pigeon (Ducula myristicivora)'),
   ('eng', 'Pacific Imperial Pigeon (Ducula pacifica)'),
   ('dt', '31/Aug/2005'),
   ('ex', 'Kurue ira kukua aiopaoro ora toupieparoveira.'),
   ('xp', 'Pisin barus i save kaikai garip na istap isi.'),
   ('xe', '???')]),
 ('kurupi',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'starling'),
   ('tkp', 'pisin'),
   ('eng', 'Singing Starling (Aploni cantoroides)'),
   ('nt', 'clan totem'),
   ('sa', 'orokui'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '12/Jul/2004')]),
 ('kururai',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'bamboo water carrier'),
   ('tkp', '???'),
   ('eng', 'two joined sections of bamboo used for carrying water'),
   ('dt', '04/Dec/2004'),
   ('ex', 'veta kururai ukaepava.'),
   ('xp', 'mambu bilong karim wara'),
   ('xe', '???'),
   ('ex', 'Salome kururae-ia ukovi kaepae.'),
   ('xp', 'Salome i karim wara long mambu.'),
   ('xe', '???')]),
 ('kururu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'crumble something'),
   ('tkp', '???'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Va kurururi.'),
   ('xp', '???'),
   ('xe', 'You crumble it.'),
   ('ex', 'Bisketea aiopaoro va kururuavoi.'),
   ('xp', 'Mi kaikaim bisket na i bruk liklik tru.'),
   ('xe', '???')]),
 ('kurutu',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'portion'),
   ('ge', 'part of'),
   ('tkp', '???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Uuko kurutu vateri eva.'),
   ('xp', '???'),
   ('xe', 'Give that half full cup of water (to him).'),
   ('ex', 'Uko kurutu vateri eva ragai-pa.'),
   ('xp', 'Yu givim hap wara long mi.'),
   ('xe', '???')]),
 ('Kusi',
  [('ps', 'N'),
   ('pt', 'PN'),
   ('ge', 'village name'),
   ('tkp', 'ples nem'),
   ('dx', 'Aita'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Uruia vaisiaro aue kusi oa toupai eisi Aita.'),
   ('xp', 'Nem bilong ples Kusi istap long Aita.'),
   ('xe', '???')]),
 ('kusii',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'cool off'),
   ('tkp', '???'),
   ('dt', '01/Jun/2005'),
   ('vx', '1'),
   ('sc', '???'),
   ('ex', 'Varaua rera-re kusiipe.'),
   ('xp', '???'),
   ('xe', 'His body would cool off.'),
   ('ex', 'Ukoara kusiipapeira eisi Tutue.'),
   ('xp', 'Ol wara i save kol long Balbi.'),
   ('xe', 'The waters are always cold on Mt. Balbi.')]),
 ('kusike',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'rat'),
   ('tkp', 'rat'),
   ('nt', 'archaic'),
   ('sa', 'isike'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kusike votoupai vegoaro ora vo kepara sovaraia.'),
   ('xp', 'Rat em i stap long bus na em i stap tu long ples.'),
   ('xe', 'Rats are in the bush and also in houses.'),
   ('ex', 'Kusike upoevo epusi.'),
   ('xp', 'Pusi i kilim rat.'),
   ('xe', 'The cat killed the rat.')]),
 ('kusito',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'shellfish'),
   ('ge', 'scallop'),
   ('tkp', '???'),
   ('nt',
    'So called because in the past the shell was attached to a piece of wood ???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kusito opita kuripato.'),
   ('xp', 'Skarap bilong skarapim kokonas.'),
   ('xe', '???')]),
 ('kuu',
  [('ps', '???'),
   ('ge', 'talk.from.the.grave'),
   ('eng', 'talk to dead man'),
   ('eng', "dead person's talk"),
   ('tkp', '???'),
   ('dcsv', 'true'),
   ('vx', '???'),
   ('cmt',
    "Is this archaic? Isn't recognized by informants, who only can think of ku (short vowel). It may be that kuupie is rootless?"),
   ('dt', '27/Sep/2006')]),
 ('kuu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'satisfied with'),
   ('ge', 'finished with'),
   ('tkp', 'hepi long en'),
   ('tkp', 'pinisim'),
   ('cmt',
    "Check vowel length. Don't really understand this... Need more examples and some follow-up."),
   ('vx', '???'),
   ('arg', '???'),
   ('dt', '27/Sep/2006'),
   ('ex', 'Vii kupaavoi eva opesirivo.'),
   ('xp', 'Mi lak save yu pisism dispela.'),
   ('xe', '???'),
   ('ex', 'Vii kupavoi oisio e kovoa opesirivo.'),
   ('xp', 'Mi laik save, yu pinisim wok bilong yu.'),
   ('xe', '???'),
   ('ex', 'Rarasori kovoara rutu kupare ra karero verevira eisi-re Amerika.'),
   ('xp', 'Rarason bai pinim tru ol wok bihain bai em i go bek long Amerika.'),
   ('xe', 'Robinson is going to finish all of the work ???.')]),
 ('kuukuuvu',
  [('rt', 'kuuvu'),
   ('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'lie'),
   ('ge', 'deceive'),
   ('tkp', 'giamanim'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '07/Sep/2005'),
   ('cmt', 'Follow up on second example.'),
   ('rdp', 'partial'),
   ('ex', 'Kuukuuvupaue vii.'),
   ('xp', 'Yu giaman man.'),
   ('xe', 'You lie.'),
   ('ex', 'Vori areae riro kuukuuva-va ruvaraia riakova.'),
   ('xp', 'Ol i askim moni long meri bilong giaman.'),
   ('xe', 'They ask for money for the lying woman.')]),
 ('kuukuuvuto',
  [('alt', 'kuukuuvupato'),
   ('rt', 'kuukuuvu'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', 'liar'),
   ('ge', 'deceiver'),
   ('tkp', '???'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Pita riro kuukuuvuto.'),
   ('xp', 'Pita em i man bilong giaman man.'),
   ('xe', 'Peter is a big liar.'),
   ('ex', 'Riro kuukuuvuva vii.'),
   ('xp', 'Yu man bilong giaman.'),
   ('xe', "You're a liar."),
   ('ex', 'Tugarato riro kuukuuvuto ira oirara keakeapareveira.'),
   ('xp', 'Setan i man bilong giaman em i save giamanim ol man.'),
   ('xe', 'Satan is a big liar who deceives people.')]),
 ('kuuoa',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'owl'),
   ('tkp', 'korol'),
   ('nt', 'small brown body, large yellow eyes, wire tufts around the'),
   ('cmt',
    "According to Wakunai informants, it's kuo (short 'o' and no final 'a'), and is used for all owls. Need to double-check."),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '08/Mar/2005'),
   ('ex',
    'Kuuoa viro osireiva vokiaro vokapaeveira papapaoro ora ava-ia uusipaoveira.'),
   ('xp',
    'I gat bikpela ei na em i save fly long nait i slip long ol hol bilong diwai.'),
   ('xe', '???'),
   ('ex', 'Kuuo iria gaupaeveira vokiaro.'),
   ('xp', 'Korol i save karai long nait.'),
   ('xe', 'The owl cries at night.')]),
 ('kuupie',
  [('rt', 'kuu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'talk.to.the.dead'),
   ('tkp', 'tokim dai man'),
   ('eng', 'speak to the dead'),
   ('eng', 'communicate with the spirit of a dead person'),
   ('arg', 'O'),
   ('vx', '2'),
   ('cmt', "Does 'rera' with 'ukaiopiesia' refer to a drinker or drinkee?"),
   ('dt', '08/Dec/2005'),
   ('ex', 'Kuupiereva.'),
   ('xp', '???'),
   ('xe', 'He caused the spirit of the dead person to speak.'),
   ('ex', 'Siorairara oisioa kopiiirara kuupiepaive.'),
   ('xp', 'Ol poisin man i save singautim spirit bilong ol dai man.'),
   ('xe', 'Poison men always talk to the spirits of the dead.'),
   ('ex', 'Kuu oirato kuupiepaveira vosa ruva tovarovoive.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex', 'Uva Kokota Tovisia kuupiepareva.'),
   ('xp', 'Kokota i singautim spirit bilong Tovisia.'),
   ('xe', 'And Kokota spoke to the spirit of Tovisia.'),
   ('ex', 'Kopito kuupieivo eisi tova urui rera ukaiopiesia.'),
   ('xp', 'Ol i kolim dai man long matmat long mekim em dring.'),
   ('xe', '???')]),
 ('kuurea',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'gecko'),
   ('tkp', 'geko'),
   ('nt', 'large with white skin'),
   ('sf', 'FAUNA'),
   ('dt', '10/Feb/2005'),
   ('ex', 'Korikoripato vavauiavo oiso vera osa.'),
   ('xp', 'Em i gat ol makmak ol luk okam geko.'),
   ('xe', '???'),
   ('ex', 'Kuurea ira toupareveira tetevu guruvaro siovaraia.'),
   ('xp', 'Geko i save istap insait long ol lip saksak.'),
   ('xe', 'The kuurea gecko lives on the leaves of sago.')]),
 ('kuuri',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'grunt'),
   ('ge', 'huff and puff'),
   ('tkp', 'pulim win strong'),
   ('vx', '1'),
   ('sc', 'BODILY.PROCESS'),
   ('cmt', 'Need ex to illustrate subj agreement'),
   ('dt', '01/Jun/2005'),
   ('cmt', 'Check original--first verb is dependent verb, no?'),
   ('ex', 'Pita kuuripaoro iiparoei vaisia-ia.'),
   ('xp', 'Pita em i pulim strongpela win taim i go antap long maunten.'),
   ('xe', 'Peter is going up the mountain huffing and puffing.')]),
 ('kuuva',
  [('ps', '???'),
   ('ge', 'blue'),
   ('ge', 'purple'),
   ('tkp', 'blupela'),
   ('tkp', 'pepol'),
   ('dt', '14/Nov/2005'),
   ('ex',
    'Siroiri kuuvapa isi karuvera isi erievo ragai-pa uva vo ogatava karerai va orisia.'),
   ('xp',
    'Siroiri em i digim pepolpela singapo bilong mi na mi woksakim i kam long peles long kukim.'),
   ('xe', '???')]),
 ('kuuvaki',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'quiet'),
   ('tkp', '???'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '01/Jun/2005'),
   ('ex',
    'Oirato kuuvakito roira viapau reoparoveira ova veva vovouaro ragai toupareve.'),
   ('xp', 'Mau i no nap toktok tumas em bai i stap ig tasol.'),
   ('xe', '???'),
   ('ex', 'Kuvakivira toupareveira.'),
   ('xp', 'Em i save istap isi na em i no save toktok.'),
   ('xe', 'He is usually quiet.')]),
 ('kuuvakito',
  [('rt', 'kuuvaki'),
   ('ps', 'N'),
   ('pt', 'HUM'),
   ('ge', "quiet person who doesn't talk much"),
   ('ge', 'docile person'),
   ('tkp', '???'),
   ('avm', 'pa_barred'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Roo ira viapau ririvira reo puraparo.'),
   ('xp', 'Man ino save toktok tuma.'),
   ('xe', "One who doesn't ???."),
   ('ex', 'Kuvakitoa ro oirato ira viapau rirovira reoreoparo.'),
   ('xp', 'Man bilong istap isi na em i no save toktok tumas.'),
   ('xe', "A quiet man is a man who doesn't talk much.")]),
 ('kuuvato',
  [('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'parrot'),
   ('tkp', '???'),
   ('eng', 'Eclectus Parrot (Eclectus roratus)'),
   ('sa', 'sikarato'),
   ('sf', 'FAUNA.BIRD'),
   ('dt', '10/Feb/2005'),
   ('ex', 'Sikavoto kuuvato kokovavavira toupare.'),
   ('xp', 'Parrot em blue va em green pinis.'),
   ('xe', '???'),
   ('ex', 'Kuuvato kokovarato kokioto.'),
   ('xp', 'Parot em i grinpela pisin.'),
   ('xe', 'The kuuvato is a green pidgin.')]),
 ('kuuvavira',
  [('rt', 'kuuva'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'blue'),
   ('tkp', 'blupela'),
   ('cmt', 'Also translated as pepol/purple'),
   ('dt', '14/Nov/2005'),
   ('cmt',
    'Again, the adv kuuvavira is used to modify A in 2nd example. Check Tok Pisin on 1st example.'),
   ('ex', 'Kuuvavira kekepapiroi vavoisi.'),
   ('xp', 'Em i luk blu long hap i noway tru.'),
   ('xe', 'It looks blue there.'),
   ('ex', 'Avakava kuuvavira kekeavoi uva siraorai.'),
   ('xp', 'Mi lukim solwara i blu turu, na mi sori.'),
   ('xe', 'I saw the ocean looking blue and I felt sad.'),
   ('ex', 'Kuuvavira kekepapiroi eva purpurua ovusia va kaepari.'),
   ('xp', '???'),
   ('xe', '???'),
   ('ex',
    'Kuuvapavira rutu paruvo uukovi vo va tegoto ovusia rera toerevo Sipei.'),
   ('xp',
    'Wara bilong wail banana i ron na i luk pepol stret taim Sipei i katim daun nail banana.'),
   ('xe', 'The water from the wild banana flowed purple as Sipei cut it.')]),
 ('kuuvu',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'lie'),
   ('ge', 'deceive'),
   ('tkp', 'tok giaman'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '01/Jun/2005'),
   ('ex', 'Asi vii vai kuuvupauei vo vokio.'),
   ('xp', 'Ating yu tok giaman tude!'),
   ('xe', 'Why you are lying today, I think!'),
   ('ex', 'Apoka, kuuvupauei, riro kuuvuto vii.'),
   ('xp', 'Apoka, yu giaman, man bilong giaman.'),
   ('xe', "Apoka, you're lying, you big liar.")]),
 ('kuuvuvira',
  [('rt', 'kuuvu'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'deceptively'),
   ('ge', 'untruthfully'),
   ('ge', 'illusive'),
   ('tkp', '???'),
   ('dt', '10/Feb/2005'),
   ('ex', 'Kuuvuvira ita toupare evoa ra avaro Buka iare.'),
   ('xp', 'Em i stap giaman gen bai i go long Buka.'),
   ('xe', '???.'),
   ('ex', 'Kuuvuvira reo purapa ra kaviruro.'),
   ('xp', 'Em i toktok giaman gen bai em stil.'),
   ('xe', '???.'),
   ('ex', 'Jeri kuuvuvira uusiparoi ra vokiaro vokapasia avaro.'),
   ('xp', 'Jerry i slip giaman, bai em i go raun long nait.'),
   ('xe', 'Jerry pretends to sleep and goes out and about at night.')]),
 ('kuva',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'work sorcery'),
   ('tkp', 'gip papait'),
   ('eng', 'work sorcery'),
   ('eng', 'do black magic'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Uririparaepa oiso teapi ragai kuvaive.'),
   ('xp', '???'),
   ('xe', 'I was afraid lest they would work sorcery on me.'),
   ('ex', 'Latu uririparoera oisio teapi rera kuvaive.'),
   ('xp', 'Lati i poret olsem nogut ol i poisinim em.'),
   ('xe', 'Latu is afraid that they will poisin him.')]),
 ('kuvaku',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'plant'),
   ('tkp', '???'),
   ('sf', 'FLORA'),
   ('nt',
    'After the smashed branch was in place, the area of the break was warmed ??? , branches were smashed and used as splints for holding ???'),
   ('dt', '12/Feb/2005'),
   ('ex', 'Voari tuariri oisioa oirato kuvakupaive, rara kovero orakepisia.'),
   ('xp',
    'Bipo ol man i save putim kuvaku long man sapos em i pundaun na brukim lek.'),
   ('xe', '???')]),
 ('kuvato',
  [('ps', '???'),
   ('ge', 'name of clan'),
   ('tkp', '???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Siara oa vaisipaiveira oisio kuvato eisi Aita.'),
   ('xp', 'Nem bilong wanpela bisnis ol Aita i save kolim long kuvato.'),
   ('xe', '???')]),
 ('kuvau',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'alone'),
   ('tkp', 'ples i nogat man'),
   ('dcsv', 'true'),
   ('vx', '1'),
   ('sc', '???'),
   ('dt', '02/Jun/2005'),
   ('ex', 'Kuvauuei.'),
   ('xp', '???'),
   ('xe', 'You are all alone.'),
   ('ex', 'Togarao-ia kuvauei rutu, viapau oirara ora riakoravai.'),
   ('xp', 'Ples Togarao i nogat  manmeri tru.'),
   ('xe', '???')]),
 ('kuvaupie',
  [('rt', 'kuvau'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'desert'),
   ('ge', 'leave alone'),
   ('tkp', '???'),
   ('cm', '-re'),
   ('arg', 'OBL'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('ex', 'Ragai-re kuvaupieri kopioro.'),
   ('xp', 'Yu dai bai mi tasol istap wan pis.'),
   ('xe', 'You will die, leaving me behind.'),
   ('ex', 'Ragai-re kuvaupietavoi kareoro eisi-re atoia.'),
   ('xp', 'Yupela lusim mi na go long ples, na mi i stap wanpis.'),
   ('xe', 'You return to the village, leaving me behind.'),
   ('ex', 'Ovokivu-ia kuvaupieiva vikuoro vo-re opopasia.'),
   ('xp', '???'),
   ('xe',
    'One time they deserted (the village) going to the garden to get taro.'),
   ('ex', 'Avai kovosia kuvaupieoro vova.'),
   ('xp', 'Nogat man long hia olgeta i go work.'),
   ('xe', '???')]),
 ('kuvauvira',
  [('rt', 'kuvau'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'alone'),
   ('ge', 'isolated'),
   ('tkp', '???'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kuvauvira toupari.'),
   ('xp', '???'),
   ('xe', 'You are all alone.'),
   ('ex', 'Kovatoa-ia toupaiveira kuvauvira.'),
   ('xp', 'Mi save istap wanpis long kovatoa.'),
   ('xe', '???')]),
 ('kuvera',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'net fish or animals'),
   ('tkp', 'wokim umben'),
   ('cmt',
    'Example looks like incorporation. Double-check verb classification (A or B).'),
   ('vx', '???'),
   ('dt', '22/Nov/2005'),
   ('ex',
    'Kuverapare ovusia rera-pa atari kare sigupavoi ra ora kiukiuave kuveraua sovarai ravo kare oureve vokare orisia.'),
   ('xp',
    'Em i wok long net taim mi raun ol pis na bai ol i go sinait umben na bai em i kisim ol na kukim ol.'),
   ('xe', '???'),
   ('ex', 'Rokoe atari-kuveraroepa avakava-ia.'),
   ('xp', '???'),
   ('xe', 'Rokoe netted fish in the sea.')]),
 ('kuverava',
  [('ps', 'N'),
   ('pt', 'FEM'),
   ('ge', 'net'),
   ('ge', 'spider web'),
   ('ge', 'strainer'),
   ('tkp', 'umben haus bilong spaida'),
   ('dt', '08/Mar/2005'),
   ('ex', 'Kuverava kokio kuverapava vo vegoaro.'),
   ('xp', 'Ubian bilong kisim pisin long bus.'),
   ('xe', '???'),
   ('ex', 'Kuverava atari kuverapava ora aue kavari kokio.'),
   ('xp', 'Net blong netim fish.'),
   ('xe', '???'),
   ('ex', 'Pita oiraaro kuverava atari kuverapava avakava-ia.'),
   ('xp', 'Ubian bilong Pita, em i save kisim pis long solwara.'),
   ('xe', '???')]),
 ('kuvere',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'make an echo'),
   ('tkp', 'krai i kam bek'),
   ('cmt', 'Need simple example'),
   ('arg', '???'),
   ('vx', '???'),
   ('dt', '08/Jun/2005'),
   ('ex',
    'Vosa pukuia kukueavo torepay ra kakupiepari ora oiveta ra ova vioepaurere viraga. Keapauvere oiso oravu vi atopare.'),
   ('xp',
    'Taim yu stap antap name long tupela maunten na yu singaut bikpela bai singaut blong yu bai go. Na yu ting olsen wanpela man i mekim??sigaut blong yu.'),
   ('xe', '???'),
   ('ex', 'Ora vii raga reraro kuvereto vii aatoparevoi.'),
   ('xp', 'Singaut bilong yu tasol i pairap na i bekim yu.'),
   ('xe', '???')]),
 ('kuverea',
  [('rt', 'kuvere'),
   ('ps', '???'),
   ('ge', 'incomplete'),
   ('ge', 'unready'),
   ('ge', 'not all right'),
   ('tkp', 'i no redi'),
   ('dt', '02/Jun/2005'),
   ('ex', 'Kuverea vao-ia opoa. Kuverea vao-ia opeitarai ovoipa.'),
   ('xp', 'Tam ino ready yet. Dispela kaikai ino ready yet.'),
   ('xe', '???'),
   ('ex', 'Kuverea raga vao-ia opoa.'),
   ('xp', 'Em taro i no orait.'),
   ('xe', '???')]),
 ('kuvereto',
  [('rt', 'kuvere'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'echo'),
   ('tkp', 'krai i kam bek'),
   ('dt', '15/Nov/2005'),
   ('ex', 'Vii raga oravioepaue.'),
   ('xp', 'Yu tasol yu bihanim toktok bla??.'),
   ('xe', '???'),
   ('ex', 'Andru kakuupiepare ovusia kuvereto pokoparovi vavoisio.'),
   ('xp', 'Andru i singaut na singaut bilong em yet i pairap long hap.'),
   ('xe', '???')]),
 ('kuverevira',
  [('rt', 'kuvere'),
   ('ps', 'ADV'),
   ('pt', 'MANNER'),
   ('ge', 'incompletely'),
   ('tkp', 'i no redi'),
   ('dt', '04/Dec/2004'),
   ('ex', 'Kuverevira kovoa toupaivo opeitai ovoipai.'),
   ('xp', 'Gaden i no ready yet tru.'),
   ('xe', "The garden isn't ready yet."),
   ('ex', 'Kuverevira raga toupai upiriko kovo.'),
   ('xp', 'Gaden kaukau i no orait iet.'),
   ('xe', '???')]),
 ('kuvoro',
  [('ps', 'V'),
   ('pt', 'A'),
   ('ge', 'burnt out'),
   ('ge', 'extinguished'),
   ('tkp', 'i no gat paia'),
   ('eng', 'burned out'),
   ('eng', 'extinguished'),
   ('cmt',
    'Seems to have extended meaning. Get examples of the literal and figurative meanings.'),
   ('dt', '02/Jun/2005'),
   ('vx', '1'),
   ('sc', '???'),
   ('ex', "Uva oiso puraaepa, ``Eavoeao, e kuvorotae?''."),
   ('xp', '???'),
   ('xe', "They said this, ``Why, are you all burned out?''."),
   ('ex', 'Jekop kuvororoe eisi vegoaro ravapaoro.'),
   ('xp', 'Jekop i slip nogat long nait taim em i lukaut kapul long bus.'),
   ('xe', 'Jekop burnt out in the jungle looking for possum. (???)')]),
 ('kuvu',
  [('ps', 'V'),
   ('pt', 'B'),
   ('ge', 'fill.up'),
   ('tkp', 'pulamapim'),
   ('tkp', 'putim kaikai insait long mambu'),
   ('eng', 'fill up'),
   ('eng', 'put inside bamboo'),
   ('eng', 'clothe'),
   ('arg', 'O'),
   ('vx', '2'),
   ('dt', '08/Jun/2005'),
   ('cmt',
    "Is John filling the pig with bamboo or vice-versa in the 2nd ex sentence? What's the arg structure?"),
   ('ex', 'Vosia koieavai upoa oisoa iriai kuvua aue-ia veeta.'),
   ('xp', '???'),
   ('xe',
    'When I would kill a pig, I would always put it inside bamboo tubes.'),
   ('ex', 'Jon koie kori kuvupare aue-ia veeta.'),
   ('xp', 'Jon i wok long pulamapim pik long mambu.'),
   ('xe', 'John is filling the bamboo with pig. (Or vice-versa?)')]),
 ('kuvuara',
  [('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'clothing'),
   ('tkp', 'klos'),
   ('eng', 'clothes'),
   ('eng', 'clothing'),
   ('eng', 'apparel'),
   ('dt', '07/Feb/2005'),
   ('ex',
    'Vievara kuvuara ragai varao veavopava kuvuara ora vorioro karerai esis sitoa??.'),
   ('xp', 'Em ol colhing blong yu. Yu buy ikam long store.'),
   ('xe', '???'),
   ('ex', 'Ragai vaararoa varao kuvuara, oara voriavo.'),
   ('xp', 'Em ol dispela siot bilong mi, mi bin baim.'),
   ('xe', 'This clothing of mine, I bought it.')]),
 ('kuvukuvu',
  [('rt', 'kuvu'),
   ('ps', 'V'),
   ('pt', 'B'),
   ('rdp', 'full'),
   ('ge', 'fill up'),
   ('ge', 'stamp the ground'),
   ('tkp', 'pulapim krungutim graun'),
   ('vx', '2'),
   ('arg', 'O'),
   ('cmt', 'Need good example; this example'),
   ('dt', '08/Dec/2005'),
   ('ex', 'Koue kuvukuvuivo aue-ia veeta vara tutupaoro rasitoa-ia.'),
   ('xp', 'Ol i pulumapim ol mambu pik long mambu.'),
   ('xe', 'They filled the pig with bamboo and buried it in the ground.')]),
 ('kuvukuvua',
  [('rt', 'kuvukuvu'),
   ('ps', 'N'),
   ('pt', 'NT'),
   ('ge', 'pole planted in the ground as sounding board'),
   ('tkp', 'diwai bilong autim nois'),
   ('dt', '07/Feb/2005'),
   ('ex', 'Totoueua-ia kuvkuvua purarevo Pauro, rasitoa-ia va tovaoro.'),
   ('xp',
    'Pauro i palanim hap diwai long graun. Taim ol man i wok long kruktim na i wok long pairap.'),
   ('xe', 'Paul planted ???.')]),
 ('kuvupato',
  [('alt', 'kuvuto'),
   ('rt', 'kuvu'),
   ('ps', 'N'),
   ('pt', 'MASC'),
   ('ge', 'shirt'),
   ('eng', 'shirt'),
   ('eng', 'jacket'),
   ('eng', 'coat'),
   ('tkp', 'klos'),
   ('tkp', 'saket'),
   ('sc', '???'),
   ('dt', '02/Sep/2005'),
   ('ex', 'Ragai reraaro kuvupato kavirurevo Pita.'),
   ('xp', 'Pita em i stilim siot bilong mi.'),
   ('xe', 'Peter stole my shirt.')]),
 ('kuvuto',
  [('ps', 'N'),
   ('pt', '???'),
   ('ge', 'clothes'),
   ('ge', 'clothing'),
   ('tkp', 'laplap'),
   ('dt', '28/Jul/2004')])]

WordNet

a semantically-oriented dictionary of English, similar to a traditional thesaurus but with a richer structure. NLTK includes the English WordNet (155,287 words and 117,659 synonym sets)

Senses and Synonyms

Identifying Synonyms

Benz is credited with the invention of the motorcar

Benz is credited with the invention of the automobile

--> Synonyms


In [90]:
from nltk.corpus import wordnet as wn
wn.synsets('motorcar')


Out[90]:
[Synset('car.n.01')]

motorcar has one possible meaning -- it is identified as car.n.01 -- the first noun sense of car

car.n.01 is a synset (collection of synonymous words, or lemmas)


In [91]:
wn.synset('car.n.01').lemma_names()


Out[91]:
['car', 'auto', 'automobile', 'machine', 'motorcar']

meaning of a synset


In [92]:
wn.synset('car.n.01').definition()


Out[92]:
'a motor vehicle with four wheels; usually propelled by an internal combustion engine'

In [93]:
wn.synset('car.n.01').examples()


Out[93]:
['he needs a car to get to work']

The pairing of a synset with a word is called a lemma


In [94]:
wn.synset('car.n.01').lemmas()


Out[94]:
[Lemma('car.n.01.car'),
 Lemma('car.n.01.auto'),
 Lemma('car.n.01.automobile'),
 Lemma('car.n.01.machine'),
 Lemma('car.n.01.motorcar')]

In [95]:
wn.lemma('car.n.01.automobile')


Out[95]:
Lemma('car.n.01.automobile')

In [96]:
wn.lemma('car.n.01.automobile').synset()


Out[96]:
Synset('car.n.01')

In [97]:
wn.lemma('car.n.01.automobile').name()


Out[97]:
'automobile'

The word car is ambiguous (unlike motorcar)


In [98]:
wn.synsets('car')


Out[98]:
[Synset('car.n.01'),
 Synset('car.n.02'),
 Synset('car.n.03'),
 Synset('car.n.04'),
 Synset('cable_car.n.01')]

In [99]:
for synset in wn.synsets('car'):
    print(synset.lemma_names())


['car', 'auto', 'automobile', 'machine', 'motorcar']
['car', 'railcar', 'railway_car', 'railroad_car']
['car', 'gondola']
['car', 'elevator_car']
['cable_car', 'car']

Can also access the lemmas using lemmas()


In [100]:
wn.lemmas('car')


Out[100]:
[Lemma('car.n.01.car'),
 Lemma('car.n.02.car'),
 Lemma('car.n.03.car'),
 Lemma('car.n.04.car'),
 Lemma('cable_car.n.01.car')]

The WordNet Hierarchy

Very general concepts: Unique Beginners / root synsets

  • Entity
  • State
  • Event

More specific concepts:

  • Gas guzzler
  • Hatchback

Using WordNet, we can look at concepts that are more specific -- hyponyms


In [101]:
motorcar = wn.synset('car.n.01')
types_of_motorcar = motorcar.hyponyms()
types_of_motorcar[0]


Out[101]:
Synset('ambulance.n.01')

less specific -- hypernyms (move up the hierarchy)


In [102]:
motorcar.hypernyms()


Out[102]:
[Synset('motor_vehicle.n.01')]

In [103]:
paths = motorcar.hypernym_paths()
len(paths)


Out[103]:
2

In [104]:
[synset.name() for synset in paths[0]]


Out[104]:
['entity.n.01',
 'physical_entity.n.01',
 'object.n.01',
 'whole.n.02',
 'artifact.n.01',
 'instrumentality.n.03',
 'container.n.01',
 'wheeled_vehicle.n.01',
 'self-propelled_vehicle.n.01',
 'motor_vehicle.n.01',
 'car.n.01']

In [105]:
[synset.name() for synset in paths[1]]


Out[105]:
['entity.n.01',
 'physical_entity.n.01',
 'object.n.01',
 'whole.n.02',
 'artifact.n.01',
 'instrumentality.n.03',
 'conveyance.n.03',
 'vehicle.n.01',
 'wheeled_vehicle.n.01',
 'self-propelled_vehicle.n.01',
 'motor_vehicle.n.01',
 'car.n.01']

Getting the root hypernyms (most general hypernyms)


In [106]:
motorcar.root_hypernyms()


Out[106]:
[Synset('entity.n.01')]

More Lexical Relations

meronyms -- moving from items to their components. examples:

  • tree --> trunk, crown (part_meronyms)
  • tree --> heartwood, sapwood (substance_meronyms)

holonyms -- moving from items to their containers. examples:

  • tree --> forest (member_holonyms)

In [107]:
wn.synset('tree.n.01').part_meronyms()


Out[107]:
[Synset('burl.n.02'),
 Synset('crown.n.07'),
 Synset('limb.n.02'),
 Synset('stump.n.01'),
 Synset('trunk.n.01')]

In [108]:
wn.synset('tree.n.01').substance_meronyms()


Out[108]:
[Synset('heartwood.n.01'), Synset('sapwood.n.01')]

In [109]:
wn.synset('tree.n.01').member_holonyms()


Out[109]:
[Synset('forest.n.01')]

lots of details -- example: mint


In [110]:
for synset in wn.synsets('mint', wn.NOUN):
    print(synset.name() + ':', synset.definition())


batch.n.02: (often followed by `of') a large number or amount or extent
mint.n.02: any north temperate plant of the genus Mentha with aromatic leaves and small mauve flowers
mint.n.03: any member of the mint family of plants
mint.n.04: the leaves of a mint plant used fresh or candied
mint.n.05: a candy that is flavored with a mint oil
mint.n.06: a plant where money is coined by authority of the government

In [111]:
wn.synset('mint.n.04').part_holonyms()


Out[111]:
[Synset('mint.n.02')]

In [112]:
wn.synset('mint.n.04').substance_holonyms()


Out[112]:
[Synset('mint.n.05')]

relationships between verbs:

walking entails stepping

(some verbs can have multiple entailments)


In [113]:
wn.synset('walk.v.01').entailments()


Out[113]:
[Synset('step.v.01')]

In [114]:
wn.synset('eat.v.01').entailments()


Out[114]:
[Synset('chew.v.01'), Synset('swallow.v.01')]

In [115]:
wn.synset('tease.v.03').entailments()


Out[115]:
[Synset('arouse.v.07'), Synset('disappoint.v.01')]

lexical relationships between lemmas: antonymy


In [116]:
wn.lemma('supply.n.02.supply').antonyms()


Out[116]:
[Lemma('demand.n.02.demand')]

In [117]:
wn.lemma('rush.v.01.rush').antonyms()


Out[117]:
[Lemma('linger.v.04.linger')]

In [118]:
wn.lemma('horizontal.a.01.horizontal').antonyms()


Out[118]:
[Lemma('inclined.a.02.inclined'), Lemma('vertical.a.01.vertical')]

In [119]:
wn.lemma('staccato.r.01.staccato').antonyms()


Out[119]:
[Lemma('legato.r.01.legato')]

Semantic Similarity

Synsets sharing a very specific hypernym must be closely related


In [120]:
right = wn.synset('right_whale.n.01')
orca = wn.synset('orca.n.01')
minke = wn.synset('minke_whale.n.01')
tortoise = wn.synset('tortoise.n.01')
novel = wn.synset('novel.n.01')
right.lowest_common_hypernyms(minke)


Out[120]:
[Synset('baleen_whale.n.01')]

In [121]:
right.lowest_common_hypernyms(orca)


Out[121]:
[Synset('whale.n.02')]

In [122]:
right.lowest_common_hypernyms(tortoise)


Out[122]:
[Synset('vertebrate.n.01')]

In [123]:
right.lowest_common_hypernyms(novel)


Out[123]:
[Synset('entity.n.01')]

quantifying generality by looking up synset depth


In [124]:
wn.synset('baleen_whale.n.01').min_depth()


Out[124]:
14

In [125]:
wn.synset('whale.n.02').min_depth()


Out[125]:
13

In [126]:
wn.synset('vertebrate.n.01').min_depth()


Out[126]:
8

In [127]:
wn.synset('entity.n.01').min_depth()


Out[127]:
0

Other similarity measures: path_similarity


In [128]:
right.path_similarity(minke) # minke whale


Out[128]:
0.25

In [129]:
right.path_similarity(orca)


Out[129]:
0.16666666666666666

In [130]:
right.path_similarity(tortoise)


Out[130]:
0.07692307692307693

In [131]:
right.path_similarity(novel)


Out[131]:
0.043478260869565216