ensegment: default program


In [2]:
from default import *

Documentation

Write some beautiful documentation of your program here.


In [3]:
Pw = Pdist(data=datafile("data/count_1w.txt"))
segmenter = Segment(Pw)
with open("data/input/dev.txt") as f:
    for line in f:
        print(" ".join(segmenter.segment(line.strip())))


choose spain
this is a test
who represents
experts exchange
speed of art
unclimatechangebody
we are the people
mentionyourfaves
now playing
the walking dead
follow me
we are the people
mentionyourfaves
check domain
big rock
name cheap
apple domains
honesty hour
being human
follow back
social media
30secondstoearth
current ratesoughttogodown
this is insane
what is my name
is it time
let us go
me too
nowthatcherisdead
advice for young journalists

Analysis

Do some analysis of the results. What ideas did you try? What worked and what did not?