Example text: From: lerxst@wam.umd.edu (where's my thing)
Subjec
ngram min=1 max=1 min_df=1
result:[u'from', u'lerxst', u'wam', u'umd', u'edu', u'where', u'my', u'thing', u'subjec']
ngram min=1 max=1 min_df=2
result:[u'from', u'lerxst', u'wam', u'umd', u'edu', u'where', u'my', u'thing', u'subjec']
ngram min=1 max=2 min_df=1
result:[u'from', u'lerxst', u'wam', u'umd', u'edu', u'where', u'my', u'thing', u'subjec', u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec']
ngram min=1 max=2 min_df=2
result:[u'from', u'lerxst', u'wam', u'umd', u'edu', u'where', u'my', u'thing', u'subjec', u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec']
ngram min=1 max=3 min_df=1
result:[u'from', u'lerxst', u'wam', u'umd', u'edu', u'where', u'my', u'thing', u'subjec', u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec', u'from lerxst wam', u'lerxst wam umd', u'wam umd edu', u'umd edu where', u'edu where my', u'where my thing', u'my thing subjec']
ngram min=1 max=3 min_df=2
result:[u'from', u'lerxst', u'wam', u'umd', u'edu', u'where', u'my', u'thing', u'subjec', u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec', u'from lerxst wam', u'lerxst wam umd', u'wam umd edu', u'umd edu where', u'edu where my', u'where my thing', u'my thing subjec']
ngram min=2 max=2 min_df=1
result:[u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec']
ngram min=2 max=2 min_df=2
result:[u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec']
ngram min=2 max=3 min_df=1
result:[u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec', u'from lerxst wam', u'lerxst wam umd', u'wam umd edu', u'umd edu where', u'edu where my', u'where my thing', u'my thing subjec']
ngram min=2 max=3 min_df=2
result:[u'from lerxst', u'lerxst wam', u'wam umd', u'umd edu', u'edu where', u'where my', u'my thing', u'thing subjec', u'from lerxst wam', u'lerxst wam umd', u'wam umd edu', u'umd edu where', u'edu where my', u'where my thing', u'my thing subjec']
ngram min=3 max=3 min_df=1
result:[u'from lerxst wam', u'lerxst wam umd', u'wam umd edu', u'umd edu where', u'edu where my', u'where my thing', u'my thing subjec']
ngram min=3 max=3 min_df=2
result:[u'from lerxst wam', u'lerxst wam umd', u'wam umd edu', u'umd edu where', u'edu where my', u'where my thing', u'my thing subjec']
Out[2]:
|
min_df |
model |
ngram_max |
ngram_min |
num_feature |
score |
train_time |
vectorizer |
| 0 |
1 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
1 |
1 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 1 |
2 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
1 |
1 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 2 |
1 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
2 |
1 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 3 |
2 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
2 |
1 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 4 |
1 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
3 |
1 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 5 |
2 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
3 |
1 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 6 |
1 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
2 |
2 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 7 |
2 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
2 |
2 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 8 |
1 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
3 |
2 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 9 |
2 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
3 |
2 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 10 |
1 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
3 |
3 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |
| 11 |
2 |
LinearSVC(C=1.0, class_weight=None, dual=True,... |
3 |
3 |
NaN |
NaN |
NaN |
CountVectorizer(analyzer=word, binary=False, c... |