Assignment 1
Cluster terms and documents in your favorite document collection or book:
- Download the collection.
- Form the term $\times$ document matrix. Remove stop words, if necessary, and apply stemming, if possible.
- Normalize the weights, if necessary.
- Form the matrices $A$ and $A_n$.
- Cluster documents (and terms) in $k$ clusters using spectral $k$-partitioning of bipartite graphs.
- Comment the solution.