In [1]:
library(data.table)

In [2]:
df <- fread('../data/processed/df.csv')


Read 145063 rows and 557 (of 557) columns from 0.418 GB file in 00:00:20

In [3]:
head(df)


Page_raw2015-07-012015-07-022015-07-032015-07-042015-07-052015-07-062015-07-072015-07-082015-07-09...2016-12-282016-12-292016-12-302016-12-31Page_keysIdagentaccessprojectpagename
2NE1_zh.wikipedia.org_all-access_spider 18 11 5 13 14 9 9 22 26 ... 22 19 18 20 !vote_en.wikipedia.org_all-access_all-agents_2017-01-01bf4edcf969af spider all-access zh.wikipedia.org 2NE1
2PM_zh.wikipedia.org_all-access_spider 11 14 15 18 11 13 22 11 10 ... 52 45 26 20 !vote_en.wikipedia.org_all-access_all-agents_2017-01-02929ed2bf52b9 spider all-access zh.wikipedia.org 2PM
3C_zh.wikipedia.org_all-access_spider 1 0 1 1 0 4 0 3 4 ... 6 3 4 17 !vote_en.wikipedia.org_all-access_all-agents_2017-01-03ff29d0f51d5c spider all-access zh.wikipedia.org 3C
4minute_zh.wikipedia.org_all-access_spider 35 13 10 94 4 26 14 9 11 ... 17 19 10 11 !vote_en.wikipedia.org_all-access_all-agents_2017-01-04e98873359be6 spider all-access zh.wikipedia.org 4minute
52_Hz_I_Love_You_zh.wikipedia.org_all-access_spider NA NA NA NA NA NA NA NA NA ... 27 13 36 10 !vote_en.wikipedia.org_all-access_all-agents_2017-01-05fa012434263a spider all-access zh.wikipedia.org 52_Hz_I_Love_You
5566_zh.wikipedia.org_all-access_spider 12 7 4 5 20 8 5 17 24 ... 23 17 17 50 !vote_en.wikipedia.org_all-access_all-agents_2017-01-0648f1e93517a2 spider all-access zh.wikipedia.org 5566

In [ ]: