Processing text_left with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 2118/2118 [00:00<00:00, 3718.40it/s]
Processing text_right with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 18841/18841 [00:07<00:00, 2381.54it/s]
Processing text_right with append: 100%|██████████| 18841/18841 [00:00<00:00, 444321.96it/s]
Building FrequencyFilter from a datapack.: 100%|██████████| 18841/18841 [00:00<00:00, 66270.86it/s]
Processing text_right with transform: 100%|██████████| 18841/18841 [00:00<00:00, 42664.36it/s]
Processing text_left with extend: 100%|██████████| 2118/2118 [00:00<00:00, 330095.71it/s]
Processing text_right with extend: 100%|██████████| 18841/18841 [00:00<00:00, 373073.88it/s]
Building Vocabulary from a datapack.: 100%|██████████| 404415/404415 [00:00<00:00, 1847527.98it/s]
Processing text_left with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 2118/2118 [00:00<00:00, 3771.43it/s]
Processing text_right with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 18841/18841 [00:07<00:00, 2367.53it/s]
Processing text_right with transform: 100%|██████████| 18841/18841 [00:00<00:00, 69352.98it/s]
Processing text_left with transform: 100%|██████████| 2118/2118 [00:00<00:00, 97079.34it/s]
Processing text_right with transform: 100%|██████████| 18841/18841 [00:00<00:00, 74474.07it/s]
Processing length_left with len: 100%|██████████| 2118/2118 [00:00<00:00, 334659.48it/s]
Processing length_right with len: 100%|██████████| 18841/18841 [00:00<00:00, 414870.15it/s]
Processing text_left with transform: 100%|██████████| 2118/2118 [00:00<00:00, 43066.26it/s]
Processing text_right with transform: 100%|██████████| 18841/18841 [00:00<00:00, 32836.35it/s]
Processing text_left with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 122/122 [00:00<00:00, 3487.84it/s]
Processing text_right with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 1115/1115 [00:00<00:00, 2442.70it/s]
Processing text_right with transform: 100%|██████████| 1115/1115 [00:00<00:00, 67645.17it/s]
Processing text_left with transform: 100%|██████████| 122/122 [00:00<00:00, 44196.33it/s]
Processing text_right with transform: 100%|██████████| 1115/1115 [00:00<00:00, 76927.43it/s]
Processing length_left with len: 100%|██████████| 122/122 [00:00<00:00, 122242.02it/s]
Processing length_right with len: 100%|██████████| 1115/1115 [00:00<00:00, 333332.07it/s]
Processing text_left with transform: 100%|██████████| 122/122 [00:00<00:00, 35858.80it/s]
Processing text_right with transform: 100%|██████████| 1115/1115 [00:00<00:00, 28605.98it/s]
Processing text_left with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 237/237 [00:00<00:00, 3830.90it/s]
Processing text_right with chain_transform of Tokenize => Lowercase => PuncRemoval: 100%|██████████| 2300/2300 [00:01<00:00, 2071.67it/s]
Processing text_right with transform: 100%|██████████| 2300/2300 [00:00<00:00, 70304.99it/s]
Processing text_left with transform: 100%|██████████| 237/237 [00:00<00:00, 85642.29it/s]
Processing text_right with transform: 100%|██████████| 2300/2300 [00:00<00:00, 81757.54it/s]
Processing length_left with len: 100%|██████████| 237/237 [00:00<00:00, 145883.48it/s]
Processing length_right with len: 100%|██████████| 2300/2300 [00:00<00:00, 372769.40it/s]
Processing text_left with transform: 100%|██████████| 237/237 [00:00<00:00, 35687.87it/s]
Processing text_right with transform: 100%|██████████| 2300/2300 [00:00<00:00, 33140.49it/s]