The purpose of this notebook is to illustrate the method of text analysis using a corpus created from digital content published by the CRTC. This is the second part in a two-part process, the first of which is a description of the code that 'scraped' the CRTC webpage to create the corpus.
In [1]:
# importing code modules
import json
import ijson
from ijson import items
import pandas as pd
import pprint
import nltk
import numpy as np
import networkx as nx
import matplotlib.pyplot as plt
import csv
import sys
import codecs
import nltk
import nltk.collocations
import collections
import statistics
from nltk.metrics.spearman import *
from nltk.collocations import *
from nltk.stem import WordNetLemmatizer
from tabulate import tabulate
In [2]:
# loading the JSON file
filename = "../scrapy/result20.json"
In [3]:
# shows the structure of the json file
for prefix, the_type, value in ijson.parse(open(filename)):
print(prefix, the_type, value)
start_array None
item start_map None
item map_key keywords
item.keywords string Broadcasting
item map_key dateMod
item.dateMod string
item map_key subject
item.subject string Broadcasting
item map_key date
item.date string 1997-08-21
item map_key dateCreated
item.dateCreated string
item map_key dateIssued
item.dateIssued string 2007-09-20
item map_key title
item.title string ARCHIVED - Central Okanagan Re-Broadcast Society - Licence renewal
item map_key docType
item.docType string Decisions
item map_key text
item.text string ARCHIVED -
Decision CRTC 97-477 Decision Ottawa, 21 August 1997 Decision CRTC 97-477 Central Okanagan Re-Broadcast Society Winfield, British Columbia - 199613310 Licence renewal 1. Following Public Notice CRTC 1997-65 dated 26 May 1997, the Commission renews the broadcasting licence for the radiocommunication distribution undertaking serving Winfield from 1 September 1997 to 31 August 2004, subject to the conditions in effect under the current licence, as well as to those conditions specified in the licence to be issued. 2. The Commission notes that this undertaking distributes, in non-encrypted mode, the programming services of The Open Learning Agency (Knowledge Network), on channel 34, with a transmitter power of 100 watts, and CBU-FM Vancouver, on the frequency 91.9 MHz, channel 227, with an effective radiated power of 25 watts. This decision is to be appended to the licence . Laura M. Talbot-Allan Secretary General This document is available in alternative format upon request . Date modified: ????-??-??
item map_key file
item.file string DB97-477
item end_map None
item start_map None
item map_key keywords
item.keywords string Broadcasting
item map_key dateMod
item.dateMod string
item map_key subject
item.subject string Broadcasting
item map_key date
item.date string 1997-11-06
item map_key dateCreated
item.dateCreated string
item map_key dateIssued
item.dateIssued string 2007-09-20
item map_key title
item.title string ARCHIVED - Increase in power for CHKG-FM
item map_key docType
item.docType string Decisions
item map_key text
item.text string ARCHIVED -
Decision CRTC 97-626 Decision Ottawa, 6 November 1997 Decision CRTC 97-626 Fairchild Radio (Vancouver FM) Ltd. Vancouver, British Columbia - 199701024 Increase in power for CHKG-FM 1. Following Public Notice CRTC 1997-116 dated 3 September 1997, the Commission approves the application to amend the broadcasting licence for the radio programming undertaking CHKG-FM Vancouver by increasing the effective radiated power from 40,000 watts to 46,000 watts. 2. Approval for this new undertaking was granted in Decision CRTC 96-288 dated 29 July 1996. The transmitting parameters in the original application were based on using the CFMI-FM New Westminster transmitter site. However, subsequent measurements have shown that alternative transmitting arrangements were necessary. 3. The Commission notes that the licensee proposes to relocate the transmitter to a site approximately 300 metres to the northwest for which no approval is required. The Commission also notes that according to the licensee's statement, there will be no change in the proposed coverage in the populated service areas of the station. 4. The Department of Industry has advised the Commission that this application is conditionally technically acceptable, and that a Broadcasting Certificate will only be issued once it has been determined that the proposed technical parameters will not create any unacceptable interference with aeronautical NAV/COM services. 5. In accordance with subsection 22(1) of the Broadcasting Act , the Commission will only issue the licence amendment, and the authority will only be granted, at such time as written notification is received from the Department of Industry that its technical requirements have been met, and that a Broadcasting Certificate will be issued. This decision is to be appended to the licence. Laura M. Talbot-Allan Secretary General This document is available in alternative format upon request. Date modified: ????-??-??
item map_key file
item.file string DB97-626
item end_map None
item start_map None
item map_key keywords
item.keywords string Telecommunications, Bell Canada, Coopérative Antenne T.V. St-Zacharie, Interim approval, Tariff applications
item map_key dateMod
item.dateMod string 2017-03-15
item map_key subject
item.subject string Telecommunications, Bell Canada, Coopérative Antenne T.V. St-Zacharie, Interim approval, Tariff applications
item map_key date
item.date string
item map_key dateCreated
item.dateCreated string
item map_key dateIssued
item.dateIssued string 2017-03-15
item map_key title
item.title string Various companies – Interim approval of tariff applications
item map_key docType
item.docType string Orders
item map_key text
item.text string Telecom Order CRTC 2017-73 PDF version Ottawa, 15 March 2017 Various companies – Interim approval of tariff applications The Commission approves on an interim basis the following tariff applications: Applicant Tariff Notice and description Date of application Effective date Bell Canada TN 7527 Special Facilities Tariff – Amendment to 56-Kbps Data Channel Service in Kuujjuaraapik, Quebec 2 March 2017 17 March 2017 Coopérative Antenne T.V. St-Zacharie TN 1 TN 1A TN 1B Introduction of Competitive Local Exchange Carrier (CLEC) Tariff 1 February 2017 7 February 2017 6 March 2017 15 March 2017 Revised tariff pages are to be issued within 10 calendar days of the date of this order. Revised tariff pages can be submitted to the Commission without a description page or a request for approval; a tariff application is not required. Secretary General Date modified: 2017-03-15
item map_key file
item.file string 2017-73
item end_map None
item start_map None
item map_key keywords
item.keywords string Telecommunications, Agreements, Property, Municipal governments, AT&T Canada, City of Toronto, Access arrangementsTelecommunications, Agreements, Property, Municipal governments, AT&T Canada, City of Toronto, Access arrangements
item map_key dateMod
item.dateMod string
item map_key subject
item.subject string Telecommunications, Agreements, Property, Municipal governments, AT&T Canada, City of Toronto, Access arrangements
item map_key date
item.date string
item map_key dateCreated
item.dateCreated string 2001-08-31
item map_key dateIssued
item.dateIssued string
item map_key title
item.title string ARCHIVED - Terms and conditions of existing agreements for access to municipal property
item map_key docType
item.docType string Notices of consultation
item map_key text
item.text string ARCHIVED - Public Notice CRTC 2001-99 Public Notice CRTC 2001-99 Ottawa, 31 August 2001 Terms and conditions of existing agreements for access to municipal property Reference: 8690-A4-01/01 In this Public Notice, the Commission initiates a proceeding to consider the circumstances, if any, where the Commission could alter the terms and conditions of an existing property-access contract between a carrier and a municipality. Background 1. On 28 May 2001, AT&T Canada Corp. on behalf of itself and AT&T Canada Telecom Services Company (AT&T Canada) filed a Part VII application requesting relief pursuant to sections 32(d) and (e) and 43(4) of the Telecommunications Act (the Act), naming the City of Toronto (the City) as respondent. AT&T Canada stated that it wanted the Commission to substitute for those terms and conditions of its current access agreement with the City that are inconsistent with the principles set out in Decision CRTC 2001-23 , Ledcor/Vancouver - Construction, operation and maintenance of transmission lines in Vancouver, dated 25 January 2001, the terms and conditions that are based on the principles set out in that decision. 2. On 27 June 2001, the City filed its response to AT&T Canada's application. The City argued that the principles developed in Decision 2001-23 were not applicable in the unique factual circumstances of the contractual situation between itself and AT&T Canada. Further, the City argued that the Commission lacked jurisdiction under the Act to interfere with an existing agreement for access to municipal property like the agreement between itself and AT&T Canada. Scope of proceeding 3. In this proceeding, the Commission will consider, given the framework set out in sections 43(1) to 43(4) and any other relevant provisions of the Act, and the principles laid out in Decision 2001-23 , what circumstances, if any, would justify an intervention by the Commission to alter the terms of an existing contract between a carrier and a municipality for access to municipal rights-of-way. Procedure 4. The contract filed in the Part VII application as Appendix "A" to AT&T Canada's application is made part of the record in this proceeding. 5. AT&T Canada and the City are made parties to this proceeding. 6. Other parties wishing to participate in this proceeding must notify the Commission of their intention to do so by 1 October 2001. These parties should contact the Secretary General, by mail at CRTC, Ottawa, Ontario, K1A 0N2; by fax at (819) 953-0795; or by email at procedure@crtc.gc.ca. They are to indicate in the notice their email address, where available. If parties do not have access to the Internet, they are to indicate in their notice whether they wish to receive disk versions of hard copy filings. 7. The Commission will issue, as soon as possible after the registration date, a complete list of interested parties and their mailing addresses (including their email address, if available), identifying those parties who wish to receive disk versions. 8. All parties may submit comments on the circumstances, if any, for Commission intervention, serving a copy of their submission on all the parties on the interested parties list, by 29 October 2001. Submissions longer than five pages should include a summary. In order to streamline the process and reduce the workload for all concerned, the Commission encourages parties with similar interests to file joint submissions and to participate jointly in subsequent stages of the proceeding. 9. Parties may file reply comments with the Commission, serving a copy on those parties who filed comments, by 28 November 2001. Submissions longer than five pages should include a summary. 10. Where a document is to be filed or served by a specific date, the document must be actually received, and not merely sent, by that date. 11. Parties wishing to file electronic versions of their comments can do so by email at the address shown above, or on diskette. 12. The electronic version should be in the HTML format. As an alternative, those submitting comments may use "Microsoft Word" for text and "Microsoft Excel" for spreadsheets. 13. Please number each paragraph of your submission. In addition, please enter the line ***End of document*** following the last paragraph. This will help the Commission verify that the document has not been damaged during transmission. 14. The Commission will make submissions filed in electronic form available on its web site at www.crtc.gc.ca in the official language and format in which they are submitted. This will make it easier for members of the public to consult the documents. 15. The Commission also encourages interested parties to monitor the public examination file (and/or the Commission's web site) for additional information that they may find useful when preparing their submission. 16. Submissions may be examined or will be made available promptly upon request at the Commission offices during normal business hours: Central Building Les Terasses de la Chaudière 1 Promenade du Portage, Room G-5 Hull, Quebec K1A 0N2 Tel: (819) 997-2429 - TDD: 994-0423 Fax: (819) 994-0218 Bank of Commerce Building 1809 Barrington Street Suite 1007 Halifax, Nova Scotia B3J 3K8 Tel: (902) 426-7997 - TDD: 426-6997 Fax: (902) 426-2721 405 de Maisonneuve Blvd. East 2 nd Floor, Suite B2300 Montréal, Quebec H2L 4J5 Tel: (514) 283-6607 - TDD: 283-8316 Fax: (514) 283-3689 55 St. Clair Avenue East Suite 624 Toronto, Ontario M4T 1M2 Tel: (416) 952-9096 Fax: (416) 954-6343 Kensington Building 275 Portage Avenue Suite 1810 Winnipeg, Manitoba R3B 2B3 Tel: (204) 983-6306 - TDD:983-8274 Fax: (204) 983-6317 Cornwall Professional Building 2125 - 11 th Avenue Room 103 Regina, Saskatchewan S4P 3X3 Tel: (306) 780-3422 Fax: (306) 780-3319 10405 Jasper Avenue Suite 520 Edmonton, Alberta T5J 3N4 Tel: (780) 495-3224 Fax: (780) 495-3214 530-580 Hornby Street Vancouver, British Columbia V6C 3B6 Tel: (604) 666-2111 - TDD:666-0778 Fax: (604) 666-8322 Secretary General This document is available in alternative format upon request, and may also be examined at the following Internet site: http://www.crtc.gc.ca Date Modified: 2001-08-31 Date modified: 2001-08-31
item map_key file
item.file string PT2001-99
item end_map None
item start_map None
item map_key keywords
item.keywords string Broadcasting, Specialty services, Ten Broadcasting, Hustler TV, X Channel, Broadcast day, Conditions of licence, Category 2 services, Approval
item map_key dateMod
item.dateMod string
item map_key subject
item.subject string Broadcasting, Specialty services, Ten Broadcasting, Hustler TV, X Channel, Broadcast day, Conditions of licence, Category 2 services, Approval
item map_key date
item.date string 2005-08-02
item map_key dateCreated
item.dateCreated string
item map_key dateIssued
item.dateIssued string 2007-09-20
item map_key title
item.title string ARCHIVED - Hustler TV and X Channel - Amendment to the definition of broadcast day
item map_key docType
item.docType string Decisions
item map_key text
item.text string ARCHIVED - Broadcasting Decision CRTC 2005-362 Broadcasting Decision CRTC 2005-362 Ottawa, 2 August 2005 Ten Broadcasting Inc. Across Canada Application 2005-0033-1 Broadcasting Public Notice CRTC 2005-53 19 May 2005 Hustler TV and X Channel - Amendment to the definition of broadcast day 1. The Commission approves the application by Ten Broadcasting Inc. to amend the definition of the term "broadcast day" for the purposes of the conditions of licence for the national English-language Category 2 specialty programming undertakings known as Hustler TV and X Channel. 2. The Commission did not receive any interventions in connection with this application. 3. The broadcast day for both Hustler TV and X Channel is currently defined as the 24-hour period beginning at 6:00 a.m. each day. The licensee requested that the definition of the broadcast day for each of these undertakings be amended, effective 1 November 2004, to mean a 24-hour period beginning at midnight (00:00) each day. 4. In accordance with its usual practice for amendments to the definition of broadcast day, the Commission finds that it is appropriate to implement the revised definition, effective 1 September 2005. Accordingly, the definition of the broadcast day for the purposes of both Hustler TV's and X Channel's conditions of licence will read as follows: Beginning 1 September 2005, "broadcast day" means a 24-hour period beginning each day at midnight (00:00), or any other period approved by the Commission. 5. In approving this application, the Commission has taken into account the fact that the revised definition of the broadcast day will not adversely affect the conditions of licence pertaining to Hustler TV's and X Channel's nature of service or any other of their conditions of licence. Secretary General This decision is to be appended to each licence. It is available in alternative format upon request, and may also be examined in PDF format or in HTML at the following Internet site: http://www.crtc.gc.ca Date Modified: 2005-08-02 Date modified: 2005-08-02
item map_key file
item.file string db2005-362
item end_map None
item start_map None
item map_key keywords
item.keywords string Broadcasting, Procedures, Television, Compliance, Requirements, Program logs, Television programming services, Amendments, Regulation, Filings
item map_key dateMod
item.dateMod string 2011-10-18
item map_key subject
item.subject string Broadcasting, Procedures, Television, Compliance, Requirements, Program logs, Television programming services, Amendments, Regulation, Filings
item map_key date
item.date string
item map_key dateCreated
item.dateCreated string
item map_key dateIssued
item.dateIssued string 2011-10-18
item map_key title
item.title string Procedures for television services in non-compliance with logging requirements
item map_key docType
item.docType string Information bulletins
item map_key text
item.text string Broadcasting Information Bulletin CRTC 2011-650 PDF version Ottawa, 18 October 2011 Procedures for television services in non-compliance with logging requirements Introduction 1. A program log is a record of all the programming aired by a licensee in a given month. It is submitted to the Commission in a specific format and sent to the program logs filing website. 2. Within 30 days after the end of each month, licensees of television programming services must file with the Commission the program log or computer record for that month and a certificate attesting to the accuracy of its contents signed by the licensee or a representative. Current procedures 3. Under the Commission’s current procedures, licensees that fail to meet the regulatory requirements for logging receive a letter of non-compliance from Commission staff requesting the submission of the appropriate logs. If the non-compliance continues after the receipt of this letter, licensees receive further letters from Commission staff again requesting the submission of the appropriate logs. New approach 4. Following a review of its procedures, the Commission has decided to modify its approach to non-compliance with logging requirements. Specifically, Commission staff will continue to send a first letter to licensees that do not meet these requirements. However, if a licensee remains in non-compliance, the Commission will send a second letter signed by the Secretary General. This second letter will stipulate that the licensee is in breach of its regulatory requirements and: must submit the required logs within 30 days from the date of the letter and describe the steps taken to ensure future compliance; and may be subject to one or more regulatory sanctions should it fail to submit the required information. 5. The above-noted regulatory sanctions could include: returning future applications of any type by the licensee in non-compliance concerning any current or proposed services, including applications for new Category B licences and extensions of deadlines for beginning the operation of new Category B services, as well as Part 1 and licence renewal applications; imposing a condition of licence requiring the licensee to contribute towards a new or existing independent Canadian production fund; and initiating a mandatory order proceeding. 6. In the case of a licensee that is in non-compliance as of the date of this information bulletin and has failed to respond within the allotted time to a non-compliance letter from Commission staff, the Commission will immediately issue a letter from the Secretary General indicating that the licensee has 60 days to submit the appropriate logs. If the licensee fails to do so, it may be subject to one or more of the above-noted sanctions. 7. Licensees are also advised that the Commission will note any non-compliance with regulatory requirements in the next renewal decision for the affected television services. Such non-compliance could result in a short-term renewal. Secretary General Date modified: 2011-10-18
item map_key file
item.file string 2011-650
item end_map None
end_array None
In [4]:
# reads the file and assigns the keys and values to a Python dictionary structure
with open(filename, 'r') as f:
objects = ijson.items(f, 'item')
file = list(objects)
In [5]:
# checks to see how many records we're viewing
print(len(file))
6
In [8]:
# prints all records in the file
file[0]
Out[8]:
{'date': '1997-08-21',
'dateCreated': '',
'dateIssued': '2007-09-20',
'dateMod': '',
'docType': 'Decisions',
'file': 'DB97-477',
'keywords': 'Broadcasting',
'subject': 'Broadcasting',
'text': 'ARCHIVED\xa0-\xa0 \r\nDecision CRTC 97-477 Decision Ottawa, 21 August 1997 Decision CRTC 97-477 Central Okanagan Re-Broadcast Society Winfield, British Columbia - 199613310 Licence renewal 1. Following Public Notice CRTC 1997-65 dated 26 May 1997, the Commission renews the broadcasting licence for the radiocommunication distribution undertaking serving Winfield from 1 September 1997 to 31 August 2004, subject to the conditions in effect under the current licence, as well as to those conditions specified in the licence to be issued. 2. The Commission notes that this undertaking distributes, in non-encrypted mode, the programming services of The Open Learning Agency (Knowledge Network), on channel 34, with a transmitter power of 100 watts, and CBU-FM Vancouver, on the frequency 91.9 MHz, channel 227, with an effective radiated power of 25 watts. This decision is to be appended to the licence . Laura M. Talbot-Allan Secretary General This document is available in alternative format upon request . Date modified: ????-??-??',
'title': 'ARCHIVED - Central Okanagan Re-Broadcast Society - Licence renewal'}
In [9]:
# iterates through each record in the file
for row in file:
# prints each record based on the 'key' value
print(row['docType'],":",row['title'])
Decisions : ARCHIVED - Central Okanagan Re-Broadcast Society - Licence renewal
Decisions : ARCHIVED - Increase in power for CHKG-FM
Orders : Various companies – Interim approval of tariff applications
Notices of consultation : ARCHIVED - Terms and conditions of existing agreements for access to municipal property
Decisions : ARCHIVED - Hustler TV and X Channel - Amendment to the definition of broadcast day
Information bulletins : Procedures for television services in non-compliance with logging requirements
In [10]:
# creates a list of the file numbers
file_nums = []
for row in file:
file_nums.append(row['file'])
In [20]:
file_nums
Out[20]:
['DB97-477', 'DB97-626', '2017-73', 'PT2001-99', 'db2005-362', '2011-650']
In [24]:
# collects all of the words from the 'text' key
all_chars = []
for row in file:
# splits the text into a list of words
text = nltk.word_tokenize(row['text'])
# makes all words lowercase
clean = [w.lower() for w in text]
all_chars.append(clean)
In [25]:
print(all_chars)
[['archived', '-', 'decision', 'crtc', '97-477', 'decision', 'ottawa', ',', '21', 'august', '1997', 'decision', 'crtc', '97-477', 'central', 'okanagan', 're-broadcast', 'society', 'winfield', ',', 'british', 'columbia', '-', '199613310', 'licence', 'renewal', '1', '.', 'following', 'public', 'notice', 'crtc', '1997-65', 'dated', '26', 'may', '1997', ',', 'the', 'commission', 'renews', 'the', 'broadcasting', 'licence', 'for', 'the', 'radiocommunication', 'distribution', 'undertaking', 'serving', 'winfield', 'from', '1', 'september', '1997', 'to', '31', 'august', '2004', ',', 'subject', 'to', 'the', 'conditions', 'in', 'effect', 'under', 'the', 'current', 'licence', ',', 'as', 'well', 'as', 'to', 'those', 'conditions', 'specified', 'in', 'the', 'licence', 'to', 'be', 'issued', '.', '2', '.', 'the', 'commission', 'notes', 'that', 'this', 'undertaking', 'distributes', ',', 'in', 'non-encrypted', 'mode', ',', 'the', 'programming', 'services', 'of', 'the', 'open', 'learning', 'agency', '(', 'knowledge', 'network', ')', ',', 'on', 'channel', '34', ',', 'with', 'a', 'transmitter', 'power', 'of', '100', 'watts', ',', 'and', 'cbu-fm', 'vancouver', ',', 'on', 'the', 'frequency', '91.9', 'mhz', ',', 'channel', '227', ',', 'with', 'an', 'effective', 'radiated', 'power', 'of', '25', 'watts', '.', 'this', 'decision', 'is', 'to', 'be', 'appended', 'to', 'the', 'licence', '.', 'laura', 'm.', 'talbot-allan', 'secretary', 'general', 'this', 'document', 'is', 'available', 'in', 'alternative', 'format', 'upon', 'request', '.', 'date', 'modified', ':', '?', '?', '?', '?', '-', '?', '?', '-', '?', '?'], ['archived', '-', 'decision', 'crtc', '97-626', 'decision', 'ottawa', ',', '6', 'november', '1997', 'decision', 'crtc', '97-626', 'fairchild', 'radio', '(', 'vancouver', 'fm', ')', 'ltd.', 'vancouver', ',', 'british', 'columbia', '-', '199701024', 'increase', 'in', 'power', 'for', 'chkg-fm', '1', '.', 'following', 'public', 'notice', 'crtc', '1997-116', 'dated', '3', 'september', '1997', ',', 'the', 'commission', 'approves', 'the', 'application', 'to', 'amend', 'the', 'broadcasting', 'licence', 'for', 'the', 'radio', 'programming', 'undertaking', 'chkg-fm', 'vancouver', 'by', 'increasing', 'the', 'effective', 'radiated', 'power', 'from', '40,000', 'watts', 'to', '46,000', 'watts', '.', '2', '.', 'approval', 'for', 'this', 'new', 'undertaking', 'was', 'granted', 'in', 'decision', 'crtc', '96-288', 'dated', '29', 'july', '1996', '.', 'the', 'transmitting', 'parameters', 'in', 'the', 'original', 'application', 'were', 'based', 'on', 'using', 'the', 'cfmi-fm', 'new', 'westminster', 'transmitter', 'site', '.', 'however', ',', 'subsequent', 'measurements', 'have', 'shown', 'that', 'alternative', 'transmitting', 'arrangements', 'were', 'necessary', '.', '3', '.', 'the', 'commission', 'notes', 'that', 'the', 'licensee', 'proposes', 'to', 'relocate', 'the', 'transmitter', 'to', 'a', 'site', 'approximately', '300', 'metres', 'to', 'the', 'northwest', 'for', 'which', 'no', 'approval', 'is', 'required', '.', 'the', 'commission', 'also', 'notes', 'that', 'according', 'to', 'the', 'licensee', "'s", 'statement', ',', 'there', 'will', 'be', 'no', 'change', 'in', 'the', 'proposed', 'coverage', 'in', 'the', 'populated', 'service', 'areas', 'of', 'the', 'station', '.', '4', '.', 'the', 'department', 'of', 'industry', 'has', 'advised', 'the', 'commission', 'that', 'this', 'application', 'is', 'conditionally', 'technically', 'acceptable', ',', 'and', 'that', 'a', 'broadcasting', 'certificate', 'will', 'only', 'be', 'issued', 'once', 'it', 'has', 'been', 'determined', 'that', 'the', 'proposed', 'technical', 'parameters', 'will', 'not', 'create', 'any', 'unacceptable', 'interference', 'with', 'aeronautical', 'nav/com', 'services', '.', '5', '.', 'in', 'accordance', 'with', 'subsection', '22', '(', '1', ')', 'of', 'the', 'broadcasting', 'act', ',', 'the', 'commission', 'will', 'only', 'issue', 'the', 'licence', 'amendment', ',', 'and', 'the', 'authority', 'will', 'only', 'be', 'granted', ',', 'at', 'such', 'time', 'as', 'written', 'notification', 'is', 'received', 'from', 'the', 'department', 'of', 'industry', 'that', 'its', 'technical', 'requirements', 'have', 'been', 'met', ',', 'and', 'that', 'a', 'broadcasting', 'certificate', 'will', 'be', 'issued', '.', 'this', 'decision', 'is', 'to', 'be', 'appended', 'to', 'the', 'licence', '.', 'laura', 'm.', 'talbot-allan', 'secretary', 'general', 'this', 'document', 'is', 'available', 'in', 'alternative', 'format', 'upon', 'request', '.', 'date', 'modified', ':', '?', '?', '?', '?', '-', '?', '?', '-', '?', '?'], ['telecom', 'order', 'crtc', '2017-73', 'pdf', 'version', 'ottawa', ',', '15', 'march', '2017', 'various', 'companies', '–', 'interim', 'approval', 'of', 'tariff', 'applications', 'the', 'commission', 'approves', 'on', 'an', 'interim', 'basis', 'the', 'following', 'tariff', 'applications', ':', 'applicant', 'tariff', 'notice', 'and', 'description', 'date', 'of', 'application', 'effective', 'date', 'bell', 'canada', 'tn', '7527', 'special', 'facilities', 'tariff', '–', 'amendment', 'to', '56-kbps', 'data', 'channel', 'service', 'in', 'kuujjuaraapik', ',', 'quebec', '2', 'march', '2017', '17', 'march', '2017', 'coopérative', 'antenne', 't.v', '.', 'st-zacharie', 'tn', '1', 'tn', '1a', 'tn', '1b', 'introduction', 'of', 'competitive', 'local', 'exchange', 'carrier', '(', 'clec', ')', 'tariff', '1', 'february', '2017', '7', 'february', '2017', '6', 'march', '2017', '15', 'march', '2017', 'revised', 'tariff', 'pages', 'are', 'to', 'be', 'issued', 'within', '10', 'calendar', 'days', 'of', 'the', 'date', 'of', 'this', 'order', '.', 'revised', 'tariff', 'pages', 'can', 'be', 'submitted', 'to', 'the', 'commission', 'without', 'a', 'description', 'page', 'or', 'a', 'request', 'for', 'approval', ';', 'a', 'tariff', 'application', 'is', 'not', 'required', '.', 'secretary', 'general', 'date', 'modified', ':', '2017-03-15'], ['archived', '-', 'public', 'notice', 'crtc', '2001-99', 'public', 'notice', 'crtc', '2001-99', 'ottawa', ',', '31', 'august', '2001', 'terms', 'and', 'conditions', 'of', 'existing', 'agreements', 'for', 'access', 'to', 'municipal', 'property', 'reference', ':', '8690-a4-01/01', 'in', 'this', 'public', 'notice', ',', 'the', 'commission', 'initiates', 'a', 'proceeding', 'to', 'consider', 'the', 'circumstances', ',', 'if', 'any', ',', 'where', 'the', 'commission', 'could', 'alter', 'the', 'terms', 'and', 'conditions', 'of', 'an', 'existing', 'property-access', 'contract', 'between', 'a', 'carrier', 'and', 'a', 'municipality', '.', 'background', '1', '.', 'on', '28', 'may', '2001', ',', 'at', '&', 't', 'canada', 'corp.', 'on', 'behalf', 'of', 'itself', 'and', 'at', '&', 't', 'canada', 'telecom', 'services', 'company', '(', 'at', '&', 't', 'canada', ')', 'filed', 'a', 'part', 'vii', 'application', 'requesting', 'relief', 'pursuant', 'to', 'sections', '32', '(', 'd', ')', 'and', '(', 'e', ')', 'and', '43', '(', '4', ')', 'of', 'the', 'telecommunications', 'act', '(', 'the', 'act', ')', ',', 'naming', 'the', 'city', 'of', 'toronto', '(', 'the', 'city', ')', 'as', 'respondent', '.', 'at', '&', 't', 'canada', 'stated', 'that', 'it', 'wanted', 'the', 'commission', 'to', 'substitute', 'for', 'those', 'terms', 'and', 'conditions', 'of', 'its', 'current', 'access', 'agreement', 'with', 'the', 'city', 'that', 'are', 'inconsistent', 'with', 'the', 'principles', 'set', 'out', 'in', 'decision', 'crtc', '2001-23', ',', 'ledcor/vancouver', '-', 'construction', ',', 'operation', 'and', 'maintenance', 'of', 'transmission', 'lines', 'in', 'vancouver', ',', 'dated', '25', 'january', '2001', ',', 'the', 'terms', 'and', 'conditions', 'that', 'are', 'based', 'on', 'the', 'principles', 'set', 'out', 'in', 'that', 'decision', '.', '2', '.', 'on', '27', 'june', '2001', ',', 'the', 'city', 'filed', 'its', 'response', 'to', 'at', '&', 't', 'canada', "'s", 'application', '.', 'the', 'city', 'argued', 'that', 'the', 'principles', 'developed', 'in', 'decision', '2001-23', 'were', 'not', 'applicable', 'in', 'the', 'unique', 'factual', 'circumstances', 'of', 'the', 'contractual', 'situation', 'between', 'itself', 'and', 'at', '&', 't', 'canada', '.', 'further', ',', 'the', 'city', 'argued', 'that', 'the', 'commission', 'lacked', 'jurisdiction', 'under', 'the', 'act', 'to', 'interfere', 'with', 'an', 'existing', 'agreement', 'for', 'access', 'to', 'municipal', 'property', 'like', 'the', 'agreement', 'between', 'itself', 'and', 'at', '&', 't', 'canada', '.', 'scope', 'of', 'proceeding', '3', '.', 'in', 'this', 'proceeding', ',', 'the', 'commission', 'will', 'consider', ',', 'given', 'the', 'framework', 'set', 'out', 'in', 'sections', '43', '(', '1', ')', 'to', '43', '(', '4', ')', 'and', 'any', 'other', 'relevant', 'provisions', 'of', 'the', 'act', ',', 'and', 'the', 'principles', 'laid', 'out', 'in', 'decision', '2001-23', ',', 'what', 'circumstances', ',', 'if', 'any', ',', 'would', 'justify', 'an', 'intervention', 'by', 'the', 'commission', 'to', 'alter', 'the', 'terms', 'of', 'an', 'existing', 'contract', 'between', 'a', 'carrier', 'and', 'a', 'municipality', 'for', 'access', 'to', 'municipal', 'rights-of-way', '.', 'procedure', '4', '.', 'the', 'contract', 'filed', 'in', 'the', 'part', 'vii', 'application', 'as', 'appendix', '``', 'a', "''", 'to', 'at', '&', 't', 'canada', "'s", 'application', 'is', 'made', 'part', 'of', 'the', 'record', 'in', 'this', 'proceeding', '.', '5', '.', 'at', '&', 't', 'canada', 'and', 'the', 'city', 'are', 'made', 'parties', 'to', 'this', 'proceeding', '.', '6', '.', 'other', 'parties', 'wishing', 'to', 'participate', 'in', 'this', 'proceeding', 'must', 'notify', 'the', 'commission', 'of', 'their', 'intention', 'to', 'do', 'so', 'by', '1', 'october', '2001', '.', 'these', 'parties', 'should', 'contact', 'the', 'secretary', 'general', ',', 'by', 'mail', 'at', 'crtc', ',', 'ottawa', ',', 'ontario', ',', 'k1a', '0n2', ';', 'by', 'fax', 'at', '(', '819', ')', '953-0795', ';', 'or', 'by', 'email', 'at', 'procedure', '@', 'crtc.gc.ca', '.', 'they', 'are', 'to', 'indicate', 'in', 'the', 'notice', 'their', 'email', 'address', ',', 'where', 'available', '.', 'if', 'parties', 'do', 'not', 'have', 'access', 'to', 'the', 'internet', ',', 'they', 'are', 'to', 'indicate', 'in', 'their', 'notice', 'whether', 'they', 'wish', 'to', 'receive', 'disk', 'versions', 'of', 'hard', 'copy', 'filings', '.', '7', '.', 'the', 'commission', 'will', 'issue', ',', 'as', 'soon', 'as', 'possible', 'after', 'the', 'registration', 'date', ',', 'a', 'complete', 'list', 'of', 'interested', 'parties', 'and', 'their', 'mailing', 'addresses', '(', 'including', 'their', 'email', 'address', ',', 'if', 'available', ')', ',', 'identifying', 'those', 'parties', 'who', 'wish', 'to', 'receive', 'disk', 'versions', '.', '8', '.', 'all', 'parties', 'may', 'submit', 'comments', 'on', 'the', 'circumstances', ',', 'if', 'any', ',', 'for', 'commission', 'intervention', ',', 'serving', 'a', 'copy', 'of', 'their', 'submission', 'on', 'all', 'the', 'parties', 'on', 'the', 'interested', 'parties', 'list', ',', 'by', '29', 'october', '2001', '.', 'submissions', 'longer', 'than', 'five', 'pages', 'should', 'include', 'a', 'summary', '.', 'in', 'order', 'to', 'streamline', 'the', 'process', 'and', 'reduce', 'the', 'workload', 'for', 'all', 'concerned', ',', 'the', 'commission', 'encourages', 'parties', 'with', 'similar', 'interests', 'to', 'file', 'joint', 'submissions', 'and', 'to', 'participate', 'jointly', 'in', 'subsequent', 'stages', 'of', 'the', 'proceeding', '.', '9', '.', 'parties', 'may', 'file', 'reply', 'comments', 'with', 'the', 'commission', ',', 'serving', 'a', 'copy', 'on', 'those', 'parties', 'who', 'filed', 'comments', ',', 'by', '28', 'november', '2001', '.', 'submissions', 'longer', 'than', 'five', 'pages', 'should', 'include', 'a', 'summary', '.', '10', '.', 'where', 'a', 'document', 'is', 'to', 'be', 'filed', 'or', 'served', 'by', 'a', 'specific', 'date', ',', 'the', 'document', 'must', 'be', 'actually', 'received', ',', 'and', 'not', 'merely', 'sent', ',', 'by', 'that', 'date', '.', '11', '.', 'parties', 'wishing', 'to', 'file', 'electronic', 'versions', 'of', 'their', 'comments', 'can', 'do', 'so', 'by', 'email', 'at', 'the', 'address', 'shown', 'above', ',', 'or', 'on', 'diskette', '.', '12', '.', 'the', 'electronic', 'version', 'should', 'be', 'in', 'the', 'html', 'format', '.', 'as', 'an', 'alternative', ',', 'those', 'submitting', 'comments', 'may', 'use', '``', 'microsoft', 'word', "''", 'for', 'text', 'and', '``', 'microsoft', 'excel', "''", 'for', 'spreadsheets', '.', '13', '.', 'please', 'number', 'each', 'paragraph', 'of', 'your', 'submission', '.', 'in', 'addition', ',', 'please', 'enter', 'the', 'line', '***end', 'of', 'document***', 'following', 'the', 'last', 'paragraph', '.', 'this', 'will', 'help', 'the', 'commission', 'verify', 'that', 'the', 'document', 'has', 'not', 'been', 'damaged', 'during', 'transmission', '.', '14', '.', 'the', 'commission', 'will', 'make', 'submissions', 'filed', 'in', 'electronic', 'form', 'available', 'on', 'its', 'web', 'site', 'at', 'www.crtc.gc.ca', 'in', 'the', 'official', 'language', 'and', 'format', 'in', 'which', 'they', 'are', 'submitted', '.', 'this', 'will', 'make', 'it', 'easier', 'for', 'members', 'of', 'the', 'public', 'to', 'consult', 'the', 'documents', '.', '15', '.', 'the', 'commission', 'also', 'encourages', 'interested', 'parties', 'to', 'monitor', 'the', 'public', 'examination', 'file', '(', 'and/or', 'the', 'commission', "'s", 'web', 'site', ')', 'for', 'additional', 'information', 'that', 'they', 'may', 'find', 'useful', 'when', 'preparing', 'their', 'submission', '.', '16', '.', 'submissions', 'may', 'be', 'examined', 'or', 'will', 'be', 'made', 'available', 'promptly', 'upon', 'request', 'at', 'the', 'commission', 'offices', 'during', 'normal', 'business', 'hours', ':', 'central', 'building', 'les', 'terasses', 'de', 'la', 'chaudière', '1', 'promenade', 'du', 'portage', ',', 'room', 'g-5', 'hull', ',', 'quebec', 'k1a', '0n2', 'tel', ':', '(', '819', ')', '997-2429', '-', 'tdd', ':', '994-0423', 'fax', ':', '(', '819', ')', '994-0218', 'bank', 'of', 'commerce', 'building', '1809', 'barrington', 'street', 'suite', '1007', 'halifax', ',', 'nova', 'scotia', 'b3j', '3k8', 'tel', ':', '(', '902', ')', '426-7997', '-', 'tdd', ':', '426-6997', 'fax', ':', '(', '902', ')', '426-2721', '405', 'de', 'maisonneuve', 'blvd', '.', 'east', '2', 'nd', 'floor', ',', 'suite', 'b2300', 'montréal', ',', 'quebec', 'h2l', '4j5', 'tel', ':', '(', '514', ')', '283-6607', '-', 'tdd', ':', '283-8316', 'fax', ':', '(', '514', ')', '283-3689', '55', 'st.', 'clair', 'avenue', 'east', 'suite', '624', 'toronto', ',', 'ontario', 'm4t', '1m2', 'tel', ':', '(', '416', ')', '952-9096', 'fax', ':', '(', '416', ')', '954-6343', 'kensington', 'building', '275', 'portage', 'avenue', 'suite', '1810', 'winnipeg', ',', 'manitoba', 'r3b', '2b3', 'tel', ':', '(', '204', ')', '983-6306', '-', 'tdd:983-8274', 'fax', ':', '(', '204', ')', '983-6317', 'cornwall', 'professional', 'building', '2125', '-', '11', 'th', 'avenue', 'room', '103', 'regina', ',', 'saskatchewan', 's4p', '3x3', 'tel', ':', '(', '306', ')', '780-3422', 'fax', ':', '(', '306', ')', '780-3319', '10405', 'jasper', 'avenue', 'suite', '520', 'edmonton', ',', 'alberta', 't5j', '3n4', 'tel', ':', '(', '780', ')', '495-3224', 'fax', ':', '(', '780', ')', '495-3214', '530-580', 'hornby', 'street', 'vancouver', ',', 'british', 'columbia', 'v6c', '3b6', 'tel', ':', '(', '604', ')', '666-2111', '-', 'tdd:666-0778', 'fax', ':', '(', '604', ')', '666-8322', 'secretary', 'general', 'this', 'document', 'is', 'available', 'in', 'alternative', 'format', 'upon', 'request', ',', 'and', 'may', 'also', 'be', 'examined', 'at', 'the', 'following', 'internet', 'site', ':', 'http', ':', '//www.crtc.gc.ca', 'date', 'modified', ':', '2001-08-31', 'date', 'modified', ':', '2001-08-31'], ['archived', '-', 'broadcasting', 'decision', 'crtc', '2005-362', 'broadcasting', 'decision', 'crtc', '2005-362', 'ottawa', ',', '2', 'august', '2005', 'ten', 'broadcasting', 'inc.', 'across', 'canada', 'application', '2005-0033-1', 'broadcasting', 'public', 'notice', 'crtc', '2005-53', '19', 'may', '2005', 'hustler', 'tv', 'and', 'x', 'channel', '-', 'amendment', 'to', 'the', 'definition', 'of', 'broadcast', 'day', '1', '.', 'the', 'commission', 'approves', 'the', 'application', 'by', 'ten', 'broadcasting', 'inc.', 'to', 'amend', 'the', 'definition', 'of', 'the', 'term', '``', 'broadcast', 'day', "''", 'for', 'the', 'purposes', 'of', 'the', 'conditions', 'of', 'licence', 'for', 'the', 'national', 'english-language', 'category', '2', 'specialty', 'programming', 'undertakings', 'known', 'as', 'hustler', 'tv', 'and', 'x', 'channel', '.', '2', '.', 'the', 'commission', 'did', 'not', 'receive', 'any', 'interventions', 'in', 'connection', 'with', 'this', 'application', '.', '3', '.', 'the', 'broadcast', 'day', 'for', 'both', 'hustler', 'tv', 'and', 'x', 'channel', 'is', 'currently', 'defined', 'as', 'the', '24-hour', 'period', 'beginning', 'at', '6:00', 'a.m.', 'each', 'day', '.', 'the', 'licensee', 'requested', 'that', 'the', 'definition', 'of', 'the', 'broadcast', 'day', 'for', 'each', 'of', 'these', 'undertakings', 'be', 'amended', ',', 'effective', '1', 'november', '2004', ',', 'to', 'mean', 'a', '24-hour', 'period', 'beginning', 'at', 'midnight', '(', '00:00', ')', 'each', 'day', '.', '4', '.', 'in', 'accordance', 'with', 'its', 'usual', 'practice', 'for', 'amendments', 'to', 'the', 'definition', 'of', 'broadcast', 'day', ',', 'the', 'commission', 'finds', 'that', 'it', 'is', 'appropriate', 'to', 'implement', 'the', 'revised', 'definition', ',', 'effective', '1', 'september', '2005', '.', 'accordingly', ',', 'the', 'definition', 'of', 'the', 'broadcast', 'day', 'for', 'the', 'purposes', 'of', 'both', 'hustler', 'tv', "'s", 'and', 'x', 'channel', "'s", 'conditions', 'of', 'licence', 'will', 'read', 'as', 'follows', ':', 'beginning', '1', 'september', '2005', ',', '``', 'broadcast', 'day', "''", 'means', 'a', '24-hour', 'period', 'beginning', 'each', 'day', 'at', 'midnight', '(', '00:00', ')', ',', 'or', 'any', 'other', 'period', 'approved', 'by', 'the', 'commission', '.', '5', '.', 'in', 'approving', 'this', 'application', ',', 'the', 'commission', 'has', 'taken', 'into', 'account', 'the', 'fact', 'that', 'the', 'revised', 'definition', 'of', 'the', 'broadcast', 'day', 'will', 'not', 'adversely', 'affect', 'the', 'conditions', 'of', 'licence', 'pertaining', 'to', 'hustler', 'tv', "'s", 'and', 'x', 'channel', "'s", 'nature', 'of', 'service', 'or', 'any', 'other', 'of', 'their', 'conditions', 'of', 'licence', '.', 'secretary', 'general', 'this', 'decision', 'is', 'to', 'be', 'appended', 'to', 'each', 'licence', '.', 'it', 'is', 'available', 'in', 'alternative', 'format', 'upon', 'request', ',', 'and', 'may', 'also', 'be', 'examined', 'in', 'pdf', 'format', 'or', 'in', 'html', 'at', 'the', 'following', 'internet', 'site', ':', 'http', ':', '//www.crtc.gc.ca', 'date', 'modified', ':', '2005-08-02', 'date', 'modified', ':', '2005-08-02'], ['broadcasting', 'information', 'bulletin', 'crtc', '2011-650', 'pdf', 'version', 'ottawa', ',', '18', 'october', '2011', 'procedures', 'for', 'television', 'services', 'in', 'non-compliance', 'with', 'logging', 'requirements', 'introduction', '1', '.', 'a', 'program', 'log', 'is', 'a', 'record', 'of', 'all', 'the', 'programming', 'aired', 'by', 'a', 'licensee', 'in', 'a', 'given', 'month', '.', 'it', 'is', 'submitted', 'to', 'the', 'commission', 'in', 'a', 'specific', 'format', 'and', 'sent', 'to', 'the', 'program', 'logs', 'filing', 'website', '.', '2', '.', 'within', '30', 'days', 'after', 'the', 'end', 'of', 'each', 'month', ',', 'licensees', 'of', 'television', 'programming', 'services', 'must', 'file', 'with', 'the', 'commission', 'the', 'program', 'log', 'or', 'computer', 'record', 'for', 'that', 'month', 'and', 'a', 'certificate', 'attesting', 'to', 'the', 'accuracy', 'of', 'its', 'contents', 'signed', 'by', 'the', 'licensee', 'or', 'a', 'representative', '.', 'current', 'procedures', '3', '.', 'under', 'the', 'commission’s', 'current', 'procedures', ',', 'licensees', 'that', 'fail', 'to', 'meet', 'the', 'regulatory', 'requirements', 'for', 'logging', 'receive', 'a', 'letter', 'of', 'non-compliance', 'from', 'commission', 'staff', 'requesting', 'the', 'submission', 'of', 'the', 'appropriate', 'logs', '.', 'if', 'the', 'non-compliance', 'continues', 'after', 'the', 'receipt', 'of', 'this', 'letter', ',', 'licensees', 'receive', 'further', 'letters', 'from', 'commission', 'staff', 'again', 'requesting', 'the', 'submission', 'of', 'the', 'appropriate', 'logs', '.', 'new', 'approach', '4', '.', 'following', 'a', 'review', 'of', 'its', 'procedures', ',', 'the', 'commission', 'has', 'decided', 'to', 'modify', 'its', 'approach', 'to', 'non-compliance', 'with', 'logging', 'requirements', '.', 'specifically', ',', 'commission', 'staff', 'will', 'continue', 'to', 'send', 'a', 'first', 'letter', 'to', 'licensees', 'that', 'do', 'not', 'meet', 'these', 'requirements', '.', 'however', ',', 'if', 'a', 'licensee', 'remains', 'in', 'non-compliance', ',', 'the', 'commission', 'will', 'send', 'a', 'second', 'letter', 'signed', 'by', 'the', 'secretary', 'general', '.', 'this', 'second', 'letter', 'will', 'stipulate', 'that', 'the', 'licensee', 'is', 'in', 'breach', 'of', 'its', 'regulatory', 'requirements', 'and', ':', 'must', 'submit', 'the', 'required', 'logs', 'within', '30', 'days', 'from', 'the', 'date', 'of', 'the', 'letter', 'and', 'describe', 'the', 'steps', 'taken', 'to', 'ensure', 'future', 'compliance', ';', 'and', 'may', 'be', 'subject', 'to', 'one', 'or', 'more', 'regulatory', 'sanctions', 'should', 'it', 'fail', 'to', 'submit', 'the', 'required', 'information', '.', '5', '.', 'the', 'above-noted', 'regulatory', 'sanctions', 'could', 'include', ':', 'returning', 'future', 'applications', 'of', 'any', 'type', 'by', 'the', 'licensee', 'in', 'non-compliance', 'concerning', 'any', 'current', 'or', 'proposed', 'services', ',', 'including', 'applications', 'for', 'new', 'category', 'b', 'licences', 'and', 'extensions', 'of', 'deadlines', 'for', 'beginning', 'the', 'operation', 'of', 'new', 'category', 'b', 'services', ',', 'as', 'well', 'as', 'part', '1', 'and', 'licence', 'renewal', 'applications', ';', 'imposing', 'a', 'condition', 'of', 'licence', 'requiring', 'the', 'licensee', 'to', 'contribute', 'towards', 'a', 'new', 'or', 'existing', 'independent', 'canadian', 'production', 'fund', ';', 'and', 'initiating', 'a', 'mandatory', 'order', 'proceeding', '.', '6', '.', 'in', 'the', 'case', 'of', 'a', 'licensee', 'that', 'is', 'in', 'non-compliance', 'as', 'of', 'the', 'date', 'of', 'this', 'information', 'bulletin', 'and', 'has', 'failed', 'to', 'respond', 'within', 'the', 'allotted', 'time', 'to', 'a', 'non-compliance', 'letter', 'from', 'commission', 'staff', ',', 'the', 'commission', 'will', 'immediately', 'issue', 'a', 'letter', 'from', 'the', 'secretary', 'general', 'indicating', 'that', 'the', 'licensee', 'has', '60', 'days', 'to', 'submit', 'the', 'appropriate', 'logs', '.', 'if', 'the', 'licensee', 'fails', 'to', 'do', 'so', ',', 'it', 'may', 'be', 'subject', 'to', 'one', 'or', 'more', 'of', 'the', 'above-noted', 'sanctions', '.', '7', '.', 'licensees', 'are', 'also', 'advised', 'that', 'the', 'commission', 'will', 'note', 'any', 'non-compliance', 'with', 'regulatory', 'requirements', 'in', 'the', 'next', 'renewal', 'decision', 'for', 'the', 'affected', 'television', 'services', '.', 'such', 'non-compliance', 'could', 'result', 'in', 'a', 'short-term', 'renewal', '.', 'secretary', 'general', 'date', 'modified', ':', '2011-10-18']]
In [26]:
# counting the number of words in each record (includes punctuation and spaces)
for each, num in zip(all_chars, file_nums):
print(num, ":", len(each), "words")
DB97-477 : 184 words
DB97-626 : 330 words
2017-73 : 148 words
PT2001-99 : 1193 words
db2005-362 : 363 words
2011-650 : 510 words
In [14]:
# removes all numbers, punctuation, and extra characters
only_words = []
for row in file:
text = nltk.word_tokenize(row['text'])
clean = [w.lower() for w in text if w.isalpha()]
only_words.append(clean)
In [15]:
print(only_words[0])
['archived', 'decision', 'crtc', 'decision', 'ottawa', 'august', 'decision', 'crtc', 'central', 'okanagan', 'society', 'winfield', 'british', 'columbia', 'licence', 'renewal', 'following', 'public', 'notice', 'crtc', 'dated', 'may', 'the', 'commission', 'renews', 'the', 'broadcasting', 'licence', 'for', 'the', 'radiocommunication', 'distribution', 'undertaking', 'serving', 'winfield', 'from', 'september', 'to', 'august', 'subject', 'to', 'the', 'conditions', 'in', 'effect', 'under', 'the', 'current', 'licence', 'as', 'well', 'as', 'to', 'those', 'conditions', 'specified', 'in', 'the', 'licence', 'to', 'be', 'issued', 'the', 'commission', 'notes', 'that', 'this', 'undertaking', 'distributes', 'in', 'mode', 'the', 'programming', 'services', 'of', 'the', 'open', 'learning', 'agency', 'knowledge', 'network', 'on', 'channel', 'with', 'a', 'transmitter', 'power', 'of', 'watts', 'and', 'vancouver', 'on', 'the', 'frequency', 'mhz', 'channel', 'with', 'an', 'effective', 'radiated', 'power', 'of', 'watts', 'this', 'decision', 'is', 'to', 'be', 'appended', 'to', 'the', 'licence', 'laura', 'secretary', 'general', 'this', 'document', 'is', 'available', 'in', 'alternative', 'format', 'upon', 'request', 'date', 'modified']
In [16]:
# counting the number of words in each record
for num, each in zip(file_nums,only_words):
print(num, ":",len(each), "words")
DB97-477 : 126 words
DB97-626 : 259 words
2017-73 : 112 words
PT2001-99 : 857 words
db2005-362 : 288 words
2011-650 : 442 words
In [17]:
# counts the occurence of each word in the file
for each in only_words:
wordcounts = dict([ [t, each.count(t)] for t in set(each) ])
In [40]:
wordcounts
Out[40]:
{'a': 19,
'accuracy': 1,
'advised': 1,
'affected': 1,
'after': 2,
'again': 1,
'aired': 1,
'all': 1,
'allotted': 1,
'also': 1,
'and': 9,
'any': 3,
'applications': 3,
'approach': 2,
'appropriate': 3,
'are': 1,
'as': 3,
'attesting': 1,
'b': 2,
'be': 2,
'beginning': 1,
'breach': 1,
'broadcasting': 1,
'bulletin': 2,
'by': 4,
'canadian': 1,
'case': 1,
'category': 2,
'certificate': 1,
'commission': 10,
'compliance': 1,
'computer': 1,
'concerning': 1,
'condition': 1,
'contents': 1,
'continue': 1,
'continues': 1,
'contribute': 1,
'could': 2,
'crtc': 1,
'current': 3,
'date': 3,
'days': 3,
'deadlines': 1,
'decided': 1,
'decision': 1,
'describe': 1,
'do': 2,
'each': 1,
'end': 1,
'ensure': 1,
'existing': 1,
'extensions': 1,
'fail': 2,
'failed': 1,
'fails': 1,
'file': 1,
'filing': 1,
'first': 1,
'following': 1,
'for': 6,
'format': 1,
'from': 5,
'fund': 1,
'further': 1,
'future': 2,
'general': 3,
'given': 1,
'has': 3,
'however': 1,
'if': 3,
'immediately': 1,
'imposing': 1,
'in': 10,
'include': 1,
'including': 1,
'independent': 1,
'indicating': 1,
'information': 3,
'initiating': 1,
'introduction': 1,
'is': 4,
'issue': 1,
'it': 3,
'its': 4,
'letter': 8,
'letters': 1,
'licence': 2,
'licences': 1,
'licensee': 9,
'licensees': 5,
'log': 2,
'logging': 3,
'logs': 5,
'mandatory': 1,
'may': 2,
'meet': 2,
'modified': 1,
'modify': 1,
'month': 3,
'more': 2,
'must': 2,
'new': 4,
'next': 1,
'not': 1,
'note': 1,
'october': 1,
'of': 19,
'one': 2,
'operation': 1,
'or': 6,
'order': 1,
'ottawa': 1,
'part': 1,
'pdf': 1,
'procedures': 4,
'proceeding': 1,
'production': 1,
'program': 3,
'programming': 2,
'proposed': 1,
'receipt': 1,
'receive': 2,
'record': 2,
'regulatory': 5,
'remains': 1,
'renewal': 3,
'representative': 1,
'requesting': 2,
'required': 2,
'requirements': 6,
'requiring': 1,
'respond': 1,
'result': 1,
'returning': 1,
'review': 1,
'sanctions': 3,
'second': 2,
'secretary': 3,
'send': 2,
'sent': 1,
'services': 5,
'should': 1,
'signed': 2,
'so': 1,
'specific': 1,
'specifically': 1,
'staff': 4,
'steps': 1,
'stipulate': 1,
'subject': 2,
'submission': 2,
'submit': 3,
'submitted': 1,
'such': 1,
'taken': 1,
'television': 3,
'that': 7,
'the': 41,
'these': 1,
'this': 3,
'time': 1,
'to': 17,
'towards': 1,
'type': 1,
'under': 1,
'version': 1,
'website': 1,
'well': 1,
'will': 5,
'with': 4,
'within': 3}
In [19]:
# sorts the list by highest frequency
items = [(v, k) for k, v in wordcounts.items()]
In [20]:
sorted(items, reverse=True)
Out[20]:
[(41, 'the'),
(19, 'of'),
(19, 'a'),
(17, 'to'),
(10, 'in'),
(10, 'commission'),
(9, 'licensee'),
(9, 'and'),
(8, 'letter'),
(7, 'that'),
(6, 'requirements'),
(6, 'or'),
(6, 'for'),
(5, 'will'),
(5, 'services'),
(5, 'regulatory'),
(5, 'logs'),
(5, 'licensees'),
(5, 'from'),
(4, 'with'),
(4, 'staff'),
(4, 'procedures'),
(4, 'new'),
(4, 'its'),
(4, 'is'),
(4, 'by'),
(3, 'within'),
(3, 'this'),
(3, 'television'),
(3, 'submit'),
(3, 'secretary'),
(3, 'sanctions'),
(3, 'renewal'),
(3, 'program'),
(3, 'month'),
(3, 'logging'),
(3, 'it'),
(3, 'information'),
(3, 'if'),
(3, 'has'),
(3, 'general'),
(3, 'days'),
(3, 'date'),
(3, 'current'),
(3, 'as'),
(3, 'appropriate'),
(3, 'applications'),
(3, 'any'),
(2, 'submission'),
(2, 'subject'),
(2, 'signed'),
(2, 'send'),
(2, 'second'),
(2, 'required'),
(2, 'requesting'),
(2, 'record'),
(2, 'receive'),
(2, 'programming'),
(2, 'one'),
(2, 'must'),
(2, 'more'),
(2, 'meet'),
(2, 'may'),
(2, 'log'),
(2, 'licence'),
(2, 'future'),
(2, 'fail'),
(2, 'do'),
(2, 'could'),
(2, 'category'),
(2, 'bulletin'),
(2, 'be'),
(2, 'b'),
(2, 'approach'),
(2, 'after'),
(1, 'well'),
(1, 'website'),
(1, 'version'),
(1, 'under'),
(1, 'type'),
(1, 'towards'),
(1, 'time'),
(1, 'these'),
(1, 'taken'),
(1, 'such'),
(1, 'submitted'),
(1, 'stipulate'),
(1, 'steps'),
(1, 'specifically'),
(1, 'specific'),
(1, 'so'),
(1, 'should'),
(1, 'sent'),
(1, 'review'),
(1, 'returning'),
(1, 'result'),
(1, 'respond'),
(1, 'requiring'),
(1, 'representative'),
(1, 'remains'),
(1, 'receipt'),
(1, 'proposed'),
(1, 'production'),
(1, 'proceeding'),
(1, 'pdf'),
(1, 'part'),
(1, 'ottawa'),
(1, 'order'),
(1, 'operation'),
(1, 'october'),
(1, 'note'),
(1, 'not'),
(1, 'next'),
(1, 'modify'),
(1, 'modified'),
(1, 'mandatory'),
(1, 'licences'),
(1, 'letters'),
(1, 'issue'),
(1, 'introduction'),
(1, 'initiating'),
(1, 'indicating'),
(1, 'independent'),
(1, 'including'),
(1, 'include'),
(1, 'imposing'),
(1, 'immediately'),
(1, 'however'),
(1, 'given'),
(1, 'further'),
(1, 'fund'),
(1, 'format'),
(1, 'following'),
(1, 'first'),
(1, 'filing'),
(1, 'file'),
(1, 'fails'),
(1, 'failed'),
(1, 'extensions'),
(1, 'existing'),
(1, 'ensure'),
(1, 'end'),
(1, 'each'),
(1, 'describe'),
(1, 'decision'),
(1, 'decided'),
(1, 'deadlines'),
(1, 'crtc'),
(1, 'contribute'),
(1, 'continues'),
(1, 'continue'),
(1, 'contents'),
(1, 'condition'),
(1, 'concerning'),
(1, 'computer'),
(1, 'compliance'),
(1, 'certificate'),
(1, 'case'),
(1, 'canadian'),
(1, 'broadcasting'),
(1, 'breach'),
(1, 'beginning'),
(1, 'attesting'),
(1, 'are'),
(1, 'also'),
(1, 'allotted'),
(1, 'all'),
(1, 'aired'),
(1, 'again'),
(1, 'affected'),
(1, 'advised'),
(1, 'accuracy')]
In [21]:
# prints the list in a nicer format, here showing only the top 50 words
for count, word in sorted(items, reverse=True)[:50]:
print("%5d %s" % (count, word))
41 the
19 of
19 a
17 to
10 in
10 commission
9 licensee
9 and
8 letter
7 that
6 requirements
6 or
6 for
5 will
5 services
5 regulatory
5 logs
5 licensees
5 from
4 with
4 staff
4 procedures
4 new
4 its
4 is
4 by
3 within
3 this
3 television
3 submit
3 secretary
3 sanctions
3 renewal
3 program
3 month
3 logging
3 it
3 information
3 if
3 has
3 general
3 days
3 date
3 current
3 as
3 appropriate
3 applications
3 any
2 submission
2 subject
In [23]:
# importing the stopword list
import nltk.corpus
stopwords = nltk.corpus.stopwords.words('english')
In [24]:
# applying the stopwords and updating the counts
terms = {}
for word, count in wordcounts.items():
if count > 0 and word not in stopwords and word.isalpha():
terms[word] = count
In [25]:
# sorting the new list by frequency
items = [(v, k) for k, v in terms.items()]
In [26]:
for count, word in sorted(items, reverse=True):
print("%5d %s" % (count, word))
10 commission
9 licensee
8 letter
6 requirements
5 services
5 regulatory
5 logs
5 licensees
4 staff
4 procedures
4 new
3 within
3 television
3 submit
3 secretary
3 sanctions
3 renewal
3 program
3 month
3 logging
3 information
3 general
3 days
3 date
3 current
3 appropriate
3 applications
2 submission
2 subject
2 signed
2 send
2 second
2 required
2 requesting
2 record
2 receive
2 programming
2 one
2 must
2 meet
2 may
2 log
2 licence
2 future
2 fail
2 could
2 category
2 bulletin
2 b
2 approach
1 well
1 website
1 version
1 type
1 towards
1 time
1 taken
1 submitted
1 stipulate
1 steps
1 specifically
1 specific
1 sent
1 review
1 returning
1 result
1 respond
1 requiring
1 representative
1 remains
1 receipt
1 proposed
1 production
1 proceeding
1 pdf
1 part
1 ottawa
1 order
1 operation
1 october
1 note
1 next
1 modify
1 modified
1 mandatory
1 licences
1 letters
1 issue
1 introduction
1 initiating
1 indicating
1 independent
1 including
1 include
1 imposing
1 immediately
1 however
1 given
1 fund
1 format
1 following
1 first
1 filing
1 file
1 fails
1 failed
1 extensions
1 existing
1 ensure
1 end
1 describe
1 decision
1 decided
1 deadlines
1 crtc
1 contribute
1 continues
1 continue
1 contents
1 condition
1 concerning
1 computer
1 compliance
1 certificate
1 case
1 canadian
1 broadcasting
1 breach
1 beginning
1 attesting
1 also
1 allotted
1 aired
1 affected
1 advised
1 accuracy
In [ ]:
# removes recurring word
terms.pop('commission')
In [41]:
# removing the numbers
terms = list(terms)
In [44]:
# alphabetizing the list
set(terms)
Out[44]:
{'accuracy',
'advised',
'affected',
'aired',
'allotted',
'also',
'applications',
'approach',
'appropriate',
'attesting',
'b',
'beginning',
'breach',
'broadcasting',
'bulletin',
'canadian',
'case',
'category',
'certificate',
'commission',
'compliance',
'computer',
'concerning',
'condition',
'contents',
'continue',
'continues',
'contribute',
'could',
'crtc',
'current',
'date',
'days',
'deadlines',
'decided',
'decision',
'describe',
'end',
'ensure',
'existing',
'extensions',
'fail',
'failed',
'fails',
'file',
'filing',
'first',
'following',
'format',
'fund',
'future',
'general',
'given',
'however',
'immediately',
'imposing',
'include',
'including',
'independent',
'indicating',
'information',
'initiating',
'introduction',
'issue',
'letter',
'letters',
'licence',
'licences',
'licensee',
'licensees',
'log',
'logging',
'logs',
'mandatory',
'may',
'meet',
'modified',
'modify',
'month',
'must',
'new',
'next',
'note',
'october',
'one',
'operation',
'order',
'ottawa',
'part',
'pdf',
'procedures',
'proceeding',
'production',
'program',
'programming',
'proposed',
'receipt',
'receive',
'record',
'regulatory',
'remains',
'renewal',
'representative',
'requesting',
'required',
'requirements',
'requiring',
'respond',
'result',
'returning',
'review',
'sanctions',
'second',
'secretary',
'send',
'sent',
'services',
'signed',
'specific',
'specifically',
'staff',
'steps',
'stipulate',
'subject',
'submission',
'submit',
'submitted',
'taken',
'television',
'time',
'towards',
'type',
'version',
'website',
'well',
'within'}
In [27]:
only_words_no_stop = []
for row in file:
text = nltk.word_tokenize(row['text'])
clean = [w.lower() for w in text if w.isalpha()]
text = [w for w in clean if w not in stopwords]
only_words_no_stop.extend(text)
In [28]:
# creates a variable for the lemmatizing function
wnl = WordNetLemmatizer()
# lemmatizes all of the verbs
lemm = []
for word in only_words_no_stop:
lemm.append(wnl.lemmatize(word, 'v'))
# lemmatizes all of the nouns
lems = []
for word in lemm:
lems.append(wnl.lemmatize(word, 'n'))
In [29]:
print("Number of words:", len(lems))
Number of words: 1214
In [30]:
# prints the 10 most common bigrams
colText = nltk.Text(lems)
colText.collocations(10)
secretary general; date modify; public notice; upon request; decision
crtc; broadcast day; british columbia; tel fax; format upon; access
municipal
In [51]:
# creates a list of bigrams (ngrams of 2), printing the first 5
colBigrams = list(nltk.ngrams(colText, 2))
colBigrams[:5]
Out[51]:
[('archive', 'decision'),
('decision', 'crtc'),
('crtc', 'decision'),
('decision', 'ottawa'),
('ottawa', 'august')]
In [32]:
# error checking. There should be one less bigram than total words
print("Number of words:", len(lems))
print("Number of bigrams:", len(colBigrams))
Number of words: 1214
Number of bigrams: 1213
In [33]:
# frequency plot with stopwords removed
%matplotlib inline
plt.rcParams['figure.figsize'] = (10.0, 10.0)
fd = nltk.FreqDist(colText)
fd.plot(25)
In [34]:
# loads bigram code from NLTK
bigram_measures = nltk.collocations.BigramAssocMeasures()
# bigrams with a window size of 2 words
finder = BigramCollocationFinder.from_words(lems, window_size = 2)
# only bigrams that appear 4+ times
#finder.apply_freq_filter(4)
# only bigrams that contain 'privacy'
#finder.apply_ngram_filter(privacy_filter)
In [35]:
# filter results based on statistical test
# calulates the raw frequency as an actual number and percentage of total words
act = finder.ngram_fd.items()
raw = finder.score_ngrams(bigram_measures.raw_freq)
# log-likelihood ratio
log = finder.score_ngrams(bigram_measures.likelihood_ratio)
Log-Likelihood Ratio
The Log-likelihood ratio calculates the size and significance between the observed and expected frequencies of bigrams and assigns a score based on the result, taking into account the overall size of the corpus. The larger the difference between the observed and expected, the higher the score, and the more statistically significant the collocate is. The Log-likelihood ratio is my preferred test for collocates because it does not rely on a normal distribution, and for this reason, it can account for sparse or low frequency bigrams (unlike the Chi-square). But unlike the PMI, it does not over-represent low frequency bigrams with inflated scores, as the test is only reporting how much more likely it is that the frequencies are different than they are the same. The drawback to the Log-likelihood ratio, much like the t-score, is that it cannot be used to compare scores across corpora.
In [36]:
print(tabulate(log, headers = ["Collocate", "Log-Likelihood"], floatfmt=".3f", numalign="left"))
Collocate Log-Likelihood
-------------------------------------- ----------------
('secretary', 'general') 106.213
('hustler', 'tv') 64.902
('tv', 'x') 64.902
('date', 'modify') 63.123
('public', 'notice') 55.245
('x', 'channel') 54.317
('upon', 'request') 51.039
('decision', 'crtc') 50.734
('broadcast', 'day') 49.840
('british', 'columbia') 42.011
('tel', 'fax') 41.971
('format', 'upon') 37.622
('access', 'municipal') 35.281
('alternative', 'format') 34.995
('available', 'alternative') 34.995
('definition', 'broadcast') 34.672
('email', 'address') 33.015
('power', 'watt') 33.015
('notice', 'crtc') 32.735
('term', 'condition') 31.674
('tel', 'tdd') 31.426
('period', 'begin') 30.785
('tdd', 'fax') 30.554
('condition', 'licence') 29.979
('department', 'industry') 29.631
('longer', 'five') 29.631
('ottawa', 'august') 27.958
('commission', 'staff') 27.703
('append', 'licence') 26.568
('decision', 'append') 26.167
('modify', 'archive') 26.065
('carrier', 'municipality') 25.812
('category', 'b') 25.812
('municipal', 'property') 25.812
('subject', 'one') 25.812
('regulatory', 'requirement') 25.733
('appropriate', 'log') 25.307
('information', 'bulletin') 24.086
('mean', 'period') 24.086
('part', 'vii') 24.086
('radiate', 'power') 24.086
('interest', 'party') 22.982
('party', 'wish') 22.982
('effective', 'radiate') 22.901
('five', 'page') 22.901
('include', 'summary') 22.901
('commission', 'approve') 22.760
('contract', 'carrier') 21.994
('alter', 'term') 21.993
('disk', 'version') 21.993
('site', 'http') 21.993
('web', 'site') 21.993
('document', 'available') 21.898
('city', 'argue') 21.255
('receive', 'disk') 21.255
('general', 'document') 21.033
('principle', 'set') 20.270
('serve', 'copy') 20.270
('laura', 'secretary') 20.096
('second', 'letter') 20.096
('submission', 'longer') 19.623
('program', 'log') 19.267
('approve', 'application') 19.111
('also', 'examine') 19.087
('exist', 'contract') 19.087
('regulatory', 'sanction') 19.087
('tn', 'tn') 18.547
('electronic', 'version') 18.180
('internet', 'site') 18.180
('new', 'category') 18.180
('pdf', 'version') 18.180
('transmitter', 'site') 18.180
('vancouver', 'british') 18.180
('day', 'purpose') 17.851
('licence', 'laura') 17.574
('certificate', 'issue') 17.444
('agreement', 'access') 17.365
('avenue', 'suite') 17.365
('exist', 'agreement') 17.365
('crtc', 'ottawa') 17.094
('vii', 'application') 17.073
('ten', 'broadcast') 16.844
('follow', 'internet') 16.824
('quebec', 'tel') 16.824
('http', 'date') 16.627
('indicate', 'notice') 16.289
('commission', 'note') 16.264
('account', 'fact') 16.203
('accuracy', 'content') 16.203
('agency', 'knowledge') 16.203
('applicable', 'unique') 16.203
('approximately', 'metre') 16.203
('area', 'station') 16.203
('arrangement', 'necessary') 16.203
('attest', 'accuracy') 16.203
('bank', 'commerce') 16.203
('business', 'hour') 16.203
('canadian', 'production') 16.203
('chaudière', 'promenade') 16.203
('competitive', 'local') 16.203
('conditionally', 'technically') 16.203
('contractual', 'situation') 16.203
('contribute', 'towards') 16.203
('coopérative', 'antenne') 16.203
('cornwall', 'professional') 16.203
('coverage', 'populate') 16.203
('create', 'unacceptable') 16.203
('currently', 'define') 16.203
('describe', 'step') 16.203
('distribute', 'mode') 16.203
('e', 'telecommunication') 16.203
('easier', 'member') 16.203
('edmonton', 'alberta') 16.203
('excel', 'spreadsheet') 16.203
('extension', 'deadline') 16.203
('frequency', 'mhz') 16.203
('halifax', 'nova') 16.203
('independent', 'canadian') 16.203
('interference', 'aeronautical') 16.203
('knowledge', 'network') 16.203
('la', 'chaudière') 16.203
('lack', 'jurisdiction') 16.203
('le', 'teras') 16.203
('learn', 'agency') 16.203
('local', 'exchange') 16.203
('maisonneuve', 'blvd') 16.203
('metre', 'northwest') 16.203
('nd', 'floor') 16.203
('normal', 'business') 16.203
('nova', 'scotia') 16.203
('office', 'normal') 16.203
('official', 'language') 16.203
('okanagan', 'society') 16.203
('open', 'learn') 16.203
('possible', 'registration') 16.203
('process', 'reduce') 16.203
('production', 'fund') 16.203
('promenade', 'du') 16.203
('radiocommunication', 'distribution') 16.203
('reduce', 'workload') 16.203
('regina', 'saskatchewan') 16.203
('relevant', 'provision') 16.203
('relief', 'pursuant') 16.203
('soon', 'possible') 16.203
('special', 'facility') 16.203
('state', 'want') 16.203
('statement', 'change') 16.203
('streamline', 'process') 16.203
('technically', 'acceptable') 16.203
('unacceptable', 'interference') 16.203
('unique', 'factual') 16.203
('useful', 'prepare') 16.203
('usual', 'practice') 16.203
('winnipeg', 'manitoba') 16.203
('word', 'text') 16.203
('would', 'justify') 16.203
('write', 'notification') 16.203
('march', 'march') 16.185
('page', 'include') 16.185
('amendment', 'definition') 15.726
('revise', 'definition') 15.726
('wish', 'receive') 15.726
('television', 'service') 15.396
('tariff', 'application') 15.331
('current', 'procedure') 15.282
('address', 'available') 15.108
('approval', 'tariff') 15.108
('revise', 'tariff') 15.108
('may', 'subject') 15.015
('file', 'electronic') 14.668
('submission', 'appropriate') 14.104
('staff', 'request') 14.104
('submit', 'comment') 13.933
('submit', 'require') 13.933
('tariff', 'page') 13.933
('crtc', 'pdf') 13.778
('advise', 'commission') 13.750
('version', 'ottawa') 13.650
('set', 'decision') 13.521
('accordance', 'subsection') 13.430
('accordance', 'usual') 13.430
('addition', 'please') 13.430
('adversely', 'affect') 13.430
('allot', 'time') 13.430
('authority', 'grant') 13.430
('barrington', 'street') 13.430
('blvd', 'east') 13.430
('central', 'okanagan') 13.430
('complete', 'list') 13.430
('construction', 'operation') 13.430
('content', 'sign') 13.430
('continue', 'receipt') 13.430
('damage', 'transmission') 13.430
('de', 'la') 13.430
('de', 'maisonneuve') 13.430
('du', 'portage') 13.430
('east', 'nd') 13.430
('ensure', 'future') 13.430
('enter', 'line') 13.430
('fairchild', 'radio') 13.430
('find', 'useful') 13.430
('fund', 'initiate') 13.430
('future', 'compliance') 13.430
('give', 'framework') 13.430
('hornby', 'street') 13.430
('hour', 'central') 13.430
('initiate', 'mandatory') 13.430
('interim', 'basis') 13.430
('introduction', 'competitive') 13.430
('jointly', 'subsequent') 13.430
('july', 'transmit') 13.430
('last', 'paragraph') 13.430
('maintenance', 'transmission') 13.430
('measurement', 'show') 13.430
('microsoft', 'excel') 13.430
('microsoft', 'word') 13.430
('municipality', 'background') 13.430
('number', 'paragraph') 13.430
('operation', 'maintenance') 13.430
('paragraph', 'help') 13.430
('parameter', 'create') 13.430
('parameter', 'original') 13.430
('participate', 'jointly') 13.430
('please', 'enter') 13.430
('please', 'number') 13.430
('property', 'like') 13.430
('property', 'reference') 13.430
('pursuant', 'section') 13.430
('return', 'future') 13.430
('room', 'hull') 13.430
('room', 'regina') 13.430
('section', 'e') 13.430
('section', 'relevant') 13.430
('show', 'diskette') 13.430
('society', 'winfield') 13.430
('spreadsheet', 'please') 13.430
('station', 'department') 13.430
('step', 'take') 13.430
('subsequent', 'measurement') 13.430
('subsequent', 'stag') 13.430
('take', 'account') 13.430
('take', 'ensure') 13.430
('teras', 'de') 13.430
('text', 'microsoft') 13.430
('time', 'write') 13.430
('transmit', 'arrangement') 13.430
('various', 'company') 13.430
('without', 'description') 13.430
('workload', 'concern') 13.430
('letter', 'license') 13.401
('canada', 'application') 13.100
('broadcast', 'certificate') 13.053
('program', 'undertake') 13.035
('file', 'part') 12.960
('condition', 'exist') 12.514
('carrier', 'clec') 12.383
('category', 'specialty') 12.383
('certificate', 'attest') 12.383
('computer', 'record') 12.383
('could', 'result') 12.383
('diskette', 'electronic') 12.383
('electronic', 'form') 12.383
('end', 'month') 12.383
('exchange', 'carrier') 12.383
('framework', 'set') 12.383
('hard', 'copy') 12.383
('hull', 'quebec') 12.383
('intention', 'october') 12.383
('intervention', 'connection') 12.383
('justify', 'intervention') 12.383
('kuujjuaraapik', 'quebec') 12.383
('montréal', 'quebec') 12.383
('national', 'category') 12.383
('relocate', 'transmitter') 12.383
('westminster', 'transmitter') 12.383
('within', 'day') 12.349
('may', 'also') 12.137
('licence', 'renewal') 12.076
('archive', 'decision') 11.820
('receive', 'letter') 11.778
('additional', 'information') 11.704
('amendment', 'authority') 11.704
('amendment', 'data') 11.704
('antenne', 'tn') 11.704
('appropriate', 'implement') 11.704
('build', 'barrington') 11.704
('build', 'le') 11.704
('build', 'th') 11.704
('change', 'propose') 11.704
('circumstance', 'contractual') 11.704
('circumstance', 'would') 11.704
('clair', 'avenue') 11.704
('commerce', 'build') 11.704
('define', 'period') 11.704
('determine', 'propose') 11.704
('fact', 'revise') 11.704
('factual', 'circumstance') 11.704
('fail', 'respond') 11.704
('implement', 'revise') 11.704
('inconsistent', 'principle') 11.704
('jasper', 'avenue') 11.704
('kensington', 'build') 11.704
('like', 'agreement') 11.704
('mandatory', 'order') 11.704
('merely', 'send') 11.704
('must', 'actually') 11.704
('must', 'notify') 11.704
('next', 'renewal') 11.704
('northwest', 'approval') 11.704
('note', 'accord') 11.704
('order', 'streamline') 11.704
('practice', 'amendment') 11.704
('principle', 'develop') 11.704
('principle', 'lay') 11.704
('professional', 'build') 11.704
('propose', 'coverage') 11.704
('propose', 'relocate') 11.704
('respond', 'within') 11.704
('result', 'renewal') 11.704
('send', 'first') 11.704
('september', 'accordingly') 11.704
('similar', 'interest') 11.704
('th', 'avenue') 11.704
('tn', 'special') 11.704
('website', 'within') 11.704
('whether', 'wish') 11.704
('within', 'allot') 11.704
('within', 'calendar') 11.704
('follow', 'public') 11.696
('log', 'requirement') 11.313
('act', 'interfere') 11.199
('act', 'name') 11.199
('breach', 'regulatory') 11.199
('deadline', 'begin') 11.199
('effect', 'current') 11.199
('exist', 'independent') 11.199
('floor', 'suite') 11.199
('include', 'return') 11.199
('interfere', 'exist') 11.199
('jurisdiction', 'act') 11.199
('know', 'hustler') 11.199
('make', 'easier') 11.199
('march', 'coopérative') 11.199
('march', 'various') 11.199
('pertain', 'hustler') 11.199
('promptly', 'upon') 11.199
('provision', 'act') 11.199
('reply', 'comment') 11.199
('representative', 'current') 11.199
('suite', 'edmonton') 11.199
('suite', 'halifax') 11.199
('suite', 'montréal') 11.199
('suite', 'winnipeg') 11.199
('telecommunication', 'act') 11.199
('begin', 'day') 11.185
('canada', 'city') 10.899
('distribution', 'undertake') 10.796
('fm', 'vancouver') 10.796
('january', 'term') 10.796
('new', 'westminster') 10.796
('review', 'procedure') 10.796
('site', 'additional') 10.796
('site', 'approximately') 10.796
('site', 'official') 10.796
('substitute', 'term') 10.796
('towards', 'new') 10.796
('undertake', 'distribute') 10.796
('undertake', 'know') 10.796
('vancouver', 'fm') 10.796
('vancouver', 'frequency') 10.796
('version', 'hard') 10.796
('base', 'use') 10.659
('company', 'interim') 10.659
('consider', 'give') 10.659
('february', 'february') 10.659
('grant', 'time') 10.659
('however', 'subsequent') 10.659
('industry', 'advise') 10.659
('industry', 'technical') 10.659
('portage', 'room') 10.659
('technical', 'parameter') 10.659
('toronto', 'ontario') 10.659
('transmission', 'line') 10.659
('transmit', 'parameter') 10.659
('use', 'microsoft') 10.659
('accordingly', 'definition') 10.461
('actually', 'receive') 10.461
('city', 'inconsistent') 10.461
('city', 'respondent') 10.461
('consult', 'document') 10.461
('document', 'damage') 10.461
('immediately', 'issue') 10.461
('issue', 'determine') 10.461
('issue', 'soon') 10.461
('june', 'city') 10.461
('name', 'city') 10.461
('notification', 'receive') 10.461
('receive', 'merely') 10.461
('requirement', 'next') 10.461
('requirement', 'specifically') 10.461
('verify', 'document') 10.461
('program', 'service') 10.292
('alberta', 'tel') 10.174
('applicant', 'tariff') 10.174
('available', 'identify') 10.174
('available', 'promptly') 10.174
('basis', 'follow') 10.174
('channel', 'currently') 10.174
('channel', 'nature') 10.174
('clec', 'tariff') 10.174
('data', 'channel') 10.174
('facility', 'tariff') 10.174
('follow', 'last') 10.174
('follow', 'review') 10.174
('form', 'available') 10.174
('language', 'format') 10.174
('manitoba', 'tel') 10.174
('member', 'public') 10.174
('mhz', 'channel') 10.174
('mode', 'program') 10.174
('monitor', 'public') 10.174
('network', 'channel') 10.174
('proceed', 'case') 10.174
('program', 'air') 10.174
('public', 'consult') 10.174
('public', 'examination') 10.174
('read', 'follow') 10.174
('reference', 'public') 10.174
('saskatchewan', 'tel') 10.174
('scope', 'proceed') 10.174
('scotia', 'tel') 10.174
('specialty', 'program') 10.174
('stag', 'proceed') 10.174
('crtc', 'date') 10.030
('contact', 'secretary') 9.924
('decide', 'modify') 9.924
('fax', 'bank') 9.924
('fax', 'clair') 9.924
('fax', 'cornwall') 9.924
('fax', 'hornby') 9.924
('fax', 'jasper') 9.924
('fax', 'kensington') 9.924
('first', 'letter') 9.924
('letter', 'describe') 9.924
('letter', 'stipulate') 9.924
('notice', 'whether') 9.924
('receipt', 'letter') 9.924
('request', 'submission') 9.721
('joint', 'submission') 9.701
('log', 'computer') 9.701
('prepare', 'submission') 9.701
('request', 'relief') 9.701
('submission', 'addition') 9.701
('letter', 'commission') 9.616
('affect', 'television') 9.614
('give', 'month') 9.614
('list', 'october') 9.614
('one', 'sanction') 9.614
('winfield', 'british') 9.614
('columbia', 'increase') 9.614
('could', 'alter') 9.614
('november', 'mean') 9.614
('set', 'section') 9.614
('across', 'canada') 9.501
('aeronautical', 'service') 9.501
('appendix', 'canada') 9.501
('behalf', 'canada') 9.501
('bell', 'canada') 9.501
('canada', 'behalf') 9.501
('canada', 'scope') 9.501
('canada', 'state') 9.501
('condition', 'effect') 9.501
('condition', 'specify') 9.501
('impose', 'condition') 9.501
('nature', 'service') 9.501
('populate', 'service') 9.501
('respondent', 'canada') 9.501
('response', 'canada') 9.501
('service', 'area') 9.501
('service', 'kuujjuaraapik') 9.501
('service', 'open') 9.501
('situation', 'canada') 9.501
('accord', 'licensee') 9.318
('air', 'licensee') 9.318
('background', 'may') 9.318
('case', 'licensee') 9.318
('compliance', 'may') 9.318
('licensee', 'breach') 9.318
('licensee', 'contribute') 9.318
('licensee', 'remain') 9.318
('licensee', 'representative') 9.318
('licensee', 'statement') 9.318
('stipulate', 'licensee') 9.318
('type', 'licensee') 9.318
('proceed', 'party') 9.270
('examination', 'file') 9.152
('file', 'joint') 9.152
('file', 'reply') 9.152
('file', 'response') 9.152
('file', 'website') 9.152
('decision', 'ottawa') 9.065
('identify', 'party') 8.998
('party', 'contact') 8.998
('party', 'monitor') 8.998
('party', 'similar') 8.998
('request', 'may') 8.942
('address', 'show') 8.936
('argue', 'principle') 8.936
('august', 'ten') 8.936
('avenue', 'east') 8.936
('avenue', 'room') 8.936
('base', 'principle') 8.936
('build', 'portage') 8.936
('bulletin', 'fail') 8.936
('central', 'build') 8.936
('consider', 'circumstance') 8.936
('continue', 'send') 8.936
('encourage', 'interest') 8.936
('find', 'appropriate') 8.936
('increase', 'power') 8.936
('interim', 'approval') 8.936
('list', 'interest') 8.936
('mail', 'address') 8.936
('midnight', 'period') 8.936
('portage', 'avenue') 8.936
('propose', 'technical') 8.936
('send', 'second') 8.936
('serve', 'specific') 8.936
('serve', 'winfield') 8.936
('staff', 'continue') 8.936
('summary', 'order') 8.936
('telecom', 'order') 8.936
('tn', 'introduction') 8.936
('well', 'part') 8.936
('winfield', 'september') 8.936
('wish', 'participate') 8.936
('calendar', 'day') 8.855
('day', 'adversely') 8.855
('day', 'end') 8.855
('crtc', 'construction') 8.721
('crtc', 'fairchild') 8.721
('licence', 'extension') 8.721
('licence', 'national') 8.721
('licence', 'pertain') 8.721
('licence', 'radiocommunication') 8.721
('licence', 'read') 8.721
('specify', 'licence') 8.721
('decision', 'applicable') 8.596
('decision', 'june') 8.596
('develop', 'decision') 8.596
('lay', 'decision') 8.596
('examine', 'pdf') 8.571
('internet', 'indicate') 8.571
('month', 'certificate') 8.571
('record', 'month') 8.571
('sanction', 'could') 8.571
('application', 'appendix') 8.478
('application', 'applicant') 8.478
('application', 'conditionally') 8.478
('application', 'impose') 8.478
('application', 'type') 8.478
('connection', 'application') 8.478
('original', 'application') 8.478
('also', 'advise') 8.433
('also', 'encourage') 8.433
('approve', 'interim') 8.433
('begin', 'midnight') 8.433
('begin', 'operation') 8.433
('concern', 'current') 8.433
('description', 'page') 8.433
('east', 'suite') 8.433
('february', 'march') 8.433
('increase', 'effective') 8.433
('municipality', 'access') 8.433
('one', 'regulatory') 8.433
('purpose', 'hustler') 8.433
('street', 'suite') 8.433
('suite', 'toronto') 8.433
('acceptable', 'broadcast') 8.367
('broadcast', 'across') 8.367
('renew', 'broadcast') 8.367
('subsection', 'broadcast') 8.367
('circumstance', 'commission') 8.332
('date', 'bell') 8.262
('date', 'complete') 8.262
('date', 'january') 8.262
('date', 'july') 8.262
('registration', 'date') 8.262
('alternative', 'transmit') 8.031
('line', 'vancouver') 8.031
('new', 'approach') 8.031
('operation', 'new') 8.031
('radio', 'vancouver') 8.031
('show', 'alternative') 8.031
('site', 'however') 8.031
('street', 'vancouver') 8.031
('undertake', 'grant') 8.031
('use', 'new') 8.031
('vancouver', 'increase') 8.031
('version', 'html') 8.031
('august', 'subject') 7.895
('fail', 'meet') 7.895
('intervention', 'serve') 7.895
('part', 'record') 7.895
('sanction', 'fail') 7.895
('transmitter', 'power') 7.895
('city', 'toronto') 7.698
('ottawa', 'ontario') 7.698
('receive', 'department') 7.698
('requirement', 'however') 7.698
('requirement', 'introduction') 7.698
('summary', 'document') 7.698
('technical', 'requirement') 7.698
('toronto', 'city') 7.698
('party', 'may') 7.534
('approach', 'follow') 7.413
('available', 'web') 7.413
('format', 'html') 7.413
('html', 'follow') 7.413
('html', 'format') 7.413
('initiate', 'proceed') 7.413
('introduction', 'program') 7.413
('line', 'follow') 7.413
('ontario', 'tel') 7.413
('participate', 'proceed') 7.413
('proceed', 'consider') 7.413
('radio', 'program') 7.413
('specific', 'format') 7.413
('tariff', 'february') 7.413
('access', 'internet') 7.393
('amend', 'effective') 7.393
('comment', 'november') 7.393
('could', 'include') 7.393
('effective', 'november') 7.393
('examine', 'make') 7.393
('license', 'meet') 7.393
('license', 'television') 7.393
('meet', 'regulatory') 7.393
('month', 'license') 7.393
('quebec', 'march') 7.393
('sanction', 'license') 7.393
('general', 'date') 7.303
('modify', 'date') 7.303
('file', 'commission') 7.278
('order', 'revise') 7.220
('september', 'august') 7.220
('watt', 'approval') 7.220
('watt', 'watt') 7.220
('commission', 'also') 7.211
('fax', 'de') 7.164
('general', 'mail') 7.164
('general', 'second') 7.164
('letter', 'sign') 7.164
('modify', 'approach') 7.164
('modify', 'telecom') 7.164
('notice', 'description') 7.164
('ontario', 'fax') 7.164
('sign', 'secretary') 7.164
('time', 'letter') 7.164
('municipal', 'procedure') 6.993
('october', 'procedure') 6.993
('procedure', 'contract') 6.993
('procedure', 'indicate') 6.993
('procedure', 'television') 6.993
('undertake', 'amend') 6.993
('log', 'continue') 6.943
('submission', 'participate') 6.943
('approach', 'log') 6.943
('paragraph', 'submission') 6.943
('request', 'date') 6.860
('commission', 'decide') 6.850
('commission', 'immediately') 6.850
('commission', 'intention') 6.850
('commission', 'lack') 6.850
('commission', 'office') 6.850
('commission', 'renew') 6.850
('commission', 'substitute') 6.850
('commission', 'verify') 6.850
('commission', 'without') 6.850
('help', 'commission') 6.850
('necessary', 'commission') 6.850
('notify', 'commission') 6.850
('remain', 'commission') 6.850
('specifically', 'commission') 6.850
('want', 'commission') 6.850
('affect', 'condition') 6.745
('b', 'service') 6.745
('canada', 'telecom') 6.745
('company', 'canada') 6.745
('condition', 'base') 6.745
('purpose', 'condition') 6.745
('service', 'accordance') 6.745
('service', 'company') 6.745
('service', 'well') 6.745
('telecom', 'service') 6.745
('well', 'condition') 6.745
('access', 'agreement') 6.720
('act', 'principle') 6.720
('address', 'include') 6.720
('also', 'note') 6.720
('approval', 'require') 6.720
('begin', 'september') 6.720
('comment', 'circumstance') 6.720
('comment', 'email') 6.720
('current', 'propose') 6.720
('effective', 'september') 6.720
('include', 'email') 6.720
('information', 'regulatory') 6.720
('license', 'fail') 6.720
('make', 'part') 6.720
('march', 'revise') 6.720
('note', 'regulatory') 6.720
('period', 'approve') 6.720
('require', 'information') 6.720
('amend', 'definition') 6.662
('meet', 'requirement') 6.662
('ottawa', 'november') 6.662
('ottawa', 'october') 6.662
('receive', 'intervention') 6.662
('requirement', 'meet') 6.662
('however', 'licensee') 6.564
('licensee', 'concern') 6.564
('licensee', 'give') 6.564
('may', 'find') 6.564
('may', 'use') 6.564
('sign', 'licensee') 6.564
('channel', 'transmitter') 6.378
('columbia', 'tel') 6.378
('examine', 'follow') 6.378
('month', 'submit') 6.378
('pdf', 'format') 6.378
('record', 'proceed') 6.378
('record', 'program') 6.378
('television', 'program') 6.378
('undertake', 'serve') 6.322
('approval', 'new') 6.322
('august', 'term') 6.322
('email', 'procedure') 6.322
('note', 'undertake') 6.322
('watt', 'vancouver') 6.322
('encourage', 'party') 6.247
('party', 'list') 6.247
('party', 'mail') 6.247
('act', 'act') 6.221
('current', 'access') 6.221
('license', 'also') 6.221
('general', 'indicate') 6.131
('day', 'accordance') 6.105
('day', 'mean') 6.105
('day', 'midnight') 6.105
('midnight', 'day') 6.105
('agreement', 'city') 5.992
('document', 'must') 5.992
('issue', 'within') 5.992
('requirement', 'must') 5.992
('b', 'licence') 5.974
('bulletin', 'crtc') 5.974
('crtc', 'central') 5.974
('licence', 'radio') 5.974
('licence', 'well') 5.974
('mail', 'crtc') 5.974
('copy', 'submission') 5.912
('log', 'record') 5.912
('november', 'submission') 5.912
('october', 'submission') 5.912
('decision', 'affect') 5.850
('grant', 'decision') 5.850
('new', 'exist') 5.825
('procedure', 'current') 5.825
('procedure', 'license') 5.825
('term', 'exist') 5.825
('version', 'comment') 5.825
('application', 'base') 5.734
('application', 'ten') 5.734
('future', 'application') 5.734
('service', 'could') 5.715
('subject', 'condition') 5.715
('channel', 'amendment') 5.710
('format', 'send') 5.710
('proceed', 'must') 5.710
('submit', 'appropriate') 5.710
('tariff', 'amendment') 5.710
('archive', 'public') 5.710
('fail', 'submit') 5.710
('must', 'submit') 5.710
('order', 'proceed') 5.710
('power', 'follow') 5.710
('renewal', 'follow') 5.710
('send', 'program') 5.710
('commission', 'issue') 5.693
('crtc', 'decision') 5.610
('indicate', 'licensee') 5.536
('may', 'examine') 5.536
('description', 'date') 5.521
('specific', 'date') 5.521
('city', 'make') 5.497
('definition', 'effective') 5.497
('license', 'receive') 5.497
('ottawa', 'march') 5.497
('page', 'issue') 5.497
('fax', 'email') 5.465
('notice', 'email') 5.465
('renewal', 'secretary') 5.465
('new', 'undertake') 5.430
('undertake', 'vancouver') 5.430
('contract', 'file') 5.372
('copy', 'file') 5.372
('log', 'within') 5.247
('request', 'approval') 5.247
('day', 'date') 5.238
('copy', 'party') 5.222
('october', 'party') 5.222
('channel', 'effective') 5.217
('follow', 'begin') 5.217
('make', 'available') 5.217
('page', 'submit') 5.217
('submit', 'make') 5.217
('broadcast', 'licence') 5.185
('submit', 'commission') 5.135
('definition', 'term') 5.104
('agreement', 'canada') 5.052
('canada', 'tn') 5.052
('propose', 'service') 5.052
('service', 'must') 5.052
('require', 'secretary') 4.973
('broadcast', 'decision') 4.955
('columbia', 'licence') 4.952
('fail', 'may') 4.875
('information', 'may') 4.875
('licensee', 'fail') 4.875
('licensee', 'propose') 4.875
('note', 'licensee') 4.875
('november', 'decision') 4.830
('alternative', 'submit') 4.825
('format', 'alternative') 4.825
('make', 'submission') 4.757
('page', 'request') 4.757
('require', 'log') 4.757
('application', 'broadcast') 4.741
('application', 'amend') 4.716
('file', 'serve') 4.713
('interest', 'file') 4.713
('must', 'file') 4.713
('wish', 'file') 4.713
('amend', 'broadcast') 4.608
('broadcast', 'amend') 4.608
('meet', 'broadcast') 4.608
('party', 'interest') 4.564
('condition', 'current') 4.563
('service', 'include') 4.563
('comment', 'may') 4.387
('may', 'hustler') 4.387
('require', 'licensee') 4.387
('log', 'new') 4.369
('order', 'crtc') 4.297
('part', 'licence') 4.297
('licence', 'amendment') 4.297
('issue', 'letter') 4.262
('file', 'comment') 4.227
('follow', 'tariff') 4.227
('format', 'submit') 4.227
('proceed', 'proceed') 4.227
('august', 'decision') 4.177
('decision', 'circumstance') 4.177
('renewal', 'decision') 4.177
('watt', 'decision') 4.177
('argue', 'commission') 4.143
('commission', 'alter') 4.143
('commission', 'consider') 4.143
('commission', 'encourage') 4.143
('commission', 'find') 4.143
('commission', 'initiate') 4.143
('commission', 'specific') 4.143
('commission', 'take') 4.143
('commission', 'web') 4.143
('concern', 'commission') 4.143
('transmission', 'commission') 4.143
('make', 'party') 4.080
('party', 'access') 4.080
('renewal', 'application') 4.065
('log', 'receive') 4.049
('request', 'definition') 4.049
('requirement', 'log') 4.049
('tariff', 'notice') 3.988
('archive', 'broadcast') 3.958
('broadcast', 'information') 3.958
('september', 'broadcast') 3.958
('day', 'hustler') 3.944
('date', 'information') 3.858
('date', 'order') 3.858
('date', 'september') 3.858
('send', 'date') 3.858
('current', 'licence') 3.817
('licence', 'require') 3.817
('fax', 'secretary') 3.751
('letter', 'secretary') 3.751
('version', 'party') 3.699
('channel', 'condition') 3.588
('channel', 'service') 3.588
('proceed', 'canada') 3.588
('application', 'effective') 3.587
('application', 'make') 3.587
('application', 'require') 3.587
('include', 'application') 3.587
('day', 'undertake') 3.564
('city', 'file') 3.530
('document', 'file') 3.530
('broadcast', 'act') 3.483
('may', 'submit') 3.418
('effective', 'date') 3.384
('submission', 'submission') 3.334
('application', 'new') 3.213
('commission', 'could') 3.162
('commission', 'intervention') 3.162
('intervention', 'commission') 3.162
('service', 'log') 3.148
('issue', 'licence') 3.129
('licence', 'issue') 3.129
('available', 'party') 3.121
('party', 'proceed') 3.121
('term', 'broadcast') 3.110
('issue', 'decision') 3.014
('vancouver', 'date') 3.013
('day', 'submit') 2.989
('log', 'licensee') 2.981
('submission', 'may') 2.981
('licensee', 'request') 2.981
('service', 'condition') 2.965
('application', 'city') 2.907
('crtc', 'public') 2.868
('licence', 'available') 2.868
('log', 'file') 2.830
('submission', 'file') 2.830
('day', 'commission') 2.802
('may', 'canada') 2.799
('date', 'document') 2.710
('submission', 'party') 2.691
('canada', 'file') 2.649
('licence', 'secretary') 2.642
('broadcast', 'public') 2.549
('commission', 'send') 2.548
('commission', 'serve') 2.548
('september', 'commission') 2.548
('staff', 'commission') 2.548
('general', 'decision') 2.530
('may', 'file') 2.487
('modify', 'broadcast') 2.328
('date', 'letter') 2.236
('application', 'request') 2.232
('day', 'licensee') 2.227
('licensee', 'day') 2.227
('application', 'commission') 2.218
('party', 'file') 2.207
('crtc', 'may') 2.112
('act', 'commission') 2.108
('approve', 'commission') 2.108
('comment', 'commission') 2.108
('require', 'commission') 2.108
('commission', 'make') 2.108
('procedure', 'commission') 1.771
('date', 'may') 1.727
('licensee', 'date') 1.727
('commission', 'receive') 1.501
('document', 'commission') 1.501
('issue', 'commission') 1.501
('date', 'party') 1.470
('crtc', 'broadcast') 1.337
('channel', 'commission') 1.281
('commission', 'program') 1.281
('proceed', 'commission') 1.281
('notice', 'commission') 1.096
('date', 'application') 1.078
('request', 'commission') 0.939
('may', 'commission') 0.688
('commission', 'application') 0.245
In [ ]:
'''
subject_graph = nx.Graph()
for row in file:
subject = row['keywords'].split(',')
for n in range(len(subject)-1):
for m in range(n, len(subject)):
subject_graph.add_edge(subject[n], subject[m])
'''
In [ ]:
#subject_communities = nx.connected_component_subgraphs(subject_graph)
In [ ]:
#test = list(subject_communities)
In [ ]:
#nx.draw(test[0])
#plt.show()
In [ ]:
#all_words = list(sum(only_words, []))
#plot_all_words = nltk.Text(all_words)
#fd_all = nltk.FreqDist(plot_all_words)
In [ ]:
#%matplotlib inline
#plt.rcParams['figure.figsize'] = (10.0, 10.0)
#fd_all.plot(50,cumulative=False,)
In [45]:
print(stopwords)
['i', 'me', 'my', 'myself', 'we', 'our', 'ours', 'ourselves', 'you', 'your', 'yours', 'yourself', 'yourselves', 'he', 'him', 'his', 'himself', 'she', 'her', 'hers', 'herself', 'it', 'its', 'itself', 'they', 'them', 'their', 'theirs', 'themselves', 'what', 'which', 'who', 'whom', 'this', 'that', 'these', 'those', 'am', 'is', 'are', 'was', 'were', 'be', 'been', 'being', 'have', 'has', 'had', 'having', 'do', 'does', 'did', 'doing', 'a', 'an', 'the', 'and', 'but', 'if', 'or', 'because', 'as', 'until', 'while', 'of', 'at', 'by', 'for', 'with', 'about', 'against', 'between', 'into', 'through', 'during', 'before', 'after', 'above', 'below', 'to', 'from', 'up', 'down', 'in', 'out', 'on', 'off', 'over', 'under', 'again', 'further', 'then', 'once', 'here', 'there', 'when', 'where', 'why', 'how', 'all', 'any', 'both', 'each', 'few', 'more', 'most', 'other', 'some', 'such', 'no', 'nor', 'not', 'only', 'own', 'same', 'so', 'than', 'too', 'very', 's', 't', 'can', 'will', 'just', 'don', 'should', 'now', 'd', 'll', 'm', 'o', 're', 've', 'y', 'ain', 'aren', 'couldn', 'didn', 'doesn', 'hadn', 'hasn', 'haven', 'isn', 'ma', 'mightn', 'mustn', 'needn', 'shan', 'shouldn', 'wasn', 'weren', 'won', 'wouldn']
In [46]:
print(len(stopwords))
153
Content source: telecom-research/crtc-scraper
Similar notebooks: