SNA in Bibliometrics A number of papers in a specific field (big data, cloud computing, SNA etc.) are gathered from the data feed (webofknowledge.com) and they are filtered by the funded agencies option which is available in this source. In this way it is declared that the selected papers are connected with a fund. Information like authors, title, abstract and keywords can easily be extracted from the source but the name of specific tools, projects or algorithms are possible missed. The full text of papers in .pdf or .txt format is gathered. NLP and text mining techniques like LDA, NER are used in order to extract the necessary information.



In [ ]: