Description
This data has been used as the basis of a scientific article by Violette Mens.
It comprises all the records returned when searching for "radicalization", "radicalisation" and "radicalisacion" in Web of Science on 18 June 2024. The data in the .txt files are separated by tab characters.
The data set is untouched: no cleaning has been applied to it. The data used for the article, however, was curated to limit the risk of error.
The author kept the 8 files separate rather than merging them, both to limit file size and to avoid errors when copying and pasting the content of each. For this kind of data, any change to the layout of the text can prevent software from parsing the file, rendering it unusable.
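For readers who prefer to combine the files programmatically rather than by copying and pasting, here is a minimal Python sketch. It is not the procedure used for the article. It assumes that each file starts with a header row of Web of Science field tags and is encoded as UTF-8 (possibly with a byte order mark); the function name `load_records` and the `source_file` key are hypothetical names introduced for illustration.

```python
import csv
from pathlib import Path


def load_records(paths, encoding="utf-8-sig"):
    """Read several tab-separated export files into one list of dicts.

    Assumption: the first line of each file is a header row of field tags.
    The files on disk are only read, never modified.
    """
    # Fields such as cited references can be long; raise the csv size limit.
    csv.field_size_limit(10_000_000)
    records = []
    for path in paths:
        with open(path, newline="", encoding=encoding) as handle:
            # QUOTE_NONE keeps quotation marks inside titles as literal text.
            reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
            for row in reader:
                # Remember which file each record came from.
                row["source_file"] = Path(path).name
                records.append(row)
    return records
```

With the 8 files in one folder, something like `load_records(sorted(Path("data").glob("*.txt")))` would return every row in memory, where `data` is a placeholder folder name. If the files turn out to use a different encoding, pass it through the `encoding` argument.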
To use this data set, load all the files (which together contain the information for the 3797 articles retrieved) into a data cleaning platform (the one used for this article was OpenRefine). Once all duplicates and misspellings have been removed, the cleaned file can be loaded into an analysis platform (the one used for this article was the "Bibliometrix" package in RStudio).
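As an illustration of the duplicate-removal step, the following Python sketch drops records that share an Accession Number (the UT field in the list of field tags). It is only a rough first pass, not the OpenRefine procedure used for the article, and it does not address misspellings. It assumes each record is a dictionary keyed by field tag; the function name `drop_duplicates` is hypothetical.

```python
def drop_duplicates(records, key="UT"):
    """Keep the first record seen for each identifier.

    Assumption: `records` is a list of dicts keyed by field tag, and the
    UT (Accession Number) value identifies a record uniquely.
    Records with an empty or missing identifier are always kept.
    """
    seen = set()
    unique = []
    for record in records:
        identifier = (record.get(key) or "").strip()
        if identifier and identifier in seen:
            continue  # same identifier already kept: skip this copy
        if identifier:
            seen.add(identifier)
        unique.append(record)
    return unique
```

Comparing `len(records)` before and after the call gives the number of duplicates removed, which can be checked against the 3797 articles mentioned above.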
It is very important to keep the files in plain text (.txt) format: not all platforms understand the same formats, and plain text is the only one understood by both OpenRefine and RStudio.
To inspect the data set without cleaning it, the files can be loaded directly into RStudio, but the output may differ slightly from that reported in the article.
The two-letter abbreviations (field tags) used in the files are documented on the Web of Science website:
https://support.clarivate.com/ScientificandAcademicResearch/s/article/Web-of-Science-Core-Collection-List-of-field-tags-in-output?language=en_US
For convenience, the full list is reproduced here:
FN: File Name
VR: Version Number
PT: Publication Type (J=Journal; B=Book; S=Series; P=Patent)
AU: Authors
AF: Author Full Name
BA: Book Authors
BF: Book Authors Full Name
CA: Group Authors
GP: Book Group Authors
BE: Editors
TI: Document Title
SO: Publication Name
SE: Book Series Title
BS: Book Series Subtitle
LA: Language
DT: Document Type
CT: Conference Title
CY: Conference Date
CL: Conference Location
SP: Conference Sponsors
HO: Conference Host
DE: Author Keywords
ID: Keywords Plus®
AB: Abstract
C1: Author Address
RP: Reprint Address
EM: E-mail Address
RI: ResearcherID Number
OI: ORCID Identifier (Open Researcher and Contributor ID)
FU: Funding Agency and Grant Number
FX: Funding Text
CR: Cited References
NR: Cited Reference Count
TC: Web of Science Core Collection Times Cited Count
Z9: Total Times Cited Count (Web of Science Core Collection, BIOSIS Citation Index, Chinese Science Citation Database, Data Citation Index, Russian Science Citation Index, SciELO Citation Index)
U1: Usage Count (Last 180 Days)
U2: Usage Count (Since 2013)
PU: Publisher
PI: Publisher City
PA: Publisher Address
SN: International Standard Serial Number (ISSN)
EI: Electronic International Standard Serial Number (eISSN)
BN: International Standard Book Number (ISBN)
J9: 29-Character Source Abbreviation
JI: ISO Source Abbreviation
PD: Publication Date
PY: Year Published
VL: Volume
IS: Issue
SI: Special Issue
PN: Part Number
SU: Supplement
MA: Meeting Abstract
BP: Beginning Page
EP: Ending Page
AR: Article Number
DI: Digital Object Identifier (DOI)
D2: Book Digital Object Identifier (DOI)
PG: Page Count
P2: Chapter Count (Book Citation Index)
WC: Web of Science Categories
SC: Research Areas
GA: Document Delivery Number
UT: Accession Number
PM: PubMed ID
ER: End of Record
EF: End of File
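The field tags above can be turned into readable labels when inspecting a record. The Python sketch below covers only a handful of tags, taken from the list above. It assumes that multi-valued fields such as AU and DE separate their values with semicolons, which should be checked against the files; the names `FIELD_NAMES` and `describe` are hypothetical.

```python
# A small subset of the field tags listed above, mapped to their labels.
FIELD_NAMES = {
    "AU": "Authors",
    "TI": "Document Title",
    "SO": "Publication Name",
    "DE": "Author Keywords",
    "PY": "Year Published",
    "DI": "Digital Object Identifier (DOI)",
    "UT": "Accession Number",
}


def describe(record, multi_valued=("AU", "DE")):
    """Return a copy of `record` with readable labels instead of tags.

    Assumption: values in multi-valued fields are separated by ";".
    Tags that are not in FIELD_NAMES are left out of the result.
    """
    readable = {}
    for tag, value in record.items():
        if tag not in FIELD_NAMES:
            continue
        value = (value or "").strip()
        if tag in multi_valued:
            # Split into a list and drop empty pieces.
            readable[FIELD_NAMES[tag]] = [
                part.strip() for part in value.split(";") if part.strip()
            ]
        else:
            readable[FIELD_NAMES[tag]] = value
    return readable
```

Extending `FIELD_NAMES` with further entries from the list is enough to cover more fields.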
License
Except where otherwise noted, this item's license is described as CC0 1.0 Universal
