https://www.socictopen.socict.org/files/original/c1f674d6d83e5d9f0fca0a45841339d5.pdf

Dublin Core
The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information, see http://dublincore.org/documents/dces/.

Title: Coronavirus
Description: Scientific domain: Coronavirus

Text
A resource consisting primarily of words for reading. Examples include books, letters, dissertations, poems, newspapers, articles, archives of mailing lists. Note that facsimiles or images of texts are still of the genre Text.

Title: Applying Machine Learning to Identify Anti-Vaccination Tweets during the COVID-19 Pandemic
Creator: Corneel Vandelanotte, Quyen G. To, Kien G. To, Van-Anh N. Huynh, Nhung TQ Nguyen, Diep TN Ngo, Stephanie J. Alley, Anh NQ Tran, Anh NP Tran, Ngan TT Pham, Thanh X Bui
Description: Anti-vaccination attitudes have been an issue since the development of the first vaccines. The increasing use of social media as a source of health information may contribute to vaccine hesitancy because anti-vaccination content is widely available on social media, including Twitter. Being able to identify anti-vaccination tweets could provide useful information for formulating strategies to reduce anti-vaccination sentiments among different groups. This study aims to evaluate the performance of different natural language processing models in identifying anti-vaccination tweets that were published during the COVID-19 pandemic.
We compared the performance of bidirectional encoder representations from transformers (BERT) and bidirectional long short-term memory networks with pre-trained GloVe embeddings (Bi-LSTM) with classic machine learning methods, including support vector machine (SVM) and naïve Bayes (NB). On the test set, the BERT model achieved accuracy = 91.6%, precision = 93.4%, recall = 97.6%, F1 score = 95.5%, and AUC = 84.7%. The Bi-LSTM model achieved accuracy = 89.8%, precision = 44.0%, recall = 47.2%, F1 score = 45.5%, and AUC = 85.8%. SVM with a linear kernel achieved accuracy = 92.3%, precision = 19.5%, recall = 78.6%, F1 score = 31.2%, and AUC = 85.6%. Complement NB achieved accuracy = 88.8%, precision = 23.0%, recall = 32.8%, F1 score = 27.1%, and AUC = 62.7%. In conclusion, the BERT model outperformed the Bi-LSTM, SVM, and NB models in this task. Moreover, the BERT model achieved excellent performance and can be used to identify anti-vaccination tweets in future studies.

Date: 2021
Subject: neural network, deep learning, LSTM, BERT, transformer, stance analysis
Identifier: 10.3390/ijerph18084069
Source: Epidemiology and Health
Publisher: Korean Society of Epidemiology
Coverage: Medicine
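
The F1 scores quoted in the description above can be cross-checked against the reported precision and recall, because F1 is their harmonic mean. The following is a minimal sketch of that check, not code from the study; the function name `f1_from_precision_recall` and the `REPORTED` table are illustrative, and the numbers are copied from the abstract as percentages.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall.

    Both inputs must use the same unit (fractions or percentages);
    the result is in that same unit.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) in percent, copied from the abstract.
REPORTED = {
    "BERT": (93.4, 97.6, 95.5),
    "Bi-LSTM": (44.0, 47.2, 45.5),
    "SVM (linear kernel)": (19.5, 78.6, 31.2),
    "Complement NB": (23.0, 32.8, 27.1),
}


def recomputed_f1(table):
    """Recompute F1 for every model from its reported precision and recall."""
    return {
        model: f1_from_precision_recall(precision, recall)
        for model, (precision, recall, _reported) in table.items()
    }


if __name__ == "__main__":
    for model, f1 in recomputed_f1(REPORTED).items():
        reported = REPORTED[model][2]
        print(f"{model}: recomputed F1 = {f1:.2f}%, reported F1 = {reported:.1f}%")
```

Small differences between the recomputed and reported values are expected, since the precision and recall in the abstract are themselves rounded to one decimal place. The check also makes the gap between the models visible: the SVM and NB accuracies are close to BERT's, but their low precision pulls their F1 scores far below it.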