Using Twitter Dataset for Social Listening in Singapore (doi:10.21979/N9/PALUID)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link) (external link)

Document Description

Citation

Title:

Using Twitter Dataset for Social Listening in Singapore

Identification Number:

doi:10.21979/N9/PALUID

Distributor:

DR-NTU (Data)

Date of Distribution:

2024-07-02

Version:

1

Bibliographic Citation:

Wang, Qiongqiong; Sailor, Hardik B.; Lee, Kong Aik; Ma, Kai; Goh, Kim Huat; Boh, Wai Fong, 2024, "Using Twitter Dataset for Social Listening in Singapore", https://doi.org/10.21979/N9/PALUID, DR-NTU (Data), V1

Study Description

Citation

Title:

Using Twitter Dataset for Social Listening in Singapore

Identification Number:

doi:10.21979/N9/PALUID

Authoring Entity:

Wang, Qiongqiong (Agency for Science, Technology and Research (A⋆STAR))

Sailor, Hardik B. (Agency for Science, Technology and Research (A⋆STAR))

Lee, Kong Aik (The Hong Kong Polytechnic University, Hong Kong)

Ma, Kai (Nanyang Technological University)

Goh, Kim Huat (Nanyang Technological University)

Boh, Wai Fong (Nanyang Technological University)

Software used in Production:

MongoDB

Grant Number:

COT-CityScan-2020-1

Grant Number:

COT-CityScan-2020-1

Distributor:

DR-NTU (Data)

Access Authority:

Wang Qiongqiong

Depositor:

Kai, Ma

Date of Deposit:

2024-06-21

Holdings Information:

https://doi.org/10.21979/N9/PALUID

Study Scope

Keywords:

Computer and Information Science, Social Sciences, Computer and Information Science, Social Sciences, Twitter data, Singapore, Sentiment analysis, Bursty topic detection

Abstract:

<p>This study delves into analyzing social media data sourced from Twitter within the context of Singapore, forming a crucial component of a broader social listening initiative. We provide a decade’s worth of social data from Singapore, offering invaluable insights for the research community. This work presents two analytical approaches utilizing this dataset: sentiment analysis and bursty topic detection. Sentiment analysis for direct search is based on zero shot pretrained model while busrty topic analysis is based on biterm topic model. The detailed experiments demonstrate the efficacy of the approach for analyzing social trends using Twitter data.</p> <p>We collected all twitter data posted in Singapore from 2008 to 2023. The geocode setting as (1.346353, 103.807526, 25km) was used in Twitter API to cover the whole of Singapore. The total number of tweets in this dataset is 96,686,894.</p> <p>There are 3 data files: 1. place.json includes 10k detailed places information in Singapore.2.subzones.json includes 332 subzone information in Singapore 3.tweets.json includes 96M+tweets posted in Singapore. MongoDB was used as the database to store and manage the data.</p>

Time Period:

2008-2023

Kind of Data:

Twitter timeline dataset

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Publications

Citation

Identification Number:

10.1109/ACCESS.2024.3427760

Bibliographic Citation:

Wang, Q., Sailor, H. B., Lee, K. A., Ma, K., Goh, K. H., & Boh, W. F. (2024). Using Twitter dataset for social listening in Singapore. IEEE access, 12, 100015-100025.

Citation

Identification Number:

10356/181466

Bibliographic Citation:

Wang, Q., Sailor, H. B., Lee, K. A., Ma, K., Goh, K. H. & Boh, W. F. (2024). Using Twitter dataset for social listening in Singapore. IEEE Access, 12, 100015-100025.

Other Study-Related Materials

Label:

twitter_dataset_singapore.zip

Text:

Singapore Twitter dataset

Notes:

application/zip