View: |
Part 1: Document Description
|
Citation |
|
---|---|
Title: |
Using Twitter Dataset for Social Listening in Singapore |
Identification Number: |
doi:10.21979/N9/PALUID |
Distributor: |
DR-NTU (Data) |
Date of Distribution: |
2024-07-02 |
Version: |
1 |
Bibliographic Citation: |
Wang, Qiongqiong; Sailor, Hardik B.; Lee, Kong Aik; Ma, Kai; Goh, Kim Huat; Boh, Wai Fong, 2024, "Using Twitter Dataset for Social Listening in Singapore", https://doi.org/10.21979/N9/PALUID, DR-NTU (Data), V1 |
Citation |
|
Title: |
Using Twitter Dataset for Social Listening in Singapore |
Identification Number: |
doi:10.21979/N9/PALUID |
Authoring Entity: |
Wang, Qiongqiong (Agency for Science, Technology and Research (A⋆STAR)) |
Sailor, Hardik B. (Agency for Science, Technology and Research (A⋆STAR)) |
|
Lee, Kong Aik (The Hong Kong Polytechnic University, Hong Kong) |
|
Ma, Kai (Nanyang Technological University) |
|
Goh, Kim Huat (Nanyang Technological University) |
|
Boh, Wai Fong (Nanyang Technological University) |
|
Software used in Production: |
MongoDB |
Grant Number: |
COT-CityScan-2020-1 |
Grant Number: |
COT-CityScan-2020-1 |
Distributor: |
DR-NTU (Data) |
Access Authority: |
Wang Qiongqiong |
Depositor: |
Kai, Ma |
Date of Deposit: |
2024-06-21 |
Holdings Information: |
https://doi.org/10.21979/N9/PALUID |
Study Scope |
|
Keywords: |
Computer and Information Science, Social Sciences, Computer and Information Science, Social Sciences, Twitter data, Singapore, Sentiment analysis, Bursty topic detection |
Abstract: |
<p>This study delves into analyzing social media data sourced from Twitter within the context of Singapore, forming a crucial component of a broader social listening initiative. We provide a decade’s worth of social data from Singapore, offering invaluable insights for the research community. This work presents two analytical approaches utilizing this dataset: sentiment analysis and bursty topic detection. Sentiment analysis for direct search is based on zero shot pretrained model while busrty topic analysis is based on biterm topic model. The detailed experiments demonstrate the efficacy of the approach for analyzing social trends using Twitter data.</p> <p>We collected all twitter data posted in Singapore from 2008 to 2023. The geocode setting as (1.346353, 103.807526, 25km) was used in Twitter API to cover the whole of Singapore. The total number of tweets in this dataset is 96,686,894.</p> <p>There are 3 data files: 1. place.json includes 10k detailed places information in Singapore.2.subzones.json includes 332 subzone information in Singapore 3.tweets.json includes 96M+tweets posted in Singapore. MongoDB was used as the database to store and manage the data.</p> |
Time Period: |
2008-2023 |
Kind of Data: |
Twitter timeline dataset |
Methodology and Processing |
|
Sources Statement |
|
Data Access |
|
Other Study Description Materials |
|
Related Publications |
|
Citation |
|
Identification Number: |
10.1109/ACCESS.2024.3427760 |
Bibliographic Citation: |
Wang, Q., Sailor, H. B., Lee, K. A., Ma, K., Goh, K. H., & Boh, W. F. (2024). Using Twitter dataset for social listening in Singapore. IEEE access, 12, 100015-100025. |
Citation |
|
Identification Number: |
10356/181466 |
Bibliographic Citation: |
Wang, Q., Sailor, H. B., Lee, K. A., Ma, K., Goh, K. H. & Boh, W. F. (2024). Using Twitter dataset for social listening in Singapore. IEEE Access, 12, 100015-100025. |
Label: |
twitter_dataset_singapore.zip |
Text: |
Singapore Twitter dataset |
Notes: |
application/zip |