Using social network information to discover truth of movie ranking (doi:10.21979/N9/L5TTRW)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Using social network information to discover truth of movie ranking

Identification Number:

doi:10.21979/N9/L5TTRW

Distributor:

DR-NTU (Data)

Date of Distribution:

2018-06-10

Version:

1

Bibliographic Citation:

Yang, Jielong; Tay, Wee Peng, 2018, "Using social network information to discover truth of movie ranking", https://doi.org/10.21979/N9/L5TTRW, DR-NTU (Data), V1, UNF:6:nRQNiWhKidICLOUiRM+VhA== [fileUNF]

Study Description

Citation

Title:

Using social network information to discover truth of movie ranking

Identification Number:

doi:10.21979/N9/L5TTRW

Authoring Entity:

Yang, Jielong (Nanyang Technological University)

Tay, Wee Peng (Nanyang Technological University)

Software used in Production:

MS Excel

Distributor:

DR-NTU (Data)

Access Authority:

Yang Jielong

Access Authority:

Tay Wee Peng

Depositor:

Yang Jielong; Tay Wee Peng

Date of Deposit:

2018-05-23

Holdings Information:

https://doi.org/10.21979/N9/L5TTRW

Study Scope

Keywords:

Computer and Information Science, Engineering, Social Sciences, Computer and Information Science, Engineering, Social Sciences, Truth discovery, social network, movie ranking

Abstract:

The real dataset consists of movie evaluations from IMDB, which provides a platform where individuals can evaluate movies on a scale of 1 to 10. If a user rates a movie and clicks the share button, a Twitter message is generated. We then extract the rating from the Twitter message. We treat the ratings on the IMDB website as the event truths, which are based on the aggregated evaluations from all users, whereas our observations come from only a subset of users who share their ratings on Twitter. Using the Twitter API, we collect information about the follower and following relationships between individuals that generate movie evaluation Twitter messages. To better show the influence of social network information on event truth discovery, we delete small subnetworks that consist of less than 5 agents. The final dataset we use consists of 2266 evaluations from 209 individuals on 245 movies (events) and also the social network between these 209 individuals. We regard the social network to be undirected as both follower or following relationships indicate that the two users have similar taste.

Kind of Data:

csv

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

File Description--f2652

File: MovieRanking2Levels.tab

  • Number of cases: 2266

  • No. of variables per record: 4

  • Type of File: text/tab-separated-values

Notes:

UNF:6:idmdOHVRX2lCN+qFXSrj2A==

File Description--f2653

File: MovieRanking4Levels.tab

  • Number of cases: 2266

  • No. of variables per record: 4

  • Type of File: text/tab-separated-values

Notes:

UNF:6:VEOQFDZaksL/wFtlcyylEQ==

File Description--f2655

File: SocialNetwork.tab

  • Number of cases: 516

  • No. of variables per record: 2

  • Type of File: text/tab-separated-values

Notes:

UNF:6:hGoR7qSrSeOimkpW39DBww==

Variable Description

List of Variables:

Variables

Agent ID

f2652 Location:

Summary Statistics: Mean 96.27537511032429; Max. 208.0; Valid 2266.0; StDev 59.25412091901887; Min. 0.0

Variable Format: numeric

Notes: UNF:6:KqsOd6skCSJdHwwREDkVKQ==

Movie ID

f2652 Location:

Summary Statistics: Max. 244.0; Mean 85.67431597528685; Min. 0.0; StDev 60.03227110509165; Valid 2266.0

Variable Format: numeric

Notes: UNF:6:B/psHpvrOd3jIR4fhDUuPA==

Evaluation of the movie

f2652 Location:

Summary Statistics: Mean 0.6544571932921444; StDev 0.47564988995433216; Valid 2266.0; Min. 0.0; Max. 1.0;

Variable Format: numeric

Notes: UNF:6:UP517FEJB7KCM04qtpF6kQ==

True ranking of the movie

f2652 Location:

Summary Statistics: Mean 0.5511915269196823; Max. 1.0; Min. 0.0; StDev 0.4974823070843409; Valid 2266.0;

Variable Format: numeric

Notes: UNF:6:l8Uo3p3AskQztgveHi8n9w==

Agent ID

f2653 Location:

Summary Statistics: Mean 96.27537511032429; Valid 2266.0; StDev 59.25412091901887; Max. 208.0; Min. 0.0

Variable Format: numeric

Notes: UNF:6:KqsOd6skCSJdHwwREDkVKQ==

Movie ID

f2653 Location:

Summary Statistics: Mean 85.67431597528685; StDev 60.03227110509165; Valid 2266.0; Max. 244.0; Min. 0.0;

Variable Format: numeric

Notes: UNF:6:B/psHpvrOd3jIR4fhDUuPA==

Evaluation of the movie

f2653 Location:

Summary Statistics: Valid 2266.0; StDev 0.9000821634280748; Max. 3.0; Mean 2.422771403353928; Min. 0.0

Variable Format: numeric

Notes: UNF:6:YC2VLAV04HiUYy0gzrY8Ig==

True ranking of the movie

f2653 Location:

Summary Statistics: Max. 3.0; Min. 0.0; Valid 2266.0; StDev 0.6443709048806621; Mean 2.469991173874669;

Variable Format: numeric

Notes: UNF:6:nwsiNm4CVOp/X5QTuc4Pdw==

Agent ID of the follower

f2655 Location:

Summary Statistics: Valid 516.0; StDev 59.67149965941716; Min. 1.0; Mean 106.61434108527156; Max. 187.0;

Variable Format: numeric

Notes: UNF:6:MENHPHjzqDP8/k3wQCkTPw==

Agent ID of the followee

f2655 Location:

Summary Statistics: StDev 58.956731936958555; Max. 208.0; Mean 98.97093023255813; Min. 0.0; Valid 516.0;

Variable Format: numeric

Notes: UNF:6:exhn81BJhCemJVV9xaH1+g==

Other Study-Related Materials

Label:

Readme.txt

Text:

Introduction of the dataset

Notes:

text/plain