avalanchesiqi/networked-popularity

Code and Data for YouTube Networked Popularity Study

We release the code and data for the following paper. If you use these datasets or refer to the paper's results, please cite:

Siqi Wu, Marian-Andrei Rizoiu, and Lexing Xie. Estimating Attention Flow in Online Video Networks. ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW), 2019. [paper|slides|blog]

Code usage

We provide three quickstart bash scripts:

  1. run_all_wrangling.sh
  2. run_all_measures.sh
  3. run_all_models.sh

Download the data, place it in the data directory, and uncompress it. First run run_all_wrangling.sh to create the formatted data, then run run_all_measures.sh to conduct the temporal analysis or run_all_models.sh to reproduce the results of the prediction tasks. Detailed usage and running times are documented in the corresponding Python scripts.

Note that the datasets are large, so the quickstart scripts can take up to 24 hours to finish. Check the estimated running time in each Python script before you run the quickstart scripts.
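
The run order described above (wrangling first, then the follow-up scripts) can be sketched as a small Python driver. This is only an illustration, not part of the released code: the function names plan_run and run_plan are hypothetical, and the sketch assumes the three quickstart scripts sit in the repository root and are invoked with bash. It defaults to a dry run, since a full run may take many hours.

```python
import subprocess
from pathlib import Path

# Script names come from the quickstart list in this README; everything else
# in this sketch (function names, dry-run default) is an assumption.
WRANGLING = "run_all_wrangling.sh"
FOLLOW_UPS = ("run_all_measures.sh", "run_all_models.sh")


def plan_run(follow_ups=FOLLOW_UPS):
    """Return the scripts in execution order: wrangling first, then follow-ups."""
    unknown = [s for s in follow_ups if s not in FOLLOW_UPS]
    if unknown:
        raise ValueError(f"unknown quickstart script(s): {unknown}")
    return [WRANGLING, *follow_ups]


def run_plan(scripts, repo_dir=".", dry_run=True):
    """Build one bash command per script and optionally execute them in order.

    With dry_run=True nothing is executed and the commands are only returned,
    which is a cheap way to check the plan before a long run.
    """
    commands = [["bash", script] for script in scripts]
    if not dry_run:
        for command in commands:
            # check=True raises CalledProcessError on a non-zero exit code,
            # so a failed wrangling step stops the later scripts.
            subprocess.run(command, cwd=Path(repo_dir), check=True)
    return commands
```

For example, run_plan(plan_run(("run_all_models.sh",))) returns the two commands that would be executed, and passing dry_run=False actually runs them.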

Python packages version

All code was developed and tested with Python 3.6.7, NumPy 1.14.5, matplotlib 3.0.3, and SciPy 1.2.1.
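
Before a long run, it may help to compare the local environment against the versions listed above. The following is a minimal sketch, not part of the released code: the helper names are hypothetical, and it assumes each package exposes a __version__ attribute (NumPy, matplotlib, and SciPy do). A mismatch does not necessarily mean the scripts will fail, only that the environment differs from the tested one.

```python
import sys

# Versions are the ones listed in this README; the helpers are illustrative.
TESTED = {"numpy": "1.14.5", "matplotlib": "3.0.3", "scipy": "1.2.1"}
TESTED_PYTHON = (3, 6, 7)


def python_matches(version_info=None):
    """True if the interpreter's (major, minor, micro) equals the tested one."""
    info = sys.version_info if version_info is None else version_info
    return tuple(info[:3]) == TESTED_PYTHON


def installed_versions(packages=TESTED):
    """Import each package and read __version__; None if it is not importable."""
    found = {}
    for name in packages:
        try:
            found[name] = __import__(name).__version__
        except ImportError:
            found[name] = None
    return found


def compare_versions(installed, tested=TESTED):
    """Return {package: (installed, tested)} for every package that differs."""
    return {
        name: (installed.get(name), wanted)
        for name, wanted in tested.items()
        if installed.get(name) != wanted
    }
```

Calling compare_versions(installed_versions()) returns an empty dictionary when the three packages match the tested versions, and otherwise maps each differing package to its installed and tested version.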

Data

The data is hosted on Google Drive and Dataverse. See more details in this data description.

About

Code and Data for paper: Estimating Attention Flow in Online Video Networks (CSCW '19)
