Skip to content

Implementation of our paper "NwQM: A neural quality assessment framework for Wikipedia"

Notifications You must be signed in to change notification settings

sasibhushan3/NwQM_EMNLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NwQM

Implementation of the paper NwQM: A neural quality assessment framework for Wikipedia, EMNLP 2020

Organization of the folder

1. Baselines - The Baseline models described in the paper are implemented here.
2. Data Processing - This folder contains code the generation of images (screenshots) of wikipedia pages from its URLs and also the text crawler for cleaning the text which is used in HAN etc.
3. Generate Embeddings - This folder contains the codes for generating the Finetuned BERT Embeddings for text pages, Google USE Embeddings for talk pages, Finetuned InceptionV3 Embeddings for Images.
4. NwQM Codes - The different NwQM Models such as NwQM, NwQM-w/oI (without Images), NwQM-w/oT (without Talk pages), NwQM-w/oTI (without Talk Pages and Images) are implemented here.

To Run

Please look into the individual folders for running the codes.

If you find this code useful in your research then please cite

@inproceedings{guda2020nwqm,
  title={NwQM: A Neural Quality Assessment Framework for Wikipedia},
  author={Guda, Bhanu Prakash Reddy and Seelaboyina, Sasi Bhushan and Sarkar, Soumya and Mukherjee, Animesh},
  booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  pages={8396--8406},
  year={2020}
}

About

Implementation of our paper "NwQM: A neural quality assessment framework for Wikipedia"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages