NwQM

Implementation of the paper NwQM: A neural quality assessment framework for Wikipedia, EMNLP 2020

Organization of the folder

1. Baselines - The Baseline models described in the paper are implemented here.
2. Data Processing - This folder contains code the generation of images (screenshots) of wikipedia pages from its URLs and also the text crawler for cleaning the text which is used in HAN etc.
3. Generate Embeddings - This folder contains the codes for generating the Finetuned BERT Embeddings for text pages, Google USE Embeddings for talk pages, Finetuned InceptionV3 Embeddings for Images.
4. NwQM Codes - The different NwQM Models such as NwQM, NwQM-w/oI (without Images), NwQM-w/oT (without Talk pages), NwQM-w/oTI (without Talk Pages and Images) are implemented here.

To Run

Please look into the individual folders for running the codes.

If you find this code useful in your research then please cite

@inproceedings{guda2020nwqm,
  title={NwQM: A Neural Quality Assessment Framework for Wikipedia},
  author={Guda, Bhanu Prakash Reddy and Seelaboyina, Sasi Bhushan and Sarkar, Soumya and Mukherjee, Animesh},
  booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  pages={8396--8406},
  year={2020}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Baselines

Baselines

Data Processing

Data Processing

Generate Embeddings

Generate Embeddings

NwQM Codes

NwQM Codes

README.md

README.md

Repository files navigation

NwQM

Organization of the folder

To Run

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 161 Commits
Baselines		Baselines
Data Processing		Data Processing
Generate Embeddings		Generate Embeddings
NwQM Codes		NwQM Codes
README.md		README.md

sasibhushan3/NwQM_EMNLP

Folders and files

Latest commit

History

Repository files navigation

NwQM

Organization of the folder

To Run

About

Resources

Stars

Watchers

Forks

Languages