This repository contains the code accompanying our in-progress workshop paper. We demonstrate that the attention distributions of trained BERT models carry a strong enough signal to serve directly as input to downstream shallow neural networks, which reduces the amount of data required to train classification models.
The data can be found here: https://drive.google.com/uc?id=1dfN-WvFMiAWuOXq1VJ_EnpTDGQruWuxm&export=download
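The overall idea can be sketched as follows: extract per-head attention distributions from a trained BERT model, pool them into a fixed-size feature vector, and train a small classifier on those features. The sketch below is a self-contained illustration only, using synthetic attention tensors, an illustrative entropy-based feature map, and a hand-rolled logistic regression as the "shallow network"; the function names and the feature choice are hypothetical, not this repo's actual API. In practice the attentions would come from a real model (e.g. a Hugging Face `transformers` BERT called with `output_attentions=True`).

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for BERT-style attention: shape (layers, heads, seq, seq),
# each row normalised to sum to 1 like a real attention distribution.
# (Hypothetical shapes/defaults for illustration, not the repo's pipeline.)
def fake_attention(layers=12, heads=12, seq=16):
    a = rng.random((layers, heads, seq, seq))
    return a / a.sum(axis=-1, keepdims=True)

# One illustrative feature map: mean entropy of each head's attention rows,
# yielding one scalar per (layer, head) -> a layers*heads-dim vector.
def attention_features(attn):
    ent = -(attn * np.log(attn + 1e-9)).sum(axis=-1)  # (layers, heads, seq)
    return ent.mean(axis=-1).ravel()                  # (layers * heads,)

# A minimal "shallow network": logistic regression trained by gradient descent.
def train_logreg(X, y, lr=0.5, steps=500):
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        grad = p - y
        w -= lr * (X.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b

# Toy dataset: standardised features, with labels that are linearly
# recoverable from them, so the classifier has something to learn.
X = np.stack([attention_features(fake_attention()) for _ in range(64)])
X = (X - X.mean(axis=0)) / X.std(axis=0)
y = (X.mean(axis=1) > np.median(X.mean(axis=1))).astype(float)
w, b = train_logreg(X, y)
acc = ((X @ w + b > 0).astype(float) == y).mean()
print(f"train accuracy: {acc:.2f}")
```

Because the feature vector is small and fixed-size regardless of sequence length, even a simple linear model on top of it can be trained with far fewer labelled examples than fine-tuning the full BERT model would need.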