Status: Archive (code is provided as-is, no updates expected)
Code and models from the paper "Language Models are Unsupervised Multitask Learners".
You can read about GPT-2 and its staged release in the original blog post, the 6-month follow-up post, and the final post.
We have also released a dataset for researchers to study their behaviors.
This repository is meant to be a starting point for researchers and engineers to experiment with GPT-2. It adds compatibility for TensorFlow 2.0 and above.
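Since the code targets TensorFlow 2.0 and above, it can be worth checking which version your environment provides before running anything. A minimal check, assuming only that TensorFlow is already importable:

```python
# Confirm the installed TensorFlow is 2.0 or above, which this
# repository targets; an older 1.x install will trip the assertion.
import tensorflow as tf

print("TensorFlow version:", tf.__version__)
assert int(tf.__version__.split(".")[0]) >= 2, \
    "This repository targets TensorFlow 2.0 and above"
```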
For basic information, see our model card.
- GPT-2 models' robustness and worst case behaviors are not well-understood. As with any machine-learned model, carefully evaluate GPT-2 for your use case, especially if used without fine-tuning or in safety-critical applications where reliability is important.
- The dataset our GPT-2 models were trained on contains many texts with biases and factual inaccuracies, and thus GPT-2 models are likely to be biased and inaccurate as well.
- To avoid having samples mistaken as human-written, we recommend clearly labeling samples as synthetic before wide dissemination. Our models are often incoherent or inaccurate in subtle ways, which takes more than a quick read for a human to notice.
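One lightweight way to follow that labeling recommendation is to attach an explicit notice to every sample before it leaves your pipeline. A minimal sketch; the `label_synthetic` helper and its banner text are illustrative, not part of this repository:

```python
# Illustrative helper (not part of this repo): prepend an explicit
# synthetic-text notice to a model sample before saving or sharing it.
def label_synthetic(sample: str, model_name: str = "GPT-2") -> str:
    banner = f"[SYNTHETIC TEXT: generated by {model_name}, not human-written]"
    return banner + "\n" + sample

print(label_synthetic("Once upon a time, a language model wrote a story..."))
```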
- Download the repository: `git clone https://github.com/namelessCrusader/Gpt-2-compat-tf2`
- Download a model with `python download_model.py 117M` (use `python` or `python3` as appropriate for your setup; 335M and other sizes are also available)
- If you don't already have TensorFlow 2.0 or above, install it with `pip install tensorflow` (this repository adds compatibility for TensorFlow 2.0)
- Install the remaining dependencies: `pip install -r requirements.txt`
- Move into the source directory: `cd src`
- Generate text with `python generate_unconditional_samples.py` (unconditional samples) or `python interactive_conditional_samples.py` (samples conditioned on a prompt you type); a programmatic sketch follows this list
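If you would rather drive generation from your own script than use the bundled entry points, the sketch below mirrors roughly what `interactive_conditional_samples.py` does. It assumes this fork keeps the upstream openai/gpt-2 module interfaces (`encoder.get_encoder()`, `model.default_hparams()`, `sample.sample_sequence()`), runs the graph-mode code through `tf.compat.v1`, and that a model was downloaded to `models/` at the repository root; treat it as an outline rather than a drop-in script:

```python
# Sketch: one conditional sample from a downloaded checkpoint.
# Run from src/ after `python download_model.py 117M`; the module
# names and call signatures assume the upstream openai/gpt-2 layout.
import json
import os

import tensorflow.compat.v1 as tf  # TF1-style graph API on TensorFlow 2

import encoder
import model
import sample

tf.disable_eager_execution()

model_name = "117M"
models_dir = os.path.join("..", "models")  # assumed checkpoint location

enc = encoder.get_encoder(model_name, models_dir)
hparams = model.default_hparams()
with open(os.path.join(models_dir, model_name, "hparams.json")) as f:
    hparams.override_from_dict(json.load(f))

with tf.Session(graph=tf.Graph()) as sess:
    context = tf.placeholder(tf.int32, [1, None])
    output = sample.sample_sequence(
        hparams=hparams, length=40, context=context,
        batch_size=1, temperature=1.0, top_k=40)

    # Restore the pretrained weights from the downloaded checkpoint.
    saver = tf.train.Saver()
    saver.restore(sess, tf.train.latest_checkpoint(
        os.path.join(models_dir, model_name)))

    tokens = enc.encode("GPT-2 is a language model that")
    out = sess.run(output, feed_dict={context: [tokens]})
    print(enc.decode(out[0][len(tokens):]))  # drop the prompt tokens
```

Adjust `models_dir` if your checkpoints live elsewhere, and see the two sample scripts in `src/` for the full set of sampling options (seed, length, temperature, top_k).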
See CONTRIBUTORS.md
Please use the following BibTeX entry:
@article{radford2019language,
  title={Language Models are Unsupervised Multitask Learners},
  author={Radford, Alec and Wu, Jeff and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya},
  year={2019}
}