twitter_gender

A simple experiment that enriches Twitter profiles with a predicted gender.

How to run it?

Clone the repository
Install the requirements: pip install -r requirements.txt
Run the file processor: python file_processor.py

Running it automatically downloads the dataset, builds the classifier, prints the test results, and stores an enriched version of the dataset (a CSV file) under: twitter_gender/data/users_enriched.csv

Two columns are added at the end:

'prediction': the predicted gender, 'female' or 'male'
'source': whether the final prediction was based on the name ('name') or on the description plus last tweet ('text')
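To get a feel for the two added columns, the enriched file can be summarized by counting how many rows fall into each prediction/source combination. The sketch below is only an illustration: it uses the standard library `csv` module and a tiny in-memory sample in place of the real file. Only the `prediction` and `source` column names come from this README; the `name` column and the sample rows are made up, and `summarize_predictions` is a hypothetical helper.

```python
import csv
import io
from collections import Counter


def summarize_predictions(csv_file):
    """Count rows per (prediction, source) pair in an enriched CSV."""
    reader = csv.DictReader(csv_file)
    counts = Counter()
    for row in reader:
        counts[(row["prediction"], row["source"])] += 1
    return counts


# In-memory stand-in for users_enriched.csv. The "name" column and all
# rows are invented for illustration; only "prediction" and "source"
# are column names documented in this README.
sample = io.StringIO(
    "name,prediction,source\n"
    "alice,female,name\n"
    "bob,male,name\n"
    "xk_42,female,text\n"
)

summary = summarize_predictions(sample)
for (prediction, source), count in sorted(summary.items()):
    print(prediction, source, count)
```

To run it against the real output, pass an open file handle instead, for example `open("twitter_gender/data/users_enriched.csv", newline="")`. Depending on the dataset, an explicit `encoding` argument may be needed.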

Can I classify unseen examples?

Yes.

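from twitter_gender.file_processor import ProcessUsers

# Set up the processor and its gender classifier.
process_users = ProcessUsers()

# The classifier returns an integer index into this list.
labels = ['unknown', 'female', 'male']

# Classify by name.
gender = process_users.gender_classifier.get_gender_by_name('Johannes Erett')
print(labels[gender])
# >> male

# Classify by free text, such as a profile description or a tweet.
gender = process_users.gender_classifier.get_gender_by_text_custom('I am a woman of great faith... in unicorns ❤️ ')
print(labels[gender])
# >> female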
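The two calls above can be combined into a single helper that also reports which signal was used, mirroring the 'prediction' and 'source' columns. The sketch below is an assumption about how such a combination might work: it tries the name first and falls back to the text only when the name yields 'unknown'. The README does not confirm that the repository uses this exact order. `predict_with_fallback` and `StubClassifier` are hypothetical; the stub only mimics the two method names so the example runs without the trained model.

```python
LABELS = ["unknown", "female", "male"]


class StubClassifier:
    """Hypothetical stand-in for the real classifier (toy rules only)."""

    def get_gender_by_name(self, name):
        parts = name.split()
        if not parts:
            return 0  # no usable first name, so 'unknown'
        return {"johannes": 2, "maria": 1}.get(parts[0].lower(), 0)

    def get_gender_by_text_custom(self, text):
        return 1 if "woman" in text.lower() else 2


def predict_with_fallback(classifier, name, text):
    """Return (label, source): try the name first, then fall back to text."""
    gender = classifier.get_gender_by_name(name)
    if LABELS[gender] != "unknown":
        return LABELS[gender], "name"
    gender = classifier.get_gender_by_text_custom(text)
    return LABELS[gender], "text"


stub = StubClassifier()
by_name = predict_with_fallback(stub, "Johannes Erett", "")
by_text = predict_with_fallback(stub, "xk_42", "I am a woman of great faith")
print(by_name)
print(by_text)
```

With the real package, `process_users.gender_classifier` could be passed in place of the stub, assuming its methods return indices into the same label list.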

What comes next?

Currently the classifier is not stored, but always built and trained from scratch. I would recommend pickling and re-loading it.
The code needs better structure and more documentation.
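The pickling suggestion above could be approached with a small load-or-build helper. This is a minimal sketch, not code from the repository: `load_or_build` and `build_dummy_classifier` are hypothetical names, a plain dict stands in for the trained classifier, and it assumes the real classifier object can be pickled at all.

```python
import os
import pickle
import tempfile


def load_or_build(path, build_fn):
    """Load a pickled object from path, or build it and cache it there."""
    if os.path.exists(path):
        with open(path, "rb") as f:
            return pickle.load(f)
    obj = build_fn()
    with open(path, "wb") as f:
        pickle.dump(obj, f)
    return obj


# A plain dict stands in for the trained classifier in this demo.
build_calls = []


def build_dummy_classifier():
    build_calls.append(1)  # record each (expensive) build
    return {"model": "dummy"}


with tempfile.TemporaryDirectory() as tmp:
    cache_path = os.path.join(tmp, "classifier.pkl")
    first = load_or_build(cache_path, build_dummy_classifier)   # builds and saves
    second = load_or_build(cache_path, build_dummy_classifier)  # loads from disk

print(len(build_calls))
```

In the real project, `build_fn` might be something like `lambda: ProcessUsers().gender_classifier`, though that is an untested assumption. Pickle files should only be loaded from trusted sources, since unpickling can execute arbitrary code.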

Tested with Python 3.7.1
