Detection and Analysis of Content Creator Collaborations in Youtube Videos using Face Recognition

YouTube analysis concerning content creator collaborations in videos, using face recognition. This projects includes data acquisition from YouTube ,video and face processing and statistics evaluation.

Directories

data

contains evaluation results and plots

src

contains code for the different pipeline steps

data_collection

crawler code for YouTube, Socialblade and Google Images

data_evaluation

evaluation code, jupyter-notebook based evaluation, code for plot creation

face_recognition

video download, face recognition and clustering pipeline, contains Facenet code base aswell as youtube_dl

visualization

collaboration graph visualization based on Gugel Universum (http://universum.gugelproductions.de/).

external

misc scripts, face recognition evaluation scripts (heavily based on FaceNet and OpenFace (https://cmusatyalab.github.io/openface/) code)

Face Recognition Algorithm

For face feature extraction trained models and code from FaceNet is used. Face detection and alignment uses MTCNN Multi-task CNN.

Clustering

For clustering face features, the algorithm HDBSCAN is used.

Performance

Face recognition evaluation were conducted on the YouTube Faces dataset aswell as Labeled Faces in the Wild LFW dataset.

	accuracy
LFW	0.993 +- 0.004
YTF	0.998 +- 0.0013

Collaboration Detection

See the CATANA publication for further explanation.

Data

Data for applying face recognition and evaluation can be acquired using the provided crawler in [data_collection] directory.

Usage

Installation

Depending on the use-case different requirements must be met.

For using the CATANA framework only, without evaluation of the other face recognition methods like OpenFace, see requirements.txt aswell as the requirements of FaceNet.

For executing the face recognition evaluations, additionally requirements for the OpenFace (https://cmusatyalab.github.io/openface/) framework must be installed.

All use cases also require a current OpenCV 2.4 version installed.

For video extraction, youtube_dl is used, a version is included in the face_recognition directory, but should be updated to a current version when used.

Usage

See the respective directory Readme files for instructions.

Evaluation

Data evaluation is based on Jupyter notebook .ipynb files located in data_evaluation. File names indicate a execution order, while 'XX' files can be executed independent after the strict order execution.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Detection and Analysis of Content Creator Collaborations in Youtube Videos using Face Recognition

Directories

data

src

data_collection

data_evaluation

face_recognition

visualization

external

Face Recognition Algorithm

Clustering

Performance

Collaboration Detection

Data

Usage

Installation

Usage

Evaluation

About

Releases

Packages

Languages

License

billtiger/CATANA

Folders and files

Latest commit

History

Repository files navigation

Detection and Analysis of Content Creator Collaborations in Youtube Videos using Face Recognition

Directories

data

src

data_collection

data_evaluation

face_recognition

visualization

external

Face Recognition Algorithm

Clustering

Performance

Collaboration Detection

Data

Usage

Installation

Usage

Evaluation

About

Resources

License

Stars

Watchers

Forks

Languages