Telenor x SINTEF x Brain - AI Hackathon 2021 - 1st place solution 🥇

Unsupervised Anomaly Detection for Telenor Network Data

Hackathon 2021 - Group 5

Installation

This project lists its dependencies in a requirements.txt file. To install them, run:

cd brain-cogito-hackathon-2021/
pip install -r requirements.txt

The requirements.txt file lists the Python libraries the project depends on.

You will also need software that can run a Jupyter Notebook.

If you do not have Python installed yet, we recommend the Anaconda distribution of Python, which ships with Jupyter and many commonly used data science packages.

Usage

To run this project you need access to the dataset files hackathon_kpis_anonymised.csv and relative_distance.csv. All code needed to recreate our results is in this repository.

'hackathon_kpis_anonymised.csv'
'relative_distance.csv'

These files can be found on Telenor's Google Drive.
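Before running anything, it can help to confirm that both files are where the code expects them. The sketch below is a hypothetical helper, not part of the repository; it assumes the files sit in a data/ directory, as the paths in the preprocessing example suggest. Note that the preprocessing example reads hackathon_kpis_anonymised-1.csv, so adjust the names if your download differs.

```python
from pathlib import Path

# File names are taken from this README. The data/ directory is an assumption
# based on the paths used in the preprocessing example.
REQUIRED_FILES = ("hackathon_kpis_anonymised.csv", "relative_distance.csv")


def missing_datasets(data_dir="data"):
    """Return the required dataset files that are not present in data_dir."""
    root = Path(data_dir)
    return [name for name in REQUIRED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_datasets()
    if missing:
        print("Missing dataset files:", ", ".join(missing))
    else:
        print("All dataset files found.")
```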

preprocessing

All code for this step is in the preprocessing.py file in the repository root.

from preprocessing import DataPreProcessor
from hierarchical_clustering import HierarchicalClustering
import pandas as pd

path_distance = 'data/relative_distance.csv'
path_harc = 'data/cell_clusters.csv'

# preprocessing the data for hierarchical clustering
hiarc_df = (
    HierarchicalClustering.from_path(path=path_distance)
    .cluster_data()
    .extract_cell_name_from_clusters()
)

hiarc_df.to_csv(path_harc)

path = 'data/hackathon_kpis_anonymised-1.csv'

# Preprocess by extracting each feature from the cell name.
# Fix the failure rates: for example, 0 calls divided by 0 gives NaN, which is replaced with 0.
# process_from_path uses the hierarchical clustering data from the CSV written above.

df = (
    DataPreProcessor()
    .read_file(read_kpis_path=path)
    .extract_cell_name_data()
    .fix_failure_rates()
    .process_from_path(path_harc)
    .fetch_data()
)

print(df.head())

The example above uses every preprocessing function we provide for the data given in this assignment.
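To illustrate the failure-rate fix mentioned in the comments, here is a minimal pandas sketch of the idea: a rate computed as 0 divided by 0 is NaN, and it is replaced with 0. The function name and inputs are hypothetical and are not taken from preprocessing.py; the actual fix_failure_rates implementation may differ.

```python
import pandas as pd


def safe_failure_rate(failures, attempts):
    """Failure rate per row, with 0 / 0 treated as 0 instead of NaN.

    Hypothetical helper for illustration only.
    """
    failures = pd.Series(failures, dtype="float64")
    attempts = pd.Series(attempts, dtype="float64")
    rate = failures / attempts  # 0 / 0 evaluates to NaN here
    no_traffic = (attempts == 0) & (failures == 0)
    return rate.mask(no_traffic, 0.0)  # replace only the 0 / 0 rows


rates = safe_failure_rate([0, 1, 5], [0, 4, 10])
print(rates.tolist())
```

Only rows with no attempts and no failures are overwritten, so genuine missing values elsewhere are left untouched.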

DeepAnT

Running DeepAnT is straightforward. At the top of DeepAnT.py, configure whether to train on CUDA or CPU, then run the file:

python DeepAnT.py
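If you are unsure which device to configure, a small check like the one below can tell you whether CUDA is usable on your machine. This is a sketch under assumptions: it presumes DeepAnT is trained with PyTorch, and pick_device is a hypothetical helper, not a name from DeepAnT.py.

```python
def pick_device(prefer_cuda=True):
    """Return "cuda" if requested and available, otherwise "cpu".

    Hypothetical helper; assumes PyTorch is the training framework.
    """
    try:
        import torch
    except ImportError:
        # Without PyTorch installed there is no GPU backend to query.
        return "cpu"
    if prefer_cuda and torch.cuda.is_available():
        return "cuda"
    return "cpu"


print("Suggested device:", pick_device())
```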

isolationForest

Running the Isolation Forest is straightforward. Open the IsolationForest.ipynb notebook in Jupyter and run each cell from top to bottom:

jupyter notebook IsolationForest.ipynb
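For readers new to the technique, the following is a minimal sketch of Isolation Forest on synthetic data using scikit-learn. It is not the notebook's code: the data, the contamination value, and the other parameters are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

# Synthetic stand-in for KPI features: 200 ordinary points and one far outlier.
rng = np.random.default_rng(0)
normal = rng.normal(loc=0.0, scale=1.0, size=(200, 2))
outlier = np.array([[10.0, 10.0]])
X = np.vstack([normal, outlier])

# contamination is the assumed share of anomalies in the data.
model = IsolationForest(n_estimators=100, contamination=0.01, random_state=0)
labels = model.fit_predict(X)    # -1 marks an anomaly, 1 marks a normal point
scores = model.score_samples(X)  # lower scores are more anomalous

print("Flagged rows:", np.where(labels == -1)[0])
```

Isolation Forest needs no labels, which fits the unsupervised setting of this project: points that random splits isolate quickly receive low scores.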

STD

Running the STD baseline is straightforward. Open the Std-baseline.ipynb notebook in Jupyter and run each cell from top to bottom:

jupyter notebook Std-baseline.ipynb
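A standard-deviation baseline usually flags values that lie more than k standard deviations from the mean. The sketch below shows that general idea with NumPy; the function name and the threshold k=3 are assumptions, and the notebook and std_detector.py may implement the baseline differently.

```python
import numpy as np


def std_anomalies(values, k=3.0):
    """Flag values more than k standard deviations away from the mean.

    Hypothetical helper for illustration only.
    """
    values = np.asarray(values, dtype=float)
    mean = values.mean()
    std = values.std()
    if std == 0:
        # A constant series has no spread, so nothing is flagged.
        return np.zeros(values.shape, dtype=bool)
    return np.abs(values - mean) > k * std


flags = std_anomalies([1.0] * 19 + [50.0])
print("Anomalous indices:", np.where(flags)[0])
```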

