File Fragment Type Identification using Recurrent Neural Networks

This code is mostly a modification of code taken from "FiFTy: Large-scale File Fragment Type Identification using Neural Networks" avaiable at https://github.com/mittalgovind/fifty

To use this code follow these steps:

1- Download Scenario #1 (512-byte blocks) from http://dx.doi.org/10.21227/kfxw-8084 and unzip into data directory

2- Run utility.py to create the feature dataset at unigram folder

3- Run rnn_param.py to find the optimal hyperparameters. You can skip this step and go to next step, if you don't need to modify current best hyperparameters

4- Run rnn.py to train the network and get loss and accuracy plots, and also confusion matrix. The resulting model is saved as rnn.h5

An already saved model can be found in model directory.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

model

model

unigram

unigram

README.md

README.md

rnn.py

rnn.py

rnn_param.py

rnn_param.py

utility.py

utility.py

Repository files navigation

File Fragment Type Identification using Recurrent Neural Networks

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
model		model
unigram		unigram
README.md		README.md
rnn.py		rnn.py
rnn_param.py		rnn_param.py
utility.py		utility.py

ali-i-abbas/file-fragment-rnn

Folders and files

Latest commit

History

Repository files navigation

File Fragment Type Identification using Recurrent Neural Networks

About

Resources

Stars

Watchers

Forks

Languages