project-panel

This is our project for CSCE474/874 Intro to Data Mining. Its goal is to identify human activities from sensor data.

src directory

It contains fft.py, which computes the FFT and plots the data in both the frequency domain and the time domain. To use fft.py, pass the data file as an argument. For example, python fft.py $PROJECT/data/run1.csv.
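
As a rough illustration of the frequency-domain step, the sketch below computes a one-sided amplitude spectrum with NumPy. It is not the code in fft.py: the function name `amplitude_spectrum`, the 50 Hz sample rate, and the synthetic signal are all assumptions made for the example, and plotting is left out.

```python
import numpy as np

def amplitude_spectrum(signal, sample_rate_hz):
    """Return (frequencies, amplitudes) for a real-valued 1-D signal."""
    signal = np.asarray(signal, dtype=float)
    n = signal.size
    coeffs = np.fft.rfft(signal)                        # one-sided FFT of a real signal
    freqs = np.fft.rfftfreq(n, d=1.0 / sample_rate_hz)  # bin centres in Hz
    amplitudes = np.abs(coeffs) / n                     # scale by the window length
    return freqs, amplitudes

# Synthetic stand-in for a sensor trace: a 2 Hz sine on a constant offset,
# sampled at an assumed 50 Hz for 2 seconds.
t = np.arange(100) / 50.0
x = 1.5 + np.sin(2 * np.pi * 2.0 * t)

freqs, amps = amplitude_spectrum(x, 50.0)
# Skip bin 0 (the DC component) when looking for the dominant frequency.
peak_hz = freqs[1:][np.argmax(amps[1:])]
```

Here `amps[0]` carries the constant offset, and `peak_hz` is the dominant oscillation once the DC bin is skipped.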
preprocess.py converts our raw data into formatted data with the features as attributes.
We separate the DC component (fft[0]) from the other FFT samples because its value is very large and it is important enough to evaluate as a feature in its own right.
One of the features is called Power: the power of all frequency coefficients except the DC component. We found that the values of this feature are relatively small (10^-5 to 10^-6), so we represent it by its base-10 logarithm.
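
The two features described above could be sketched as follows. This is only an illustration, not the code in preprocess.py: the function name `dc_and_log_power` is hypothetical, and the normalization of the power (sum of squared magnitudes divided by the squared window length) is an assumption, since the exact definition is not stated here.

```python
import math
import numpy as np

def dc_and_log_power(window):
    """Return (dc, log10_power) for one window of samples."""
    window = np.asarray(window, dtype=float)
    coeffs = np.fft.rfft(window)
    dc = coeffs[0].real                 # fft[0]: the DC component, kept as its own feature
    ac = coeffs[1:]                     # every other frequency coefficient
    # Assumed normalization: squared magnitudes divided by n^2.
    power = float(np.sum(np.abs(ac) ** 2)) / window.size ** 2
    # The raw power is tiny, so it is represented on a log10 scale.
    log_power = math.log10(power) if power > 0 else float("-inf")
    return dc, log_power

# Synthetic window: a large constant offset plus a very small oscillation.
t = np.arange(100) / 50.0
window = 2.0 + 0.001 * np.sin(2 * np.pi * 2.0 * t)
dc, log_power = dc_and_log_power(window)
```

With this synthetic window the DC value is several orders of magnitude larger than the remaining power, which is the situation that motivates keeping the two as separate features.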
hmm.py implements a Gaussian HMM using the hmmlearn library https://github.com/hmmlearn/hmmlearn. It can handle any data type, including continuous time series. It plots four figures. The first figure is the estimated state sequence for the training data; the second is the actual labels of the training data; the third is the estimated state sequence for the test data; the fourth is the actual labels of the test data.

test directory

test_hmm.py is an example provided with the seqlearn library https://github.com/larsmans/seqlearn. This library is poorly maintained and has been deprecated from scikit-tool. In addition, it supports only nominal data (MultinomialHMM), so we consider it inappropriate for our research. Still, this toy test program helped us understand HMMs in more depth.

data directory

It contains the data collected with SensorLog.
preprocess.py identifies the activity by searching the file name. Please make sure each data file name contains an activity name. For example, run123.csv and myrun.csv are good file names for the activity RUN.
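
The file-name convention above can be sketched as a simple substring search. This is an assumption about how such a lookup might work, not the code in preprocess.py: the function name `activity_from_filename` is hypothetical, and apart from RUN the activity names in the list are placeholders.

```python
import os

# Hypothetical activity list; only "run" is taken from the examples above.
ACTIVITIES = ("run", "walk", "sit")

def activity_from_filename(path, activities=ACTIVITIES):
    """Return the upper-case activity found in the file name, or None."""
    # Look only at the file name so directory names cannot cause a match.
    name = os.path.basename(path).lower()
    for activity in activities:
        if activity in name:
            return activity.upper()
    return None

label_a = activity_from_filename("data/run123.csv")
label_b = activity_from_filename("data/myrun.csv")
label_c = activity_from_filename("data/notes.csv")
```

A file whose name contains no known activity returns `None`, which makes mislabeled files easy to spot before training.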
genfile.sh uses data file sets {1,2,3,4} to train the model and tests on data set {5}. If more data is collected in the future, please modify this file to generate the data and test files.
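
The split described above can be illustrated in Python as follows. This is not a translation of genfile.sh: the function name `split_by_set` is hypothetical, and it assumes each file name ends with its set number before the .csv extension (as in run1.csv), which may not match every file in the data directory.

```python
import re

def split_by_set(filenames, train_sets=(1, 2, 3, 4), test_sets=(5,)):
    """Split file names into (train, test) by the trailing set number."""
    train, test = [], []
    for name in filenames:
        match = re.search(r"(\d+)\.csv$", name)
        if match is None:
            continue  # no trailing set number, so the file is skipped
        set_number = int(match.group(1))
        if set_number in train_sets:
            train.append(name)
        elif set_number in test_sets:
            test.append(name)
    return train, test

files = ["run1.csv", "run5.csv", "walk3.csv", "walk5.csv", "README.md"]
train_files, test_files = split_by_set(files)
```

When more data sets are collected, only the `train_sets` and `test_sets` arguments would need to change in this sketch.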
