Data Mining

Important URLs

Usage

# Start and stop Hadoop
/usr/local/Cellar/hadoop121/1.2.1/bin/start-all.sh
/usr/local/Cellar/hadoop121/1.2.1/bin/stop-all.sh

# Hadoop dir
/usr/local/Cellar/hadoop121/1.2.1

# Copy data to HDFS
hadoop dfs -copyFromLocal /Users/lukas/data-mining/example/input /user/hduser/example

# Run the job
# Mapper and reducer paths are local, input and output paths are HDFS
hadoop jar ~/.bin/hadoop-streaming-1.2.1.jar \
-mapper /Users/lukas/data-mining/example/mapper.py \
-reducer /Users/lukas/data-mining/example/reducer.py \
-input "/user/hduser/example/*" \
-output /user/hduser/example-output

# List and output the results
hadoop dfs -ls /user/hduser/example-output
hadoop dfs -cat /user/hduser/example-output/part-00000

# Copy data to local dir
hadoop dfs -copyToLocal /user/hduser/example-output /Users/lukas/data-mining/example/output

# Delete dir
hadoop dfs -rmr /user/hduser/example-output

Name		Name	Last commit message	Last commit date
Latest commit History 212 Commits
1-lsh		1-lsh
2-svm		2-svm
3-kmeans		3-kmeans
4-bandit		4-bandit
exercises		exercises
finalReport		finalReport
gold-digger		gold-digger
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gold-digger.zip		gold-digger.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

1-lsh

1-lsh

2-svm

2-svm

3-kmeans

3-kmeans

4-bandit

4-bandit

exercises

exercises

finalReport

finalReport

gold-digger

gold-digger

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

gold-digger.zip

gold-digger.zip

Repository files navigation

Data Mining

Important URLs

Usage

About

Releases 1

Packages

Contributors 3

Languages

License

lukaselmer/ethz-data-mining

Folders and files

Latest commit

History

Repository files navigation

Data Mining

Important URLs

Usage

About

Topics

Resources

License

Stars

Watchers

Forks

Languages