NASA_Server_Log_Analysis---Apache-Spark

Analysing Semi-structured data with Apache Spark.

Log data comes from many sources, such as web, file, and compute servers, application logs, user-generated content, and can be used for monitoring servers, improving business and customer intelligence, building recommendation systems, fraud detection, and much more. Here Spark is used to perform data exploration and mining on real Apache web server log files.A data set from NASA Kennedy Space Center web server in Florida is used here. The full data set is freely available at http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html, and it contains all HTTP requests for two months. We are using a subset that only contains several days' worth of requests.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Nasa_log.py		Nasa_log.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nasa_log.py

Nasa_log.py

README.md

README.md

Repository files navigation

NASA_Server_Log_Analysis---Apache-Spark

About

Releases

Packages

Languages

bhowmist/NASA_Server_Log_Analysis---Apache-Spark

Folders and files

Latest commit

History

Nasa_log.py

Nasa_log.py

README.md

README.md

Repository files navigation

NASA_Server_Log_Analysis---Apache-Spark

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages