bhowmist/NASA_Server_Log_Analysis---Apache-Spark

NASA_Server_Log_Analysis---Apache-Spark

Analysing semi-structured data with Apache Spark.

Log data comes from many sources, such as web, file, and compute servers, application logs, and user-generated content. It can be used for monitoring servers, improving business and customer intelligence, building recommendation systems, detecting fraud, and much more. Here, Spark is used to explore and mine real Apache web server log files. The data set comes from the NASA Kennedy Space Center web server in Florida. The full data set is freely available at http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html and contains all HTTP requests for two months. This analysis uses a subset that contains only several days' worth of requests.
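The README does not show the parsing code itself, so the following is only a minimal sketch of how one line of such a log might be turned into structured fields. It assumes the lines follow the Apache Common Log Format. The names `LOG_PATTERN`, `parse_log_line`, and `SAMPLE_LINES`, as well as the sample lines themselves, are made up for illustration and are not taken from the repository or the data set. The example is plain Python so it runs without a Spark installation.

```python
import re
from collections import Counter
from datetime import datetime

# Assumed layout (Apache Common Log Format):
#   host ident user [timestamp] "method path protocol" status size
LOG_PATTERN = re.compile(
    r'^(\S+) (\S+) (\S+) \[([^\]]+)\] "(\S+) (\S+)\s*(\S*)" (\d{3}) (\S+)$'
)


def parse_log_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LOG_PATTERN.match(line.strip())
    if match is None:
        return None
    host, _ident, _user, ts, method, path, protocol, status, size = match.groups()
    return {
        "host": host,
        "timestamp": datetime.strptime(ts, "%d/%b/%Y:%H:%M:%S %z"),
        "method": method,
        "path": path,
        "protocol": protocol,
        "status": int(status),
        # The size field is "-" when no content was returned.
        "size": int(size) if size.isdigit() else 0,
    }


# Hypothetical sample lines, written by hand to match the assumed format.
SAMPLE_LINES = [
    'host1.example.com - - [01/Aug/1995:00:00:01 -0400] "GET /index.html HTTP/1.0" 200 1839',
    'host2.example.com - - [01/Aug/1995:00:00:07 -0400] "GET /missing.gif HTTP/1.0" 404 -',
    'this line is not a valid log entry',
]

parsed = [parse_log_line(line) for line in SAMPLE_LINES]
records = [record for record in parsed if record is not None]

# Count responses per HTTP status code across the successfully parsed lines.
status_counts = Counter(record["status"] for record in records)
```

In a PySpark job, a function like this could be applied to each line of an RDD with `map` and the `None` results dropped with `filter`, which keeps malformed lines from failing the whole job.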
