Apache Sparkler Post Processing using Machine Learning

This code connects to the Solr database that Sparkler creates for its crawled data and performs further data extraction, classification, filtering, and insight generation using various machine learning models.

The ML models can take a keyword list from the user, extract features from the content of each crawled URL, classify (score) the result, and update the corresponding Solr parameter accordingly.
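The flow described above (keywords in, score out, Solr updated) can be sketched roughly as follows. This is a minimal illustration, not the code from this repository: a simple keyword-overlap score stands in for the actual ML models, and the core URL (`crawldb` on `localhost:8983`), the field name `keyword_score`, and all function names are assumptions made for the example. The update uses Solr's JSON atomic-update syntax (`{"set": value}`), so only the score field on each document is changed.

```python
import json
import re
import urllib.request

# Assumed values for illustration only; adjust to your own Solr setup.
SOLR_UPDATE_URL = "http://localhost:8983/solr/crawldb/update?commit=true"
SCORE_FIELD = "keyword_score"  # hypothetical field name


def keyword_score(text, keywords):
    """Return the fraction of keywords that occur as tokens in the text."""
    if not keywords:
        return 0.0
    tokens = set(re.findall(r"[a-z0-9]+", text.lower()))
    hits = sum(1 for kw in keywords if kw.lower() in tokens)
    return hits / len(keywords)


def build_atomic_update(doc_id, score):
    """Build a Solr atomic-update document that sets only the score field."""
    return {"id": doc_id, SCORE_FIELD: {"set": score}}


def post_updates(updates, url=SOLR_UPDATE_URL):
    """POST the update documents to Solr as JSON (needs a running Solr)."""
    request = urllib.request.Request(
        url,
        data=json.dumps(updates).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status


if __name__ == "__main__":
    # Made-up sample documents; real ones would be read from Solr.
    keywords = ["solar", "wind", "coal"]
    docs = {
        "http://example.com/a": "Solar panels and wind turbines",
        "http://example.com/b": "Weekend football results",
    }
    updates = [
        build_atomic_update(doc_id, keyword_score(text, keywords))
        for doc_id, text in docs.items()
    ]
    # Print the payload instead of sending it, so the sketch runs offline.
    print(json.dumps(updates, indent=2))
```

Calling `post_updates(updates)` would send the payload to a live Solr instance. Keeping the scoring and payload-building steps as separate pure functions makes it easy to swap the keyword score for a trained classifier later.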

Apache Sparkler Link: https://github.com/USCDataScience/sparkler

Repository: davtalab/Apache_Sparkler_Post_Processing
