This project is to help relevant organization build situational awareness by live twitter data from the location identified by NASA satellite that’s on fire (detected by the surface temperature and light). So we can help people in need or arranging resources.
There are 3 types of data needed for this project. The over all work-flow for data collection is to collected labeled twitter data build a model. Using live satellite data to get fire coordinate information. Searching twitter in those in those locations as testing set to make prediction.
-
- Live Fire Information The Fire Information for Resource Management System (FIRMS) distributes Near Real-Time (NRT) active fire data within 3 hours of satellite observation from both the Moderate Resolution Imaging Spectroradiometer (MODIS) and the Visible Infrared Imaging Radiometer Suite (VIIRS).
-
- Testing data from Testing data is collected through Twitter Python API. Search criterion is the fire coordinates generated from FIRMS and 10 miles radius
Training and testing twitter are both applied to stemming process first and then to a TFIDF vectorizer with a max feature of 2000. After using NLP method clean and preprocess the Twitter data. I feed data into these 4 models, and the table shows the accuracy of each model. As we can see, SVC model have the highest accuracy.
training | testing | |
---|---|---|
Logistic Regression | 95.6% | 93.35% |
SVC | 97.99% | 94.81% |
Random Forest | 86.42% | 85.72% |
Neural Network | 94.23% | 93.35% |
Folium is based on leaflet.js and adapted to python, it’s very easy to use and powerful tool to build an interactive map. For this project I'm using Folium to build the map, the following link is a demonstration of how the map works:Demo Map
Using folium to build Interactive map in HTML.
Here is a link to the Presentation.