Skip to content

Retrieves handles (Facebook, Twitter, iOS App, Google Play Store App) from a webpage given a csv of URLs and returns the handles in a JSON string.

Notifications You must be signed in to change notification settings

rwason/Handle-Retriever

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Handle-Retriever

Retrieves handles (Facebook, Twitter, iOS App, Google Play Store App) from a webpage given a csv of URLs and returns the handles in a JSON string.

To run enter "python3 main.py" on the command line with appropriate input url csv.

Input: input_urls.csv Output: result.json

Given a url (https://nextdoor.com/), it will parse the webpage for any handles and return it in a format as such:

{ "facebook": "nextdoor", "google": "id=com.nextdoor", "twitter": "nextdoor" }

External libraries used: FuzzyWuzzy (for fuzzy matching of strings), BeautifulSoup (for parsing meta content of source HTML)

About

Retrieves handles (Facebook, Twitter, iOS App, Google Play Store App) from a webpage given a csv of URLs and returns the handles in a JSON string.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages