Skip to content

tanya34fish/event_clustering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Event Clustering and Tracking

crawler and preprocess

  • Write crawler to crawl news from PTT (Biggest BBS in Taiwan)
  • Please contact me if data may be helpful to you
  • use Jieba to segment sentences and Stanford POS tagging to filter terms

clustering

  • Perform single-pass algorithm using cluster mean
  • Please see event_mean.py for details

evaluate

  • Use Entropy/Precision/Recall/F-measure for evaluation

Reference

  • Automatic Online News Issue Construction in Web Environment (WWW, 2008)
  • A study on retrospective and online event detection (SIGIR, 1998)
  • A comparison of Document Clustering Techniques (1999)

Presentation ppt

About

Event Clustering and Tracking

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published