Skip to content

dariacodes/Frapalyzer

Repository files navigation

Frapalyzer

The project provides an algorithm of apocope detection in French text. Everything is written in Python except for the simplest (single-page) crawlers.

Note: To get the code up and running without headache, you can use the Anaconda distribution of Python. It includes all necessary libraries, as well as a good IDE and a shell.

Project content

  • To access the console menu run user_interface.py.

  • The corpus of apocopes can be found in dict_apo_auto.csv. It currently contains 1295 apocopes with their full forms.

  • Some relevant statistics can be found in dict_apo_aug.csv.

  • The list of full forms used by our algorithm (based on Le Grand Robert) can be found in list_words.csv.

  • The list of words encountered in the articles parsed from Le Point can be found in datasets.csv.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published