SDEC-AD for Semantic Frame Induction

Keras implementation for our paper:

Zheng-Xin Yong, Tiago Timponi Torrent. (2020). Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction. In: Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France.

Usage

Dependencies

The dependencies are:

bcubed==1.5
nltk==3.4.3
matplotlib==3.2.1
numpy==1.18.2
Keras==2.2.5
scikit_learn==0.23.0

Alternatively, run pip3 install -r requirements.txt to install all the dependencies at once.

Dataset Preparation

The data used in our research are as follows:

Berkeley FrameNet 1.7
FrameNet+
Curated anomalous lexical units (from WordNet), which can be accessed through the LRE Map repository.

We use the Python flair library to generate embeddings for the lexical units from their exemplar sentences and definitions.

The data/ folder contains the embeddings of the lexical units.

Semantic Frame Induction

Create a folder trained_SDEC_AD/ for saving the trained weights.
Run python3 semantic_frame_induction_tr.py to train the SDEC-AD model.
Run python3 semantic_frame_induction_pred.py to predict and evaluate the clusters of lexical units (LUs). Before running it, set the parameter SDEC_trained_weights in the script to the saved weights file with the highest Bcubed F1-score. The score is part of each saved file name, for example "SDEC_AD_bcubed_fscore_0.788.h5".
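
For readers unfamiliar with the Bcubed F1-score used to rank the saved weights, the following is a minimal sketch of the standard single-label B-cubed metric in plain Python. It is an illustration only and is not taken from the scripts in this repository, which rely on the bcubed package; the function name bcubed_scores and the toy cluster assignments are assumptions made for this example.

```python
from collections import Counter


def bcubed_scores(predicted, gold):
    """Return (precision, recall, f1) for two dicts mapping item -> label.

    `predicted` maps each lexical unit to an induced cluster id and
    `gold` maps the same lexical units to their gold frame.
    """
    items = list(gold)
    cluster_sizes = Counter(predicted[i] for i in items)
    frame_sizes = Counter(gold[i] for i in items)
    # Number of items that share both a predicted cluster and a gold frame.
    joint_sizes = Counter((predicted[i], gold[i]) for i in items)

    # Per-item precision: share of the item's cluster with the same gold frame.
    precision = sum(
        joint_sizes[(predicted[i], gold[i])] / cluster_sizes[predicted[i]]
        for i in items
    ) / len(items)
    # Per-item recall: share of the item's gold frame found in the same cluster.
    recall = sum(
        joint_sizes[(predicted[i], gold[i])] / frame_sizes[gold[i]]
        for i in items
    ) / len(items)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy example (hypothetical cluster assignments, not results from the paper).
gold = {
    "run.v": "Self_motion",
    "walk.v": "Self_motion",
    "buy.v": "Commerce_buy",
    "purchase.v": "Commerce_buy",
}
predicted = {"run.v": 0, "walk.v": 0, "buy.v": 0, "purchase.v": 1}

precision, recall, f1 = bcubed_scores(predicted, gold)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

In this toy case the cluster that mixes two frames lowers precision, while splitting Commerce_buy across two clusters lowers recall.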

Anomalous Lexical Units Detection

Follow the instructions in the previous section "Semantic Frame Induction" to generate the trained weights.
Set the parameter SDEC_trained_weights in anomaly_detection.py to the saved weights file with the highest Bcubed F1-score. Then run python3 anomaly_detection.py to train the decoder and detect anomalous lexical units.
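
Picking the weights file with the highest score by eye is easy to get wrong when many files have been saved. The sketch below shows one way to do it automatically, assuming every saved file follows the naming pattern shown earlier ("SDEC_AD_bcubed_fscore_<score>.h5"). The helper pick_best_weights and every file name other than "SDEC_AD_bcubed_fscore_0.788.h5" are hypothetical and not part of this repository.

```python
import re

# Assumed naming pattern: "..._bcubed_fscore_<score>.h5".
SCORE_PATTERN = re.compile(r"bcubed_fscore_(\d+(?:\.\d+)?)\.h5$")


def pick_best_weights(filenames):
    """Return (filename, score) for the highest-scoring weights file.

    Names that do not match the assumed pattern are ignored.
    Returns (None, None) if no name matches.
    """
    best_name, best_score = None, None
    for name in filenames:
        match = SCORE_PATTERN.search(name)
        if match is None:
            continue
        score = float(match.group(1))
        if best_score is None or score > best_score:
            best_name, best_score = name, score
    return best_name, best_score


# Hypothetical folder contents, used only for illustration.
saved = [
    "SDEC_AD_bcubed_fscore_0.752.h5",
    "SDEC_AD_bcubed_fscore_0.788.h5",
    "SDEC_AD_bcubed_fscore_0.61.h5",
    "notes.txt",
]
best_name, best_score = pick_best_weights(saved)
print(best_name, best_score)
```

In practice the list of names could come from os.listdir("trained_SDEC_AD"), and the returned name (joined with the folder path) would be the value to assign to SDEC_trained_weights.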
