CMPUT 651 Project

By: Sarah Davis, Delaney Lothian, and Henry Tang

The objective of this project is to explore a solution for the transduction problem. This can be described as the task to not only accept or reject a given string, but additionally translate the string into another language. In other words, transduction is the combination of recognition with the task of generation.

Final Submission Related

Output Files

For machine outputs, graphs, and models that were used in the final paper, please see the output_files folder.

Code

The code in this repository is incomplete. The large majority of the coding working can be found here, as we built of the fork of another repository.

Human Readable logs

For our google sheet of human readable logs, click here

Introduction

This repository contains the code, instructions, and data for our project.

The Datasets directory contains the scripts to generate our mathematical equations dataset. This script generates simple mathematical equations (operators: +,-,/,*,(,)) and its reverse polish notation equivalent.

The NNPDA directory contains the (work in progress) code and instructions for the implementation of a neural network pushdown automata (NNPDA). This algorithm is an important part of our approach for solving the transduction problem.

The supervised_approaches directory contains the code and instructions for a sequence-to-sequence implementation. This algorithm is used to understand how seq2seq can be used for this project, as well as give an idea of what we need to incorporate into the NNPDA code.

The output_files directory contains the results from each of the project parts. Currently, only the seq2seq model outputs to this directory, containing the results of translating from French to English and infix to postfix mathematical notation.

Installation and execution

Activate a virtual environment and install dependencies

python3 -m venv venv

venv\Scripts\activate.bat  # windows
source venv/bin/activate  # unix

pip install -r requirements.txt

Generate the mathematical equations dataset
```
cd Datasets
python main.py
```
Run the part of the project you are interested in. More details can be found in the README.md files of each project part.

Data

Datasets/infix_dataset.tsv is the training data we generated and Datasets/infix_dataset_test.tsv is our test set.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Datasets

Datasets

NNPDA

NNPDA

output_files

output_files

supervised_approaches

supervised_approaches

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

CMPUT 651 Project

Final Submission Related

Output Files

Code

Human Readable logs

Introduction

Installation and execution

Data

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
Datasets		Datasets
NNPDA		NNPDA
output_files		output_files
supervised_approaches		supervised_approaches
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

sacha-davis/651-project

Folders and files

Latest commit

History

Repository files navigation

CMPUT 651 Project

Final Submission Related

Output Files

Code

Human Readable logs

Introduction

Installation and execution

Data

About

Resources

License

Stars

Watchers

Forks

Languages