Very basic reinforcement learning for the Pong game: a feed-forward network with a single hidden layer (200 neurons). Performance is roughly neutral: https://gym.openai.com/evaluations/eval_BGdO8RrmRg6r4FjFCVcTIA
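
As a rough illustration of what a network of this shape looks like, here is a minimal NumPy sketch of the forward pass. Only the single hidden layer of 200 neurons comes from the description above. The 80x80 flattened input, the ReLU activation, the sigmoid output read as the probability of moving up, the weight initialization, and the names init_params and policy_forward are all assumptions made for the example, not necessarily what this repository does.

```python
import numpy as np

HIDDEN_UNITS = 200    # from the description above
INPUT_DIM = 80 * 80   # assumption: a flattened, downsampled 80x80 frame


def init_params(rng, input_dim=INPUT_DIM, hidden_units=HIDDEN_UNITS):
    """Return small random weights for a one-hidden-layer network."""
    return {
        # Scaling by sqrt(fan-in) keeps initial activations moderate (assumed scheme).
        "W1": rng.standard_normal((hidden_units, input_dim)) / np.sqrt(input_dim),
        "W2": rng.standard_normal(hidden_units) / np.sqrt(hidden_units),
    }


def policy_forward(params, x):
    """Map a flattened frame x to P(action = UP) and the hidden activations."""
    h = np.maximum(0.0, params["W1"] @ x)   # ReLU hidden layer (assumed)
    logit = params["W2"] @ h                # single scalar output
    p_up = 1.0 / (1.0 + np.exp(-logit))     # sigmoid output (assumed)
    return p_up, h


rng = np.random.default_rng(0)
params = init_params(rng)
frame = rng.random(INPUT_DIM)               # stand-in for a preprocessed frame
p_up, hidden = policy_forward(params, frame)

# Sample an action from the policy rather than always taking the most likely one.
action = "UP" if rng.random() < p_up else "DOWN"
```

Training (for example with a policy-gradient update) is left out here; the sketch only shows how a frame would flow through the 200-unit hidden layer to an action.
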
In the cnn branch, the same task is done with a convolutional (CNN) architecture.