Can you train a RNN using RL?[D]

[removed]