Temporal Difference Reinforcement Learning Research Articles