Evolution of the evaluation functions, Reinforcement and TD errors -FQL algorithm