[Reading Note]: Asynchronous Stochastic Approximation and Q-Learning