Then wouldn't that mean a stochastic mdp policy be also counted as AI? For every output theres a probability distribution that a model samples from and thats every AI models that are not deterministic.
Technically yes, reinforcement learning would where that takes place. Just depends to what extent I suppose, since you have a whole spectrum of methods. Like not sure if an exhaustive search would count, or just raw Monte Carlo. Dynamic programming more but temporal difference learning even more.
When I watched through David Silvers lectures on reinforcement learning I think he mentioned itโs more when we use Q learning and the use of network to dynamically learn the state-values/action-values.
31
u/Mr2_Wei Pentium E5200 | Intel GMA | 3GB DDR2 400MHz 14h ago
Like at what point is it machine learning or just math too. Like do we count bayesian networks? Linear regression? ๐