-
Notifications
You must be signed in to change notification settings - Fork 0
/
ml_q_4.html
37 lines (35 loc) · 2.4 KB
/
ml_q_4.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
<!DOCTYPE html>
<html>
<head>
<title></title>
<link rel="stylesheet" type="text/css" href="style2.css">
<style>
body{
font-size: 40px;
text-align: center;
}
</style>
</head>
<body>
<h1>Reinforcement Learning(RL) </h1>
<div style="width: 100%;text-align: center;overflow: wr">
<p>
Reinforcement Learning(RL) is a type of machine learning technique that enables an agent to learn in an interactive environment by trial and error using feedback from its own actions and experiences.
<br>
Though both supervised and reinforcement learning use mapping between input and output, unlike supervised learning where feedback provided to the agent is correct set of actions for performing a task, reinforcement learning uses rewards and punishment as signals for positive and negative behavior.
<br>
As compared to unsupervised learning, reinforcement learning is different in terms of goals. While the goal in unsupervised learning is to find similarities and differences between data points, in reinforcement learning the goal is to find a suitable action model that would maximize the total cumulative reward of the agent. The figure below represents the basic idea and elements involved in a reinforcement learning model.
</p>
</div>
<!-- Image Map Generated by http://www.image-map.net/ -->
<img src="mlre.png" style="width:100%;height: 71.53%;" usemap="#image-map">
<map name="image-map">
<area alt="Feedback from the environment" title="Feedback from the environment" coords="1132,524,1348,618" shape="rect">
<area alt="The machine" title="The machine" coords="606,21,722,78" shape="rect">
<area alt="Physical world in which the agent operates" title="Physical world in which the agent operates" coords="31,24,279,66" shape="rect">
<area alt="Method to map agent’s state to actions" title="Method to map agent’s state to actions" coords="1279,750,1396,792" shape="rect">
<area alt="Value: Future reward that an agent would receive by taking an action in a particular state" title="Value: Future reward that an agent would receive by taking an action in a particular state" coords="253,478,452,534" shape="rect">
<area alt="Value: Future reward that an agent would receive by taking an action in a particular state" title="Value: Future reward that an agent would receive by taking an action in a particular state" coords="404,731,618,815" shape="rect">
</map>
</body>
</html>