Close | |
![]() |
|
Figure 2: Optimized path (white dashed line) obtained in different environment (a: environment 1, b: environment 2). The location of radiation sources shown in red colour. Color bar indicates the total number of times agent visited a particular state across all episodes |
|