Reinforcement Learning (RL) is a unique subset of machine learning where an agent learns by interacting with an environment. Unlike supervised learning, RL does not rely on labeled datasets. Instead, it employs a reward-based system where the agent receives feedback (positive rewards for desired actions and penalties for suboptimal ones). Through trial and error, the agent aims to maximize its cumulative reward over time by discovering the best policy. For instance, RL is used in robotics to enable autonomous movement, in gaming AI (e.g., AlphaGo), and in resource management (e.g., optimizing energy grids). The agent’s learning occurs iteratively, using algorithms like Q-learning or policy gradients, making it essential for dynamic decision-making tasks in uncertain environments. Why Other Options Are Incorrect:
Repo and Reverse repo rates are two rates set by RBI for .................... ?
What does C stand for in BCSBI ?
RBI recently reduced the risk weightage on home loans above Rs 75 lakh from 75% to
Arrangement made for the likely loss in the profit and loss account while finalizing accounts of banks is known as...............................
Which of the following is the 8 digit code and extended upto 11 digits?
In SFMS, M denotes
IFSC Code contains how many characters that facilitate fund transfer in any part of the country?
Maximum limit of SLR is
Which of the following Bank also owns a linkage Program called SHG’s.
The first RRB was set up at