Elevating the agent beyond simple input/output to an entity that reasons about its next action and learns from observations