Abstract: Iterative learning control (ILC) is typically applied in practice combined with a feedback controller for time-domain stability. In this closed-loop design with actuator constraints, ...
RULER (Relative Universal LLM-Elicited Rewards) eliminates the need for hand-crafted reward functions by using an LLM-as-judge to automatically score agent trajectories. Simply define your task in the ...