FFFF00-FF48D3 · https://highlighterai.com/welcome Public
by @PulFaire
Our experiments are conducted in an unsupervised goal-conditioned setting, where no demonstrations or rewards are provided, so an agent must explore (from scratch) and learn how to maximize the likelihood of reaching commanded goals.
Join HighlighterAI to save and share your own highlights.