Select colors to filter by:

Our experiments are conducted in an unsupervised goal-conditioned setting, where no demonstrations or rewards are provided, so an agent must explore (from scratch) and learn how to maximize the likelihood of reaching commanded goals.

My Highlights

1 236

performance consistently improves as scale increases in complex tasks. In addition, deep models exhibit qualitatively better behaviors which might be interpreted as implicitly acquired skills necessary to reach the goal.

AI Model Scaling and Performance

1 238

Character counts are represented on low-dimensional curved manifolds discretized by sparse feature families, analogous to biological place cells.

When Models Manipulate Manifolds: The Geometry of a Counting Task

2 302
Up next
Loading...

Arena's mission is to measure and advance the frontier of AI through open, rigorous, and community-driven evaluation.

Arena Academic Partnerships: Funding AI Evaluation Research

1 297
Up next
Loading...