WorldModel Gym

Imagination Benchmark

Evaluate long-horizon planning agents under sparse rewards, partial observability, and procedural generalization.

WorldModel Gym combines benchmark tasks, model-based planners, deterministic seed tracks, and a reproducible leaderboard platform.

MemoryMaze

Key-door maze under limited FOV and delayed terminal reward.

SwitchQuest

Discover and execute hidden subgoal chain in partial observations.

CraftLite

Resource collection and crafting dependencies with strict sparse mode.