Overview
Developed by Menlo Research, AlphaMaze is a model designed to enhance and assess visual reasoning in large language models (LLMs). Rather than relying on complex image generation, AlphaMaze uses a deliberately simple task: solving text-based mazes. To solve one, the LLM must internally reconstruct the maze, plan a path, and reset strategically after hitting a dead end. To further improve AlphaMaze's capabilities, we train it with GRPO (Group Relative Policy Optimization). AlphaMaze also offers a richer, more nuanced assessment of spatial understanding than traditional multiple-choice tests.
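To make the task concrete, here is a minimal Python sketch of what "plan a path and reset after dead ends" means on a text maze. It is a plain depth-first search with backtracking, not AlphaMaze's method. The ASCII layout, the move names, and the `solve` and `follow` helpers are all assumptions made for illustration; the model's actual prompt format is not described here.

```python
# Hypothetical ASCII maze: '#' wall, '.' open cell, 'S' start, 'G' goal.
# This layout is an illustration only, not AlphaMaze's real input format.
MAZE = [
    "#######",
    "#S.#..#",
    "#.##.##",
    "#....G#",
    "#######",
]

# Move names mapped to (row delta, column delta).
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def find(maze, ch):
    """Return the (row, col) of the first occurrence of ch."""
    for r, row in enumerate(maze):
        c = row.find(ch)
        if c != -1:
            return (r, c)
    raise ValueError(f"{ch!r} not found in maze")


def is_open(maze, cell):
    """True if cell is inside the grid and not a wall."""
    r, c = cell
    return 0 <= r < len(maze) and 0 <= c < len(maze[r]) and maze[r][c] != "#"


def solve(maze):
    """Depth-first search; returns a list of move names or None."""
    start, goal = find(maze, "S"), find(maze, "G")
    visited = {start}
    path = []

    def dfs(cell):
        if cell == goal:
            return True
        for name, (dr, dc) in MOVES.items():
            nxt = (cell[0] + dr, cell[1] + dc)
            if is_open(maze, nxt) and nxt not in visited:
                visited.add(nxt)
                path.append(name)
                if dfs(nxt):
                    return True
                path.pop()  # dead end: undo the move and try another
        return False

    return path if dfs(start) else None


def follow(maze, moves):
    """Check that a move sequence goes from 'S' to 'G' without hitting a wall."""
    cell, goal = find(maze, "S"), find(maze, "G")
    for name in moves:
        dr, dc = MOVES[name]
        cell = (cell[0] + dr, cell[1] + dc)
        if not is_open(maze, cell):
            return False
    return cell == goal


if __name__ == "__main__":
    moves = solve(MAZE)
    print(moves, follow(MAZE, moves))
```

On this maze the search first walks into the dead-end pocket in the upper right, backs out, and then reaches the goal. A checker like `follow` is one way a proposed move sequence could be scored automatically.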
Variants
| No | Variant | Cortex CLI command |
|---|---|---|
| 1 | gguf | cortex run alphamaze-v0.2 |
Use it with Jan (UI)
- Install Jan using Quickstart
- Search for the model in the Jan model Hub:
cortexso/alphamaze-v0.2
Use it with Cortex (CLI)
- Install Cortex using Quickstart
- Run the model with the command:
cortex run alphamaze-v0.2
Credits