cortexso/alphamaze-v0.2 · Hugging Face

Overview

Developed by Menlo Research, AlphaMaze is a novel model designed to enhance and assess visual reasoning in large language models (LLMs). Unlike approaches that rely on complex image generation, AlphaMaze uses a surprisingly simple task: solving text-based mazes. This requires the LLM to internally reconstruct the maze, plan its path, and strategically reset after dead ends. To further improve AlphaMaze's capabilities, we utilize the GRPO (Generalized Relative Policy Optimization) method. The AlphaMaze model itself offers a richer, more nuanced assessment of spatial understanding than traditional multiple-choice tests.

Variants

No	Variant	Cortex CLI command
1	gguf	`cortex run alphamaze-v0.2`

Use it with Jan (UI)

Install Jan using Quickstart
Use in Jan model Hub:
```
cortexso/alphamaze-v0.2
```

Use it with Cortex (CLI)

Install Cortex using Quickstart
Run the model with command:
```
cortex run alphamaze-v0.2
```

Credits

Author: HomebrewResearch
Converter: Homebrew
Original License: License
Papers: Papers