Overview

Developed by Menlo Research, AlphaMaze is a novel model designed to enhance and assess visual reasoning in large language models (LLMs). Unlike approaches that rely on complex image generation, AlphaMaze uses a surprisingly simple task: solving text-based mazes. This requires the LLM to internally reconstruct the maze, plan its path, and strategically reset after dead ends. To further improve AlphaMaze's capabilities, we utilize the GRPO (Generalized Relative Policy Optimization) method. The AlphaMaze model itself offers a richer, more nuanced assessment of spatial understanding than traditional multiple-choice tests.

Variants

No Variant Cortex CLI command
1 gguf cortex run alphamaze-v0.2

Use it with Jan (UI)

  1. Install Jan using Quickstart
  2. Use in Jan model Hub:
    cortexso/alphamaze-v0.2
    

Use it with Cortex (CLI)

  1. Install Cortex using Quickstart
  2. Run the model with command:
    cortex run alphamaze-v0.2
    

Credits

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.