> ## Documentation Index
> Fetch the complete documentation index at: https://mintlify.com/primeintellect-ai/verifiers/llms.txt
> Use this file to discover all available pages before exploring further.

# Verifiers

> Environments for LLM Reinforcement Learning

<img className="block dark:hidden" src="https://github.com/user-attachments/assets/40c36e38-c5bd-4c5a-9cb3-f7b902cd155d" alt="Prime Intellect" />

<img className="hidden dark:block" src="https://github.com/user-attachments/assets/6414bc9b-126b-41ca-9307-9e982430cde8" alt="Prime Intellect" />

## Welcome to Verifiers

Verifiers is a Python library for creating environments to train and evaluate LLMs with reinforcement learning. Build custom RL environments with datasets, reward functions, and interaction harnesses for training capable language model agents.

<CardGroup cols={2}>
  <Card title="Quick Start" icon="rocket" href="/quickstart">
    Get up and running in minutes with your first environment
  </Card>

  <Card title="Installation" icon="download" href="/installation">
    Install Verifiers with pip, uv, or poetry
  </Card>

  <Card title="GitHub" icon="github" href="https://github.com/PrimeIntellect-ai/verifiers">
    View the source code and contribute
  </Card>

  <Card title="Environments Hub" icon="cube" href="https://app.primeintellect.ai/dashboard/environments?ex_sort=most_stars">
    Browse and share environments with the community
  </Card>
</CardGroup>

## What are Environments?

Environments contain everything required to run and evaluate a model on a particular task:

* **Dataset** - Task inputs for the model to solve
* **Harness** - Tools, sandboxes, context management, and interaction protocols
* **Rubric** - Reward functions to score the model's performance

Environments can be used for:

* Training models with reinforcement learning (RL)
* Evaluating model capabilities
* Generating synthetic data
* Experimenting with agent harnesses
* Creating custom benchmarks

## Key Features

<CardGroup cols={2}>
  <Card title="Multiple Environment Types" icon="layer-group">
    From simple single-turn Q\&A to complex multi-turn agents with tool use
  </Card>

  <Card title="Flexible Reward Functions" icon="star">
    Define custom rubrics with multiple reward functions and metrics
  </Card>

  <Card title="Tool Integration" icon="wrench">
    Stateless tools, MCP servers, sandboxed execution, and browser automation
  </Card>

  <Card title="Seamless Training" icon="brain">
    Integrated with prime-rl for distributed RL training
  </Card>
</CardGroup>

## Simple Example

Create a basic math environment in just a few lines:

```python theme={null}
import verifiers as vf
from datasets import Dataset

def load_environment():
    # Your task data
    dataset = Dataset.from_list([
        {"prompt": [{"role": "user", "content": "What is 2+2?"}], "answer": "4"},
        {"prompt": [{"role": "user", "content": "What is 3*5?"}], "answer": "15"},
    ])
    
    # Your reward function
    async def correct_answer(completion, answer) -> float:
        response = completion[-1]["content"]
        return 1.0 if answer in response else 0.0
    
    rubric = vf.Rubric(funcs=[correct_answer])
    
    return vf.SingleTurnEnv(dataset=dataset, rubric=rubric)
```

## Ecosystem Integration

Verifiers is tightly integrated with the Prime Intellect platform:

* **[Environments Hub](https://app.primeintellect.ai/dashboard/environments?ex_sort=most_stars)** - Share and discover environments
* **[prime-rl](https://github.com/PrimeIntellect-ai/prime-rl)** - Distributed training framework
* **[Hosted Training](https://app.primeintellect.ai/dashboard/training)** - Cloud training platform
* **[Prime Inference](https://docs.primeintellect.ai/inference/overview)** - Model serving for evaluation

## Get Started

<CardGroup cols={2}>
  <Card title="Installation Guide" icon="terminal" href="/installation">
    Install Verifiers and the Prime CLI
  </Card>

  <Card title="Quick Start Tutorial" icon="book-open" href="/quickstart">
    Build and run your first environment
  </Card>
</CardGroup>

## Latest Updates

<Note>
  **v0.1.9** (January 2026) - New experimental environment classes, monitor rubrics for automatic metric collection, improved workspace setup, better error handling, and documentation overhaul.
</Note>

* **v0.1.9** - Experimental environment classes, monitor rubrics, improved workspace setup
* **v0.1.8** - Trajectory-based rollout tracking for token-in token-out training
* **v0.1.7** - Improved quickstart, new RLTrainer, documentation improvements

## Citation

If you use Verifiers in your research, please cite:

```bibtex theme={null}
@misc{brown_verifiers_2025,
  author       = {William Brown},
  title        = {{Verifiers}: Environments for LLM Reinforcement Learning},
  howpublished = {\url{https://github.com/PrimeIntellect-ai/verifiers}},
  year         = {2025}
}
```

Originally created by Will Brown ([@willccbb](https://github.com/willccbb)).
