Welcome to Prompt Forge
Your powerful toolkit for AI prompt engineering, evaluation, and optimization.
Get Started
Simple LLM-as-a-Judge Evaluation
Evaluate prompts using an LLM as a judge, scored against metrics you specify. Upload a JSON dataset and adjust the evaluation parameters.
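As a rough illustration, an LLM-as-a-judge evaluation of this kind can be sketched as below. The JSON field names and the `judge_fn` stub are assumptions for the sketch, not Prompt Forge's actual schema or API:

```python
import json

# Hypothetical dataset format: a list of prompt/response pairs.
# Field names here are illustrative assumptions, not Prompt Forge's schema.
dataset_json = """
[
  {"prompt": "Write a welcome email.", "response": "Hi, welcome aboard!"},
  {"prompt": "Summarize our refund policy.", "response": "Refunds within 30 days."}
]
"""

def judge_fn(prompt, response, metric):
    """Stand-in for a real LLM judge call; returns a 0-100 score.

    A production judge would send the prompt, the response, and a rubric
    for the chosen metric to an LLM and parse a numeric score from its reply.
    """
    # Toy heuristic so the sketch runs without an API key.
    return min(100.0, 50.0 + 10.0 * len(response.split()))

def evaluate(dataset, metric="helpfulness"):
    """Average the judge's per-example scores over the whole dataset."""
    scores = [judge_fn(r["prompt"], r["response"], metric) for r in dataset]
    return sum(scores) / len(scores)

records = json.loads(dataset_json)
avg = evaluate(records)
```

A real run would replace `judge_fn` with a call to a judge model and report `avg` as the overall score, like the scores shown in the activity feed.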
Simple Ground Truth LLM Evaluation
Evaluate prompts by comparing LLM outputs against a provided ground truth dataset.
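A minimal sketch of this kind of ground-truth comparison, assuming a simple exact-match metric and a stubbed model call (the field names and `model_fn` are illustrative assumptions):

```python
# Hypothetical ground-truth dataset: inputs paired with reference answers.
ground_truth = [
    {"input": "Capital of France?", "expected": "Paris"},
    {"input": "2 + 2?", "expected": "4"},
]

def model_fn(text):
    """Stand-in for a real LLM call; canned outputs so the sketch runs offline."""
    return {"Capital of France?": "Paris", "2 + 2?": "5"}[text]

def exact_match_score(cases):
    """Percentage of cases where the model output equals the reference exactly.

    Real evaluations often use softer metrics (normalized match, F1,
    embedding similarity) instead of strict equality.
    """
    hits = sum(model_fn(c["input"]).strip() == c["expected"] for c in cases)
    return 100.0 * hits / len(cases)

score = exact_match_score(ground_truth)
```

Here one of the two canned outputs matches its reference, so the sketch reports a score of 50.0.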
Automated Prompt Engineering (APE)
Let AI iteratively rewrite and optimize your prompt to maximize its performance against your goals.
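The iterate-and-optimize loop can be sketched as a simple generational search. Everything here is a toy stand-in: a real APE system would use an LLM to propose rewrites in `mutate` and an evaluation like the ones above as the `score` function:

```python
import random

def mutate(prompt, rng):
    """Toy 'mutation': append one of a few phrasing tweaks.

    A real APE system would ask an LLM to rewrite the prompt.
    """
    tweaks = [" Be concise.", " Think step by step.", " Use a friendly tone."]
    return prompt + rng.choice(tweaks)

def score(prompt):
    """Stand-in fitness function.

    Longer prompts score higher here purely so the loop has something
    to optimize; a real system would run an evaluation instead.
    """
    return len(prompt)

def optimize(seed_prompt, generations=10, pop=4, rng=None):
    """Keep the best-scoring candidate across a fixed number of generations."""
    rng = rng or random.Random(0)  # seeded for reproducibility
    best = seed_prompt
    for _ in range(generations):
        candidates = [mutate(best, rng) for _ in range(pop)]
        best = max(candidates + [best], key=score)
    return best

improved = optimize("Write a catchy slogan.")
```

The "Gen 3/10" status in the activity feed below corresponds to the generation counter in a loop of this shape.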
Overview
Activity Feed
View your recent tasks and project statuses.
- Marketing Campaign Prompts: Optimization Complete (2h ago)
- Evaluated "New User Welcome Email" (LLM Judge): Score 88.5 (5h ago)
- Customer Support Chatbot: Evolution In Progress (Gen 3/10) (1 day ago)
- APE for "Slogan Generation": Error during evaluation (2 days ago)
- Evaluated "FAQ Accuracy" (Ground Truth): Score 92.0 (3 days ago)