Question 1

What is Prompt Studio?

Accepted Answer

Prompt Studio is OrchStack's dedicated prompt engineering environment. It is a full-featured editor where you write, version, test, and score prompts before they reach production. Think of it as an IDE specifically designed for prompt engineering — with syntax highlighting, variable autocomplete, multi-role support, version control, A/B testing, and quality scoring built in. Every prompt change is tracked, every version is immutable, and every output can be measured against ground truth.

Question 2

How does version control work for prompts?

Accepted Answer

Every time you save a prompt in Prompt Studio, an immutable version is created with a version number, author, timestamp, and commit message. You can compare any two versions with a visual diff that highlights additions, deletions, and modifications — just like Git. Versions can be tagged (e.g., 'production', 'staging'), rolled back to with one click, and branched for experimentation. The full version history is searchable and exportable. Rollback is instant and does not destroy intermediate versions.

Question 3

Can I use variables in my prompts?

Accepted Answer

Yes. Prompt Studio supports dynamic variable injection using the {{variable_name}} syntax. Variables are resolved at runtime from the conversation context, tenant configuration, or external data sources. In the editor, you get autocomplete suggestions for available variables, type hints, and validation. You can also define test values for each variable so you can preview prompt outputs with realistic data without deploying to production.

Question 4

How does A/B comparison work?

Accepted Answer

Create two or more prompt variants, then run them against the same set of test inputs. Prompt Studio renders the outputs side-by-side with scoring on accuracy, tone, relevance, and any custom metrics you define. Statistical significance testing (using a two-sample t-test) tells you whether the difference is real or noise. The winning variant can be promoted to production with one click. You can also run A/B tests in production by routing a percentage of live traffic to each variant.

Question 5

What metrics are available for scoring prompts?

Accepted Answer

Prompt Studio scores outputs across four dimensions: Accuracy (semantic similarity to expected output, factual correctness), Tone (formality, friendliness, brand alignment measured via classifier), Relevance (how well the response addresses the user's intent, information completeness), and Latency (time to generate the response, token efficiency). Each metric is scored 0-100 and weighted to produce a composite score. You can add custom evaluation functions for domain-specific metrics like legal compliance or medical accuracy.

Version, Test, and Perfect Every Prompt

Write Prompts with Precision and Power

Git-like Versioning for Every Prompt

Version History

Compare Prompt Variants Side-by-Side

Variant A

Variant B

Score Every Prompt for Accuracy, Tone, and Relevance

Accuracy

Tone

Relevance

Latency

Score Trend — Last 4 Versions

Complete Prompt Engineering Toolkit

Rich Prompt Editor

Git-like Versioning

A/B Comparison

Accuracy Scoring

Variable Injection

Multi-Role Support

Prompt Studio FAQ

Perfect Every Prompt Before It Ships