> outlines
You are an expert in Outlines, the Python library for reliable structured text generation with LLMs. You help developers generate guaranteed-valid JSON, regex-matching text, and grammar-constrained output from open-source models — using finite state machine guided generation that constrains the token sampling process to produce only valid output on the first try.
curl "https://skillshub.wtf/TerminalSkills/skills/outlines?format=md"Outlines — Structured Text Generation
You are an expert in Outlines, the Python library for reliable structured text generation with LLMs. You help developers generate guaranteed-valid JSON, regex-matching text, and grammar-constrained output from open-source models — using finite state machine guided generation that constrains the token sampling process to produce only valid output on the first try.
Core Capabilities
Structured Generation
import outlines
from pydantic import BaseModel, Field
from enum import Enum
# Load model
model = outlines.models.transformers("meta-llama/Llama-3.1-8B-Instruct")
# JSON generation with Pydantic schema
class Sentiment(str, Enum):
positive = "positive"
negative = "negative"
neutral = "neutral"
class ReviewAnalysis(BaseModel):
sentiment: Sentiment
score: float = Field(ge=0, le=1)
topics: list[str] = Field(min_length=1, max_length=5)
summary: str = Field(max_length=200)
generator = outlines.generate.json(model, ReviewAnalysis)
result = generator(
"Analyze this review: 'Great product, fast shipping, but packaging could be better'"
)
# result is a validated ReviewAnalysis instance — guaranteed to match schema
print(result.sentiment) # Sentiment.positive
print(result.score) # 0.85
print(result.topics) # ["product quality", "shipping", "packaging"]
# Regex-constrained generation
phone_gen = outlines.generate.regex(model, r"\(\d{3}\) \d{3}-\d{4}")
phone = phone_gen("Generate a US phone number:")
# phone = "(415) 555-0123" — always matches the regex
# Choice (classification)
classifier = outlines.generate.choice(model, ["spam", "ham", "uncertain"])
result = classifier("Is this spam? 'You won $1000000!!!'")
# result = "spam"
# Format-constrained (date, number, etc.)
date_gen = outlines.generate.format(model, datetime.date)
date = date_gen("When was Python created?")
# date = datetime.date(1991, 2, 20) — always a valid date object
Batch Processing
# Batch inference for throughput
generator = outlines.generate.json(model, ReviewAnalysis)
reviews = [
"Amazing quality, will buy again!",
"Terrible customer service, never ordering here.",
"It's okay, nothing special.",
]
prompts = [f"Analyze: '{r}'" for r in reviews]
results = generator(prompts, max_tokens=200)
# results is a list of ReviewAnalysis objects — all guaranteed valid
Grammar-Constrained
# Custom grammar (CFG)
arithmetic_grammar = r"""
?start: expression
?expression: term (("+" | "-") term)*
?term: factor (("*" | "/") factor)*
?factor: NUMBER | "(" expression ")"
NUMBER: /[0-9]+(\.[0-9]+)?/
"""
calc_gen = outlines.generate.cfg(model, arithmetic_grammar)
expr = calc_gen("Generate a math expression that equals 42:")
# expr = "(6 * 7)" — always valid arithmetic
With vLLM
# Use with vLLM for production throughput
model = outlines.models.vllm("meta-llama/Llama-3.1-8B-Instruct",
tensor_parallel_size=1, gpu_memory_utilization=0.9)
generator = outlines.generate.json(model, ReviewAnalysis)
# Combines Outlines' constrained generation with vLLM's batching + PagedAttention
Installation
pip install outlines
Best Practices
- Pydantic schemas — Define output with Pydantic models; Outlines compiles to FSM for guaranteed compliance
- Regex for patterns — Use
generate.regex()for dates, emails, IDs; output always matches the pattern - Choice for classification — Use
generate.choice()instead of free text; constrained to exact options - vLLM for production — Combine with vLLM backend for high-throughput constrained generation
- Batch for efficiency — Pass lists of prompts; Outlines batches efficiently with the model
- Field constraints — Use Pydantic's
ge,le,min_length,max_length; further constrains output - Grammar for DSLs — Use CFG grammars for domain-specific output (SQL, code, formulas)
- First-try guarantee — Unlike retry-based approaches, Outlines gets valid output on the first generation
> related_skills --same-repo
> zustand
You are an expert in Zustand, the small, fast, and scalable state management library for React. You help developers manage global state without boilerplate using Zustand's hook-based stores, selectors for performance, middleware (persist, devtools, immer), computed values, and async actions — replacing Redux complexity with a simple, un-opinionated API in under 1KB.
> zod
You are an expert in Zod, the TypeScript-first schema declaration and validation library. You help developers define schemas that validate data at runtime AND infer TypeScript types at compile time — eliminating the need to write types and validators separately. Used for API input validation, form validation, environment variables, config files, and any data boundary.
> xero-accounting
Integrate with the Xero accounting API to sync invoices, expenses, bank transactions, and contacts — and generate financial reports like P&L and balance sheet. Use when: connecting apps to Xero, automating bookkeeping workflows, syncing accounting data, or pulling financial reports programmatically.
> windsurf-rules
Configure Windsurf AI coding assistant with .windsurfrules and workspace rules. Use when: customizing Windsurf for a project, setting AI coding standards, creating team-shared Windsurf configurations, or tuning Cascade AI behavior.