openai Extension
This extension gives rill scripts access to OpenAI’s GPT and embedding APIs. The host binds it to a namespace of its choosing with prefixFunctions — the test host below uses prefixFunctions('llm', ext), so its scripts call llm::message(), llm::embed(), and so on. Because the prefix lives in host config, switching to Anthropic or Google means changing one line on the host side; scripts stay identical.
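For example, a host can bind whichever provider it prefers under a neutral llm prefix. A sketch of the swap (the Anthropic import is hypothetical, shown only to illustrate the shape of the change):

```ts
import { createRuntimeContext, prefixFunctions } from '@rcrsr/rill';
import { createOpenAIExtension } from '@rcrsr/rill-ext-openai';
// Hypothetical provider package, named here only to illustrate the swap:
// import { createAnthropicExtension } from '@rcrsr/rill-ext-anthropic';

// Changing providers means changing only this assignment.
const ext = createOpenAIExtension({
  api_key: process.env.OPENAI_API_KEY!,
  model: 'gpt-4o',
});

// Scripts only ever see the 'llm' prefix, so they stay identical.
const ctx = createRuntimeContext({ functions: prefixFunctions('llm', ext) });
```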
Five functions cover the core LLM operations. message sends a single prompt. messages continues a multi-turn conversation. embed and embed_batch generate vector embeddings — OpenAI offers text-embedding-3-small and text-embedding-3-large for this. tool_loop runs an agentic loop where the model calls rill closures as tools. message, messages, and tool_loop all return the same dict shape (content, model, usage, stop_reason, id, messages), so scripts work across providers without changes.
The host sets the API key, model, and temperature at creation time — scripts never handle credentials. Each call emits a structured event (e.g. openai:message, openai:tool_call) for host-side logging and metrics.
Quick Start
```ts
import { createRuntimeContext, prefixFunctions } from '@rcrsr/rill';
import { createOpenAIExtension } from '@rcrsr/rill-ext-openai';
const ext = createOpenAIExtension({
api_key: process.env.OPENAI_API_KEY!,
model: 'gpt-4o',
});
const functions = prefixFunctions('openai', ext);
const ctx = createRuntimeContext({ functions });
// Script: openai::message("Explain TCP handshakes")
```
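A minimal script against this binding, borrowing the pipeline style from the test-host examples below:

```rill
openai::message("Explain TCP handshakes") -> $.content -> log
```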
Configuration
```ts
const ext = createOpenAIExtension({
api_key: process.env.OPENAI_API_KEY!,
model: 'gpt-4o',
temperature: 0.7,
max_tokens: 4096,
system: 'You are a helpful assistant.',
embed_model: 'text-embedding-3-small',
base_url: 'https://custom-endpoint.example.com',
max_retries: 3,
timeout: 30000,
});
```

| Parameter | Type | Default | Description |
|---|---|---|---|
| api_key | string | — | API key (required) |
| model | string | — | Model identifier (required) |
| temperature | number | — | Response randomness, 0.0–2.0 |
| max_tokens | number | 4096 | Maximum response tokens |
| system | string | — | Default system prompt |
| embed_model | string | — | Model for embed operations |
| base_url | string | — | Custom API endpoint |
| max_retries | number | — | Retry attempts for failures |
| timeout | number | — | Request timeout in ms |
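Only api_key and model are required. As a sketch, pointing the extension at a local OpenAI-compatible server looks like this (the endpoint and model name are assumptions, mirroring the Ollama example under Test Host):

```ts
// Sketch: a local OpenAI-compatible endpoint. Values are assumptions.
const localExt = createOpenAIExtension({
  api_key: 'ollama',                     // local servers typically ignore the key
  model: 'llama3.2',
  base_url: 'http://localhost:11434/v1', // Ollama's OpenAI-compatible endpoint
  timeout: 30000,
});
```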
Functions
message(text, options?) — Send a single prompt:
```rill
openai::message("Explain TCP handshakes") => $result
$result.content # Response text
$result.model # Model used
$result.usage.input # Input tokens
$result.usage.output # Output tokens
```

messages(messages, options?) — Multi-turn conversation:
```rill
[
[role: "user", content: "What is rill?"],
[role: "assistant", content: "A scripting language."],
[role: "user", content: "Tell me more."],
] -> openai::messages => $result
$result.content # Latest response
$result.messages # Full conversation history
```

embed(text) — Generate text embedding:
```rill
openai::embed("sample text") => $vec
$vec.dimensions # Vector size
$vec.model # Embedding model used
```

embed_batch(texts) — Batch embeddings:
["first text", "second text"] -> openai::embed_batch => $vectors
$vectors.len # Number of vectors
```

tool_loop(prompt, options?) — Agentic tool-use loop:
tool("get_weather", "Get current weather", [city: "string"], {
"Weather in {$city}: 72F sunny"
}) => $weather_tool
openai::tool_loop("What's the weather in Paris?", [
tools: [$weather_tool],
max_turns: 5,
]) => $result
$result.content # Final response
$result.turns # Number of LLM round-trips
```

Per-Call Options
| Option | Type | Applies To | Description |
|---|---|---|---|
| system | string | message, messages, tool_loop | Override system prompt |
| max_tokens | number | message, messages, tool_loop | Override max tokens |
| tools | list | tool_loop (required) | Tool descriptors |
| max_turns | number | tool_loop | Limit LLM round-trips |
| max_errors | number | tool_loop | Consecutive error limit (default: 3) |
| messages | list | tool_loop | Prepend conversation history |
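Options are passed as a dict, using the same [key: value] syntax as the examples above. A sketch combining two overrides on message (the prompt and values are illustrative):

```rill
openai::message("Explain TCP handshakes", [
  system: "Answer in one sentence.",
  max_tokens: 100,
]) => $result
```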
Result Dict
All functions except embed and embed_batch return:
| Field | Type | Description |
|---|---|---|
| content | string | Response text |
| model | string | Model identifier |
| usage.input | number | Input token count |
| usage.output | number | Output token count |
| stop_reason | string | Why generation stopped |
| id | string | Request identifier |
| messages | list | Conversation history |
The tool_loop result adds turns (number of LLM round-trips).
Error Behavior
Validation errors (before API call):
- Empty prompt → RuntimeError RILL-R004: prompt text cannot be empty
- Missing role → RuntimeError RILL-R004: message missing required 'role' field
- Invalid role → RuntimeError RILL-R004: invalid role '{value}'
- Missing content → RuntimeError RILL-R004: {role} message requires 'content'
- No embed_model → RuntimeError RILL-R004: embed_model not configured
- Missing tools → RuntimeError RILL-R004: tool_loop requires 'tools' option
API errors (from provider):
- Rate limit → RuntimeError RILL-R004: OpenAI: rate limit
- Auth failure → RuntimeError RILL-R004: OpenAI: authentication failed (401)
- Timeout → RuntimeError RILL-R004: OpenAI: request timeout
- Other → RuntimeError RILL-R004: OpenAI: {detail} ({status})
Tool loop errors:
- Unknown tool → RuntimeError RILL-R004: unknown tool '{name}'
- Error limit → RuntimeError RILL-R004: tool loop aborted after {n} consecutive errors
Events
| Event | Emitted When |
|---|---|
| openai:message | message() completes |
| openai:messages | messages() completes |
| openai:embed | embed() completes |
| openai:embed_batch | embed_batch() completes |
| openai:tool_loop | tool_loop() completes |
| openai:tool_call | Tool invoked during loop |
| openai:tool_result | Tool returns during loop |
| openai:error | Any operation fails |
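This page does not document the host-side subscription API, so the onEvent hook below is an assumption; adapt it to however your host actually receives runtime events. The event names and payload fields come from the tables above:

```ts
// Sketch only: 'onEvent' is an assumed hook, not a documented option.
const ctx = createRuntimeContext({
  functions,
  onEvent: (name: string, payload: Record<string, unknown>) => {
    if (name === 'openai:message') {
      console.log('tokens used:', payload.usage);  // usage.input / usage.output
    } else if (name === 'openai:error') {
      console.error('LLM call failed:', payload);
    }
  },
});
```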
Test Host
A runnable example at packages/ext/openai/examples/test-host.ts demonstrates integration:
```bash
# Set API key
export OPENAI_API_KEY="sk-..."
# Built-in demo
pnpm exec tsx examples/test-host.ts
# Inline expression
pnpm exec tsx examples/test-host.ts -e 'llm::message("Tell me a joke") -> $.content -> log'
# Script file
pnpm exec tsx examples/test-host.ts script.rill
```

Override model or endpoint with OPENAI_MODEL and OPENAI_BASE_URL. Works with any OpenAI-compatible server:
```bash
# LM Studio
OPENAI_BASE_URL=http://localhost:1234/v1 OPENAI_API_KEY=lm-studio OPENAI_MODEL=local pnpm exec tsx examples/test-host.ts
# Ollama
OPENAI_BASE_URL=http://localhost:11434/v1 OPENAI_API_KEY=ollama OPENAI_MODEL=llama3.2 pnpm exec tsx examples/test-host.ts
```

See Also
- Bundled Extensions — All shipped extensions
- Developing Extensions — Writing custom extensions
- Host Integration — Embedding API
- Reference — Language specification