이 페이지는 아직 한국어로 제공되지 않습니다. 번역 작업 중입니다.
현재 번역 프로젝트에 대한 질문이나 피드백이 있으신 경우
언제든지 연락주시기 바랍니다.Overview
Patterns automatically clusters your LLM application’s production traffic into meaningful topics, helping you understand what users are asking, identify coverage gaps, and diagnose failure modes.
You can create multiple named Patterns, each scoped to a different application, span type, or use case.
How it works
Patterns uses a mix of calls to your connected LLM provider account and text embeddings to give you an interpretable view of production behavior without manual tagging.
When you run a Pattern, it:
- Pulls LLM interactions from your production traffic based on your filter and sampling configuration
- Summarizes each interaction with AI-generated text
- Computes text embedding of these summaries using a self-hosted, open source model
- Forms clusters using machine learning (UMAP and HDBSCAN)
- Reviews each cluster and generates meaningful topics with AI-generated text
- Attributes each interaction to a single topic
- Builds a hierarchy using AI by grouping similar topics together
Each topic shows its interaction volume and share of total traffic. Interactions that don’t fit any cluster are collected into an Outliers group.
Set up a Pattern
- Click + New Pattern
- Enter a Pattern Name
- Under Clustering model, select your LLM Provider, Account, and Model — these are used to generate topic names, summaries, topic hierarchy, and to attribute each interaction to a topic.
- Under Scope, configure:
- Time window: The lookback period for interactions to analyze
- Which spans do you want to cluster?: Filter by application, environment, span type, or other tags to scope the Pattern to a specific slice of traffic.
- Sampling Rate: The percentage of matching interactions to include. Patterns processes up to 10,000 records per run; if your filter matches more than that, records are randomly sampled down to the cap.
- Under What should we detect Patterns on?, enter a template that defines what gets sent to the model for analysis. Use {{variable}} syntax to reference any span field — for example, {{meta.input.value}} to analyze patterns by user input, or {{meta.span.kind}} to analyze by span kind. Click Template Examples to see common configurations. As you type, the right panel previews matching spans and shows what percentage of interactions have values for the variables you’ve referenced.
- Click Save
Explore your Patterns
Use the dropdown in the header to switch between your named Patterns. Each Pattern shows results from its most recent run.
Read the summary metrics
The top of the Patterns page shows three metrics from your most recent run:
- Total interactions: How many interactions were analyzed
- Identified topics: The total number of distinct topics found, including parent and child topics
- Classified: The percentage of analyzed interactions assigned to a named topic — interactions in Outliers count as unclassified
Visualize patterns by dimension
Above the topic table, a scatter plot compares your patterns against each other. Each bubble represents one topic, with the Y axis showing the number of interactions and the X axis showing the metric selected in the Dimension dropdown (for example, total errors). Use this chart to spot outliers — topics with unexpectedly high error rates or latency relative to their volume.
Navigate the topic list
The topic table provides a hierarchical view of all discovered topics. Each topic shows:
- Pattern — auto-generated name and description based on the interactions in the cluster
- Interactions — count and percentage of total traffic
- Cost — estimated LLM cost for interactions in this topic
- Tokens — token usage for interactions in this topic
- Errors — error count and rate
- Latency — median latency for interactions in this topic
- Online Evals — evaluation results if online evaluations are configured
Expand parent topics to see their sub-topics and examine specific areas of your application’s traffic.
Drill into a topic
Click any topic name to open the detail view. The detail view shows a summary of what the topic represents, the total interaction count, and an interactions table with the child topic label, input text, and timestamp for each interaction. Search the table by keyword to find specific examples.
Export and act on interactions
From the interactions table inside a topic’s detail view, you can act on the interactions in that cluster:
- Download as CSV: Export the interactions as a CSV file.
- Add to Dataset: Send the interactions to a Dataset to build evaluation test cases from real production traffic.
- Add to Queue: Send the interactions to an Annotation Queue for human review and labeling.
Trigger a new run
To analyze your production traffic, click Run analysis in the Patterns header. The pipeline runs in the background and takes 5 to 10 minutes. You can close the page and return later — the header shows the last run date and lookback period when the run completes.
If a run fails, a modal explains the cause and what action to take. The page continues to display results from the most recent successful run while the failed run is shown in the header.
Use topics to improve your application
Understand your production traffic
Use the topic list to see what users are actually doing with your application.
Use traffic percentage to identify your most common use cases. The parent-child hierarchy helps you move from a high-level pattern down to the specific sub-patterns underneath.
Find evaluation coverage gaps
Compare your topic distribution against what your golden datasets actually cover. Look at topics that represent high production volume but have no corresponding evaluation cases: this is where your test coverage has gaps, and where model regressions are least likely to be caught before they reach users.
Diagnose failure patterns
Scope your Pattern’s filter to spans with poor quality scores or failed evaluations, then run the analysis. The resulting topic taxonomy shows which types of requests are failing most, giving you a structured way to prioritize fixes instead of debugging trace by trace.
Track how traffic evolves
Re-run your Pattern periodically and use the Compare to dropdown to compare topic distributions across runs. When a topic marked NEW appears near the top, that signals your users have found a new use case or a new failure mode.
Further Reading