OVOS Intent Pipeline

The OpenVoiceOS (OVOS) Intent Pipeline is a modular and extensible system designed to interpret user utterances and map them to appropriate actions or responses.

It orchestrates various intent parsers and fallback mechanisms to ensure accurate and contextually relevant responses.

What is an Intent Pipeline?

An intent pipeline in OVOS is a sequence of processing stages that analyze user input to determine the user's intent. Each stage employs different strategies, ranging from high-confidence intent parsers to fallback mechanisms, to interpret the input.

This layered approach ensures that OVOS can handle a wide range of user queries with varying degrees of specificity and complexity.

Pipeline Structure

OVOS pipelines are structured to prioritize intent matching based on confidence levels:

High Confidence: Primary intent parsers that provide precise matches.
Medium Confidence: Secondary parsers that handle less specific queries.
Low Confidence: Fallback mechanisms for ambiguous or unrecognized inputs.

Each component in the pipeline is a plugin that can be enabled, disabled, or reordered according to user preferences. This flexibility allows for customization based on specific use cases or device capabilities.

Available Pipeline Components

Below is a list of available pipeline components, categorized by their confidence levels and functionalities:

High Confidence Components

Pipeline	Description	Notes
`stop_high`	Exact match for stop commands	Replaces skill-ovos-stop
`converse`	Continuous conversation interception for skills
`padatious_high`	High-confidence matches using Padatious
`adapt_high`	High-confidence matches using Adapt
`fallback_high`	High-priority fallback skill matches
`ocp_high`	High-confidence media-related queries
`ovos-persona-pipeline-plugin-high`	Active persona conversation (e.g., LLM integration)
`ovos-m2v-pipeline-high`	Multilingual intent classifier capable of handling paraphrasing	Only supports default skills, not dynamic

Medium Confidence Components

Pipeline	Description	Notes
`stop_medium`	Medium-confidence stop command matches	Replaces skill-ovos-stop
`padatious_medium`	Medium-confidence matches using Padatious
`adapt_medium`	Medium-confidence matches using Adapt
`ocp_medium`	Medium-confidence media-related queries
`fallback_medium`	Medium-priority fallback skill matches
`ovos-m2v-pipeline-medium`	Multilingual intent classifier capable of handling paraphrasing	Only supports default skills, not dynamic

Low Confidence Components

Pipeline	Description	Notes
`stop_low`	Low-confidence stop command matches	Disabled by default
`padatious_low`	Low-confidence matches using Padatious	Often inaccurate; disabled by default
`adapt_low`	Low-confidence matches using Adapt
`ocp_low`	Low-confidence media-related queries
`fallback_low`	Low-priority fallback skill matches
`common_query`	Sends utterance to common_query skills	Selects the best match among available skills
`ovos-persona-pipeline-plugin-low`	Persona catch-all fallback (e.g., LLM integration)
`ovos-m2v-pipeline-low`	Multilingual intent classifier capable of handling paraphrasing	Only supports default skills, not dynamic

Customizing the Pipeline

OVOS allows users to customize the intent pipeline through configuration files. Users can enable or disable specific components, adjust their order, and set confidence thresholds to tailor the system's behavior to their needs. This customization ensures that OVOS can be optimized for various applications, from simple command recognition to complex conversational agents.

  "intents": {
    "adapt": {
      "conf_high": 0.5,
      "conf_med": 0.3,
      "conf_low": 0.2
    },
    "persona": {
      "handle_fallback": true,
      "default_persona": "Remote Llama"
    },
    "pipeline": [
      "ovos-m2v-pipeline-high",
      "ocp_high",
      "stop_high",
      "converse",
      "padatious_high",
      "adapt_high",
      "stop_medium",
      "adapt_medium",
      "common_qa",
      "fallback_medium",
      "fallback_low"
    ]
  },

By understanding and configuring the OVOS Intent Pipeline, developers and users can enhance the accuracy and responsiveness of their voice assistant applications.