Audio data as a raw Buffer (WAV, MP3, OGG, etc.). The format must be supported by the configured STT provider.
Optional metadataOptional metadata to attach to the indexed document. Stored alongside the embedding for filtering during search.
Optional collectionVector store collection to index into.
'multimodal'
Optional languageBCP-47 language hint for the STT provider (e.g. 'en', 'es', 'ja'). Improves transcription accuracy for non-English audio.
Options for indexing an audio file into the vector store.
The audio is transcribed via an STT provider, then the transcript is embedded and stored alongside the original audio reference.
Example