Total video duration in seconds.
Number of scenes detected.
Ordered list of all detected scenes with rich descriptions.
Overall summary of the video content, generated by an LLM from the scene descriptions and transcript.
Optional fullFull transcript of all audio in the video, when transcription was enabled. Concatenation of all scene transcripts with timestamp markers.
Optional ragIDs of RAG vector store chunks created from this analysis. Only populated when VideoAnalyzeRequestRich.indexForRAG was enabled.
Additional metadata about the analyzed video. Provider-specific information that doesn't fit into the structured fields above.
Rich video analysis result with full scene descriptions, summary, transcript, and optional RAG chunk references.
This is a richer version of the base VideoAnalysis type that includes SceneDescription scenes (with cut types, confidence, key frames), a generated summary, and optional RAG indexing metadata.
Example