Raw image bytes (PNG, JPEG, WebP, etc.).
MIME type of data.
For example: 'image/png' | 'image/jpeg'
Optional caption: Auto-generated or OCR-derived caption.
Present when a vision LLM is configured and extractImages is set to true.
Optional page: Page number the image appears on (1-based, PDF/DOCX).
Optional embedding: Dense embedding of the image caption or visual content. Only present when embeddings were computed during extraction.
An image extracted from a document during ingestion.
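The fields described here could be modeled roughly as follows. This is a minimal sketch, not the library's actual declaration: the interface name `ExtractedImage`, the property names `data` and `mimeType`, and the concrete types (`Uint8Array`, `number[]`) are assumptions, since the listing only names `caption`, `page`, and `embedding`. The helper `describeImage` is hypothetical and exists only to show how the optional fields might be handled.

```typescript
// Hypothetical shape of an extracted image. Only `caption`, `page`, and
// `embedding` are named in the reference; everything else is assumed.
interface ExtractedImage {
  /** Raw image bytes (PNG, JPEG, WebP, etc.). Type is an assumption. */
  data: Uint8Array;
  /** MIME type of `data`, e.g. 'image/png' or 'image/jpeg'. */
  mimeType: string;
  /** Auto-generated or OCR-derived caption, if one was produced. */
  caption?: string;
  /** 1-based page number (PDF/DOCX). */
  page?: number;
  /** Dense embedding, only when computed during extraction. */
  embedding?: number[];
}

// Hypothetical helper: builds a one-line summary and falls back to
// placeholders when the optional fields are absent.
function describeImage(img: ExtractedImage): string {
  const where = img.page !== undefined ? `page ${img.page}` : "unknown page";
  const caption = img.caption ?? "(no caption)";
  return `${img.mimeType}, ${img.data.byteLength} bytes, ${where}: ${caption}`;
}
```

Because `caption`, `page`, and `embedding` are optional, callers should check for `undefined` rather than assume they are set, as the helper does.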