Indicates whether speech is currently detected in the frame.
The calculated energy (RMS) of the current audio frame.
The speech detection threshold used for this frame, adapted from the noise profile.
The silence detection threshold used for this frame.
Optional confidenceConfidence score (0-1) in the isSpeech detection. Can be basic for now.
Represents the result of VAD processing for an audio frame.