Ensemble detector (--detector ensemble)ΒΆ
The recommended detector. Combines multiple detectors and uses a voting algorithm to pick the most confident classification for each segment.
UsageΒΆ
The --detector auto option also routes to ensemble.
How it worksΒΆ
flowchart LR
A[Audio] --> B[FFmpeg astats]
A --> C[Librosa spectral]
A --> D[INA CNN\nif installed]
A --> E[Demucs stems\nif --demix]
B --> F[Ensemble decision\nconflict resolution]
C --> F
D --> F
E --> F
F --> G[Classified blocks\nwith confidence]
Conflict resolution rules: - If INA is available β INA has highest priority for speech/music boundaries - If Demucs is used β demix scores refine singing vs talking-over-music - FFmpeg provides base signal statistics for all segments
Confidence scoresΒΆ
Each block gets a confidence score (0β1). Low-confidence borders are re-classified using weighted voting from all available detectors.