Librosa spectral detector (--detector librosa)
Uses spectral features (RMS energy, spectral centroid, zero-crossing rate) from librosa to classify segments.
Usage
How it works
flowchart LR
A[Audio frame] --> B[RMS energy dB]
A --> C[Spectral centroid]
A --> D[Zero-crossing rate]
B --> E[Classify frame]
C --> E
D --> E
E --> F{speech, music,\nsilence, singing?}
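The flow above can be sketched in Python as follows. This is a minimal illustration, not the detector's actual implementation: the names `classify_frame` and `classify_file`, the threshold constants, and the decision order are all hypothetical, and the real detector may use different values or smooth results across frames. Only the librosa calls (`librosa.load`, `librosa.feature.rms`, `librosa.amplitude_to_db`, `librosa.feature.spectral_centroid`, `librosa.feature.zero_crossing_rate`) are real library functions.

```python
from typing import List

# Hypothetical thresholds, chosen for illustration only.
SILENCE_DB = -50.0          # frames quieter than this are treated as silence
SPEECH_ZCR = 0.10           # high zero-crossing rate suggests noisy consonants
MUSIC_CENTROID_HZ = 2500.0  # bright spectrum suggests instrumental music


def classify_frame(rms_db: float, centroid_hz: float, zcr: float) -> str:
    """Label one frame from its three features (assumed rule order)."""
    if rms_db < SILENCE_DB:
        return "silence"
    if zcr > SPEECH_ZCR:
        return "speech"
    if centroid_hz > MUSIC_CENTROID_HZ:
        return "music"
    return "singing"


def classify_file(path: str, frame_length: int = 2048,
                  hop_length: int = 512) -> List[str]:
    """Extract per-frame features with librosa and label every frame."""
    import librosa  # imported lazily so classify_frame works without it

    y, sr = librosa.load(path, sr=None, mono=True)
    rms = librosa.feature.rms(
        y=y, frame_length=frame_length, hop_length=hop_length)[0]
    # Convert RMS amplitude to dB relative to full scale (ref=1.0).
    rms_db = librosa.amplitude_to_db(rms, ref=1.0, top_db=None)
    centroid = librosa.feature.spectral_centroid(
        y=y, sr=sr, n_fft=frame_length, hop_length=hop_length)[0]
    zcr = librosa.feature.zero_crossing_rate(
        y, frame_length=frame_length, hop_length=hop_length)[0]
    return [
        classify_frame(float(r), float(c), float(z))
        for r, c, z in zip(rms_db, centroid, zcr)
    ]
```

Because every step is a per-frame threshold comparison, no model inference is needed. A rule set like this also has no way to represent two sources at once, which is one reason speech over music can come out as a single label.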
Strengths
- No extra install (librosa is installed as a base dependency)
- Fast: runs on CPU without ML inference
- Works fine for clean recordings
Weaknesses
- Less accurate than ensemble or INA on noisy or complex audio
- May misclassify speech over music as pure music