🎙️ PraisonAI Editor

Ensemble detector (`--detector ensemble`)

The recommended detector. Combines multiple detectors and uses a voting algorithm to pick the most confident classification for each segment.

Usage

praisonai-editor edit file.mp3 --preset songs_only --detector ensemble

The `--detector auto` option also routes to the ensemble detector.

How it works

flowchart LR
    A[Audio] --> B[FFmpeg astats]
    A --> C[Librosa spectral]
    A --> D[INA CNN\nif installed]
    A --> E[Demucs stems\nif --demix]
    B --> F[Ensemble decision\nconflict resolution]
    C --> F
    D --> F
    E --> F
    F --> G[Classified blocks\nwith confidence]

Conflict resolution rules:

- If INA is available → INA has highest priority for speech/music boundaries
- If Demucs is used → demix scores refine singing vs talking-over-music
- FFmpeg provides base signal statistics for all segments
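
The priority rules above can be sketched in Python roughly as follows. This is an illustrative sketch only, not the package's actual code: the `DetectorVotes` class, the `resolve` function, the label strings, and the placement of Librosa between INA and FFmpeg are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class DetectorVotes:
    """Labels proposed for one segment (hypothetical structure).

    None means the detector did not run for this segment.
    """
    ffmpeg: str                    # always present: base signal statistics
    librosa: Optional[str] = None  # spectral classification, if available
    ina: Optional[str] = None      # INA CNN label, only if INA is installed
    demix: Optional[str] = None    # e.g. "singing" or "speech_over_music", only with --demix


def resolve(votes: DetectorVotes) -> Tuple[str, Optional[str]]:
    """Return (base_label, refinement) for one segment."""
    # INA has highest priority for the speech/music decision.
    if votes.ina is not None:
        base = votes.ina
    # Assumption: Librosa ranks above the FFmpeg heuristic when INA is absent.
    elif votes.librosa is not None:
        base = votes.librosa
    else:
        base = votes.ffmpeg
    # Demix output is kept as a separate refinement so that it never
    # overrides the speech/music decision made above.
    return base, votes.demix
```

For example, `resolve(DetectorVotes(ffmpeg="music", ina="speech"))` yields `("speech", None)`, since the INA label wins whenever it is present.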

Confidence scores

Each block gets a confidence score between 0 and 1. Low-confidence boundaries are re-classified using weighted voting across all available detectors.
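
One way to picture the weighted vote is the sketch below. It is an assumption-laden illustration rather than the package's implementation: the `WEIGHTS` table, the `weighted_vote` and `reclassify_if_low` helpers, and the 0.6 threshold are hypothetical and chosen only to make the idea concrete.

```python
from collections import defaultdict
from typing import Dict, Tuple

# Hypothetical per-detector weights; the real values are not documented here.
WEIGHTS = {"ina": 3.0, "librosa": 2.0, "ffmpeg": 1.0}


def weighted_vote(votes: Dict[str, Tuple[str, float]]) -> Tuple[str, float]:
    """Combine detector votes into one label and a confidence in [0, 1].

    `votes` maps a detector name to (label, confidence in [0, 1]).
    """
    totals: Dict[str, float] = defaultdict(float)
    for detector, (label, confidence) in votes.items():
        totals[label] += WEIGHTS.get(detector, 1.0) * confidence
    grand_total = sum(totals.values())
    if grand_total == 0:
        raise ValueError("no usable votes")
    best = max(totals, key=lambda label: totals[label])
    # The winning label's share of the total weighted score is its confidence.
    return best, totals[best] / grand_total


def reclassify_if_low(
    label: str,
    confidence: float,
    votes: Dict[str, Tuple[str, float]],
    threshold: float = 0.6,  # hypothetical cut-off for "low confidence"
) -> Tuple[str, float]:
    """Keep a confident classification; otherwise fall back to the vote."""
    if confidence >= threshold:
        return label, confidence
    return weighted_vote(votes)
```

With votes of `("speech", 0.9)` from INA, `("music", 0.6)` from Librosa, and `("music", 0.5)` from FFmpeg, the weighted scores are 2.7 for speech and 1.7 for music, so the block would be labelled speech.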