Skip to content

🎙️ PraisonAI Editor

INA Speech Segmenter

praisonai-editor

🎙️ PraisonAI Editor

praisonai-editor

🏠 Home
🚀 Quick Start
CLI Commands
CLI Commands
- probe
- convert
- transcribe
- plan
- edit
🎛️ Presets
🎛️ Presets
- Overview
- podcast
- meeting
- course
- clean
- songs_only
- speech_only
- no_silence
🔍 Content Detection
🔍 Content Detection
- Overview
- Ensemble (auto)
- INA Speech Segmenter INA Speech Segmenter
  Table of contents
- Librosa Spectral
- FFmpeg Heuristic
🎵 Stem Separation (Demix)
🎵 Stem Separation (Demix)
🤖 AI Agent Editing
🤖 AI Agent Editing
- Prompt-based Edit
- Agent Tools
🐍 Python API
🐍 Python API
🔌 Extending (Protocols)
🔌 Extending (Protocols)
📦 Artifacts & Cache
📦 Artifacts & Cache
- Overview
⚙️ Installation

INA Speech Segmenter (`--detector ina`)¶

Uses a CNN (Convolutional Neural Network) trained on broadcast media to detect speech vs music vs noise vs silence.

Install¶

pip install "praisonai-editor[detect]"

Usage¶

praisonai-editor edit file.mp3 --preset speech_only --detector ina

Strengths¶

High accuracy for speech vs music boundary detection
Handles noisy environments well (broadcast quality)
Good at detecting short speech segments within music

Weaknesses¶

Slower (CNN inference)
Cannot distinguish singing from music (both labeled as music)
Use with --demix to get singing classification

Content types returned¶

INA label	Mapped to
`speech`	`speech`
`music`	`music`
`noEnergy`	`silence`