PraisonAI EditorΒΆ
AI-powered audio & video editor β transcribe, clean, and edit media with one command.
-
Transcribe
Word-level timestamps in SRT, TXT, or JSON. OpenAI Whisper + local faster-whisper.
-
Edit
Remove fillers, repetitions, silences, or keep only singing. One command.
-
Stem Separation
Powered by Demucs β isolate vocals from instruments to find singing zones.
-
AI Agent
"Remove the intro and any off-topic discussion about weather" β plain English.
How it worksΒΆ
flowchart LR
A[π΅ Input\nMP3 Β· MP4 Β· WAV] --> B[Probe\nmetadata]
B --> C[Transcribe\nWhisper API]
C --> D[Detect\nContent blocks]
D --> E[Plan\nkeep / remove]
E --> F[Render\nFFmpeg]
F --> G[π§ Output\nedited file]
C -.->|cache| C
D -.->|Demucs stems cache| D
Install in 30 secondsΒΆ
Feature overviewΒΆ
| Feature | Command | Extra install? |
|---|---|---|
| Probe metadata | probe |
No |
| Convert format | convert |
No |
| Transcribe (Whisper) | transcribe |
No |
| Create edit plan | plan |
No |
| Edit (podcast/meeting/courseβ¦) | edit |
No |
| Content detection (ensemble) | edit --detector ensemble |
No |
| INA speech segmenter | edit --detector ina |
[detect] |
| Stem separation (Demucs) | edit --demix |
[demix] |
| Primary singing zone crop | edit --demix --primary-zone |
[demix] |
| AI agent editing | edit --prompt "β¦" |
No |
| Local Whisper | edit --local |
[local] |