Skip to content

PraisonAI EditorΒΆ

AI-powered audio & video editor β€” transcribe, clean, and edit media with one command.

  • Transcribe


    Word-level timestamps in SRT, TXT, or JSON. OpenAI Whisper + local faster-whisper.

    transcribe

  • Edit


    Remove fillers, repetitions, silences, or keep only singing. One command.

    edit

  • Stem Separation


    Powered by Demucs β€” isolate vocals from instruments to find singing zones.

    demix

  • AI Agent


    "Remove the intro and any off-topic discussion about weather" β€” plain English.

    prompt edit


How it worksΒΆ

flowchart LR
    A[🎡 Input\nMP3 · MP4 · WAV] --> B[Probe\nmetadata]
    B --> C[Transcribe\nWhisper API]
    C --> D[Detect\nContent blocks]
    D --> E[Plan\nkeep / remove]
    E --> F[Render\nFFmpeg]
    F --> G[🎧 Output\nedited file]
    C -.->|cache| C
    D -.->|Demucs stems cache| D

Install in 30 secondsΒΆ

pip install praisonai-editor
export OPENAI_API_KEY=sk-...
praisonai-editor edit podcast.mp3 -v

Feature overviewΒΆ

Feature Command Extra install?
Probe metadata probe No
Convert format convert No
Transcribe (Whisper) transcribe No
Create edit plan plan No
Edit (podcast/meeting/course…) edit No
Content detection (ensemble) edit --detector ensemble No
INA speech segmenter edit --detector ina [detect]
Stem separation (Demucs) edit --demix [demix]
Primary singing zone crop edit --demix --primary-zone [demix]
AI agent editing edit --prompt "…" No
Local Whisper edit --local [local]