SoftWhisper simplifies audio and video transcription using the Whisper model


SoftWhisper simplifies audio and video transcription using the powerful Whisper model. You can easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and specify start and end times for targeted segments.
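To make those options concrete, here is a minimal sketch of how language, task, beam size, and a start/end window could be passed to Whisper from Python. It is not SoftWhisper's own code: it assumes the `openai-whisper` package, and the helper names (`build_transcribe_options`, `clip_range`, `transcribe_segment`) are hypothetical.

```python
SAMPLE_RATE = 16000  # whisper.load_audio() resamples audio to 16 kHz mono


def build_transcribe_options(language=None, task="transcribe", beam_size=5):
    """Collect the decoding options described above into keyword arguments."""
    if task not in ("transcribe", "translate"):
        raise ValueError("task must be 'transcribe' or 'translate'")
    if beam_size < 1:
        raise ValueError("beam_size must be at least 1")
    options = {"task": task, "beam_size": beam_size}
    if language is not None:  # leaving it unset lets Whisper auto-detect
        options["language"] = language
    return options


def clip_range(start_s, end_s, total_samples, sample_rate=SAMPLE_RATE):
    """Convert start/end times in seconds to clamped sample indices."""
    start = max(0, int(start_s * sample_rate))
    end = total_samples if end_s is None else int(end_s * sample_rate)
    end = min(end, total_samples)
    if start >= end:
        raise ValueError("start time must be before end time")
    return start, end


def transcribe_segment(path, model_name="base", start_s=0.0, end_s=None, **kwargs):
    """Transcribe only the selected part of a file (hypothetical helper)."""
    import whisper  # assumption: installed via `pip install openai-whisper`

    model = whisper.load_model(model_name)
    audio = whisper.load_audio(path)
    lo, hi = clip_range(start_s, end_s, len(audio))
    result = model.transcribe(audio[lo:hi], **build_transcribe_options(**kwargs))
    return result["text"]
```

A call such as `transcribe_segment("talk.mp4", model_name="small", start_s=60, end_s=120, language="en", beam_size=5)` would transcribe only the second minute of the file. A larger beam size generally trades speed for a wider search over candidate transcriptions.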

🎯 High-accuracy transcription (using Whisper model)
👥 Speaker identification
🌍 Supports all languages supported by the Whisper model (30+)
🎮 User-friendly GUI

  1. Run SoftWhisper.bat: .\SoftWhisper.bat

When the GUI launches, follow these steps for transcription (screenshot credits: Sunwood-ai-labs):

SoftWhisper interface – Credits to Sunwood-ai-labs

  1. Select an audio/video file.
  2. Choose a model size (tiny, base, small, medium, large).
  3. Enable speaker diarization if needed.
  4. Click the "Start" button.

Common issues and how to solve them

  1. libvlc.dll not found error

    • Check that VLC media player is installed. If it is not, download it from https://www.videolan.org/
    • Restart the program after installation.
  2. FFmpeg or corresponding library not found
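Both errors above come down to an external dependency that cannot be found. The following is a small diagnostic sketch, not part of SoftWhisper, that checks for libvlc and FFmpeg before launching. The directories in `DEFAULT_VLC_DIRS` are the usual VLC install locations on Windows and are an assumption about your system; the function names are hypothetical.

```python
import ctypes.util
import os
import shutil

# Assumed default VLC install directories on Windows; adjust as needed.
DEFAULT_VLC_DIRS = [
    r"C:\Program Files\VideoLAN\VLC",
    r"C:\Program Files (x86)\VideoLAN\VLC",
]


def find_libvlc(extra_dirs=()):
    """Return a path or library name for libvlc, or None if it is not found."""
    for directory in list(extra_dirs) + DEFAULT_VLC_DIRS:
        candidate = os.path.join(directory, "libvlc.dll")
        if os.path.isfile(candidate):
            return candidate
    # Fall back to the platform's own library search (PATH, ldconfig, ...).
    return ctypes.util.find_library("vlc") or ctypes.util.find_library("libvlc")


def find_ffmpeg():
    """Return the full path of the ffmpeg executable on PATH, or None."""
    return shutil.which("ffmpeg")


def report_missing(libvlc, ffmpeg):
    """List the dependencies that could not be located."""
    missing = []
    if libvlc is None:
        missing.append("VLC (libvlc)")
    if ffmpeg is None:
        missing.append("FFmpeg")
    return missing


if __name__ == "__main__":
    missing = report_missing(find_libvlc(), find_ffmpeg())
    if missing:
        print("Missing dependencies: " + ", ".join(missing))
    else:
        print("VLC and FFmpeg were both found.")
```

If the script reports FFmpeg as missing even though it is installed, the likely cause is that its folder is not on the `PATH` environment variable, since `shutil.which` only searches there.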
