Common Workflows

Step-by-step recipes for the most common tasks.

Quick Transcription (Simplest)

The fastest path from video to subtitles.

1. Launch the WhisperJAV GUI
2. Drag a video file onto the app
3. Leave all defaults (Balanced mode, Aggressive sensitivity, Japanese)
4. Click Start
5. The SRT file appears next to your video

Time: ~2-5 minutes for a 2-hour video with a modern GPU.

High-Quality Transcription (Ensemble)

Uses two different ASR engines and merges their output for best accuracy.

1. Go to the Ensemble tab
2. Pass 1: Balanced pipeline (default)
3. Enable Pass 2: Check the Pass 2 checkbox
4. Pass 2: Select Qwen3-ASR
5. Merge Strategy: Smart Merge
6. Click Start

Both passes run, and Smart Merge then combines their results. Expect roughly 2x the time of a single pass.
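The timing figures on this page can be turned into a rough estimate for other video lengths. The sketch below is plain arithmetic, not part of WhisperJAV: it takes the Quick Transcription figure (2-5 minutes for a 2-hour video on a modern GPU) and assumes that time scales linearly with video length and with the number of passes. The function name `estimate_minutes` and the constants are illustrative.

```python
# Reference figures quoted on this page: 2-5 minutes for a 120-minute video on a modern GPU.
REFERENCE_VIDEO_MINUTES = 120
REFERENCE_LOW_MINUTES = 2
REFERENCE_HIGH_MINUTES = 5


def estimate_minutes(video_minutes: float, passes: int = 1) -> tuple[float, float]:
    """Return a (low, high) processing-time estimate in minutes.

    Assumption: time grows linearly with video length and with the number
    of passes (an Ensemble run with Pass 2 enabled counts as 2 passes).
    """
    scale = video_minutes * passes / REFERENCE_VIDEO_MINUTES
    return (REFERENCE_LOW_MINUTES * scale, REFERENCE_HIGH_MINUTES * scale)


print(estimate_minutes(120))            # single pass: (2.0, 5.0)
print(estimate_minutes(120, passes=2))  # ensemble:    (4.0, 10.0)
```

Actual times depend on the GPU, the selected models, and the audio itself, so treat the result as a ballpark only.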

ChronosJAV Pipeline (Anime/JAV Content)

Dedicated pipeline using models trained on anime and JAV dialogue.

1. Go to the Ensemble tab
2. Pass 1 Pipeline: Select ChronosJAV
3. Model: Choose from:
   - anime-whisper (~4GB): best quality for anime/JAV
   - Kotoba v2.1 (~2GB): lighter, with punctuation
   - Kotoba v2.0 (~2GB): lighter, no punctuation
4. Click Start

Tip

For maximum quality, use anime-whisper in Pass 1 and Qwen3-ASR in Pass 2 with Smart Merge.

Transcribe + Translate in One Step

Get translated subtitles without a separate step.

1. Configure your transcription (any tab)
2. On the Ensemble tab, check "AI-translate"
3. Select your translation provider (DeepSeek, Gemini, etc.)
4. Enter your API key if needed
5. Click Start

Transcription runs first, then translation happens automatically on the result.

Translate an Existing SRT File

Use Tab 4 to translate subtitles you already have.

1. Go to the AI SRT Translate tab (Tab 4)
2. Click Add File(s) and select your .srt file
3. Select a Provider and Model
4. Enter your API key and click Test Connection
5. Set Target Language (e.g., English)
6. Choose Tone: Standard or Adult-Explicit
7. Click Start

Note

The Adult-Explicit tone uses specialized instructions tuned for JAV dialogue with appropriate vocabulary.
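Translating an SRT file changes the dialogue text while the cue numbers and timestamps stay the same. The sketch below illustrates that idea in Python. It is not WhisperJAV's translation code: `translate_srt` is a hypothetical helper, and the `translate` callable stands in for whatever provider call (with its tone instructions) does the real work.

```python
import re
from typing import Callable


def translate_srt(srt_text: str, translate: Callable[[str], str]) -> str:
    """Translate only the dialogue of each SRT cue; keep index and timing lines."""
    # Cues are separated by one or more blank lines.
    blocks = re.split(r"\n\s*\n", srt_text.strip())
    out = []
    for block in blocks:
        lines = block.splitlines()
        # A well-formed cue is: index line, timing line, one or more text lines.
        if len(lines) < 3 or "-->" not in lines[1]:
            out.append(block)  # leave anything unexpected untouched
            continue
        index, timing = lines[0], lines[1]
        text = "\n".join(lines[2:])
        out.append("\n".join([index, timing, translate(text)]))
    return "\n\n".join(out) + "\n"


# Usage with a stand-in "translator" (a real one would call an API):
sample = "1\n00:00:01,000 --> 00:00:02,500\nこんにちは\n"
print(translate_srt(sample, lambda text: {"こんにちは": "Hello"}.get(text, text)))
```

Passing the translator in as a callable keeps the timing logic independent of any particular provider.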

Batch Processing (Multiple Files)

Process an entire folder of videos at once.

1. Click Add Folder and select a folder containing videos
2. All media files are added to the list
3. Configure your pipeline settings
4. Click Start

Files are processed sequentially. Each output SRT is saved next to its source video (or to your chosen output directory).
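To preview what a batch run will produce, the folder scan and the output-location rule can be sketched in Python. This is an illustration under assumptions, not WhisperJAV's own logic: the extension list, the `plan_batch` name, and the `<video name>.srt` naming are all hypothetical, and the real output file name may differ.

```python
from pathlib import Path
from typing import Optional

# Illustrative list only; WhisperJAV's accepted media types may differ.
MEDIA_EXTENSIONS = {".mp4", ".mkv", ".avi", ".mov", ".wmv", ".mp3", ".wav", ".m4a"}


def plan_batch(folder: Path, output_dir: Optional[Path] = None) -> list[tuple[Path, Path]]:
    """Pair each media file in `folder` with the SRT path it would be written to.

    With no output_dir, the SRT goes next to its source file; otherwise it
    goes into output_dir. Files are listed in sorted (sequential) order.
    """
    jobs = []
    for media in sorted(folder.iterdir()):
        if media.is_file() and media.suffix.lower() in MEDIA_EXTENSIONS:
            target_dir = output_dir if output_dir is not None else media.parent
            jobs.append((media, target_dir / f"{media.stem}.srt"))
    return jobs
```

Calling `plan_batch(Path("videos"))` lists every source file with its expected subtitle path, which makes it easy to spot files the run would skip.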

Serial Ensemble Mode

In Ensemble mode, enable Serial mode to complete each file fully (Pass 1 → Pass 2 → Merge) before starting the next. This lets you see results as they finish instead of waiting for the entire batch.
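The difference in ordering can be shown with a small Python sketch. It assumes that, with Serial mode off, each stage runs across the whole batch before the next stage begins; that ordering is inferred from the description above rather than confirmed, and `job_order` is a hypothetical name.

```python
STAGES = ["Pass 1", "Pass 2", "Merge"]


def job_order(files: list[str], serial: bool) -> list[tuple[str, str]]:
    """Return (file, stage) pairs in the order they would run."""
    if serial:
        # Finish every stage for one file before moving to the next file.
        return [(f, stage) for f in files for stage in STAGES]
    # Assumed non-serial order: run each stage over the whole batch in turn.
    return [(f, stage) for stage in STAGES for f in files]


for file, stage in job_order(["a.mp4", "b.mp4"], serial=True):
    print(file, stage)
```

With Serial mode on, the first file's Merge is the third step; under the assumed non-serial order it would not happen until every file has finished both passes.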

CPU-Only Mode (No GPU)

WhisperJAV works without a GPU, only more slowly.

1. Go to the Advanced tab
2. Check "Accept CPU-only mode"
3. Use Faster mode for the best speed without a GPU
4. Click Start

Warning

CPU mode is 5-10x slower than GPU mode. A 2-hour video may take 30-60 minutes.

WebVTT Output

Generate VTT subtitles for HTML5 video players.

1. Go to the Advanced tab
2. Set Output Format to VTT or Both (SRT + VTT)
3. Run your transcription as normal

The .vtt file is saved alongside the .srt file.
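For reference, the two formats are close relatives: a WebVTT file starts with a `WEBVTT` header line, and its timestamps use a period instead of a comma before the milliseconds. The Python sketch below converts basic SRT text on that basis. It is a minimal illustration of the format difference, not WhisperJAV's own VTT writer, and `srt_to_vtt` is a hypothetical helper name.

```python
import re

# Matches the "HH:MM:SS,mmm" timestamps used in SRT timing lines.
_SRT_TIMESTAMP = re.compile(r"(\d{2}:\d{2}:\d{2}),(\d{3})")


def srt_to_vtt(srt_text: str) -> str:
    """Convert basic SRT text to WebVTT text."""
    # Drop a leading byte-order mark and normalize line endings.
    body = srt_text.lstrip("\ufeff").replace("\r\n", "\n").strip()
    lines = []
    for line in body.split("\n"):
        if "-->" in line:
            # Only timing lines are rewritten; dialogue is left as-is.
            line = _SRT_TIMESTAMP.sub(r"\1.\2", line)
        lines.append(line)
    return "WEBVTT\n\n" + "\n".join(lines) + "\n"


print(srt_to_vtt("1\n00:00:01,000 --> 00:00:02,500\nHello\n"))
```

The numeric cue indices from the SRT are kept, since WebVTT allows them as optional cue identifiers.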