Common Workflows¶
Step-by-step recipes for the most common tasks.
Quick Transcription (Simplest)¶
The fastest path from video to subtitles.
- Launch WhisperJAV GUI
- Drag a video file onto the app
- Leave all defaults (Balanced mode, Aggressive sensitivity, Japanese)
- Click Start
- SRT file appears next to your video
Time: ~2-5 minutes for a 2-hour video with a modern GPU.
High-Quality Transcription (Ensemble)¶
Uses two different ASR engines and merges their output for best accuracy.
- Go to the Ensemble tab
- Pass 1: Balanced pipeline (default)
- Enable Pass 2: Check the Pass 2 checkbox
- Pass 2: Select Qwen3-ASR
- Merge Strategy: Smart Merge
- Click Start
Both passes run, then results are intelligently combined. Takes roughly 2x the time of a single pass.
ChronosJAV Pipeline (Anime/JAV Content)¶
Dedicated pipeline using models trained on anime and JAV dialogue.
- Go to the Ensemble tab
- Pass 1 Pipeline: Select ChronosJAV
- Model: Choose from:
- anime-whisper (~4GB) — best quality for anime/JAV
- Kotoba v2.1 (~2GB) — lighter, with punctuation
- Kotoba v2.0 (~2GB) — lighter, no punctuation
- Click Start
Tip
For maximum quality, use anime-whisper in Pass 1 and Qwen3-ASR in Pass 2 with Smart Merge.
Transcribe + Translate in One Step¶
Get translated subtitles without a separate step.
- Configure your transcription (any tab)
- On the Ensemble tab, check "AI-translate"
- Select your translation provider (DeepSeek, Gemini, etc.)
- Enter your API key if needed
- Click Start
Transcription runs first, then translation happens automatically on the result.
Translate an Existing SRT File¶
Use Tab 4 to translate subtitles you already have.
- Go to the AI SRT Translate tab (Tab 4)
- Click Add File(s) and select your
.srtfile - Select a Provider and Model
- Enter your API key and click Test Connection
- Set Target Language (e.g., English)
- Choose Tone: Standard or Adult-Explicit
- Click Start
Note
The Adult-Explicit tone uses specialized instructions tuned for JAV dialogue with appropriate vocabulary.
Batch Processing (Multiple Files)¶
Process an entire folder of videos at once.
- Click Add Folder and select a folder containing videos
- All media files are added to the list
- Configure your pipeline settings
- Click Start
Files are processed sequentially. Each output SRT is saved next to its source video (or to your chosen output directory).
Serial Ensemble Mode
In Ensemble mode, enable Serial mode to complete each file fully (Pass 1 → Pass 2 → Merge) before starting the next. This lets you see results as they finish instead of waiting for the entire batch.
CPU-Only Mode (No GPU)¶
WhisperJAV works without a GPU, just slower.
- Go to the Advanced tab
- Check "Accept CPU-only mode"
- Use Faster mode for the best speed without GPU
- Click Start
Warning
CPU mode is 5-10x slower than GPU mode. A 2-hour video may take 30-60 minutes.
WebVTT Output¶
Generate VTT subtitles for HTML5 video players.
- Go to the Advanced tab
- Set Output Format to VTT or Both (SRT + VTT)
- Run your transcription as normal
The .vtt file is saved alongside the .srt file.