⚙️ Advanced Options

This guide covers advanced features for power users who need more control over transcription, translation, and export settings.

Translation

Translate your video's subtitles into a different language using the translateTo parameter.

{
  "templateId": "a51c5222-47a7-4c37-b052-7b9853d66bf6",
  "autoApprove": true,
  "translateTo": "es"
}

The API will transcribe your video in its original language, then translate the subtitles to the target language.

Supported Languages

Both language (source) and translateTo (target) support the following language codes:

af, am, ar, as, az, ba, be, bg, bn, bo, br, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fo, fr, gl, gu, ha, haw, he, hi, hr, ht, hu, hy, id, is, it, ja, jw, ka, kk, km, kn, ko, la, lb, ln, lo, lt, lv, mg, mi, mk, ml, mn, mr, ms, mt, my, ne, nl, nn, no, oc, pa, pl, ps, pt, ro, ru, sa, sd, si, sk, sl, sn, so, sq, sr, su, sv, sw, ta, te, tg, th, tk, tl, tr, tt, uk, ur, uz, vi, yi, yo, zh, yue

Dictionary

Improve transcription accuracy for domain-specific terms using the dictionary parameter. This is useful for brand names, product names, technical terms, or any words the transcription engine might not recognize correctly.

{
  "templateId": "a51c5222-47a7-4c37-b052-7b9853d66bf6",
  "autoApprove": true,
  "dictionary": ["ZapCap", "OpenAI", "GPT-4", "Kubernetes"]
}

The transcription engine will use these words as hints to improve recognition accuracy.

Reference Transcript

Use a reference transcript for spelling correction when you have phrase-level timing (like an SRT file) but need word-level timestamps.

{
  "templateId": "a51c5222-47a7-4c37-b052-7b9853d66bf6",
  "autoApprove": true,
  "referenceTranscript": [
    {
      "text": "Hello world, how are you?",
      "start_time": 1.5,
      "end_time": 3.2
    },
    {
      "text": "Welcome to ZapCap.",
      "start_time": 3.5,
      "end_time": 5.0
    }
  ]
}

The API will transcribe the video to generate word-level timing, then use your reference phrases to correct spellings via matching. The timing bounds improve matching confidence.

tip

If you already have word-level timing, use the transcript parameter instead to bypass transcription entirely.

Bring Your Own Transcript (BYOT)

Skip transcription entirely by providing your own word-level transcript using the transcript parameter.

{
  "templateId": "a51c5222-47a7-4c37-b052-7b9853d66bf6",
  "autoApprove": true,
  "transcript": [
    {
      "text": "Hello",
      "type": "word",
      "start_time": 1.5,
      "end_time": 1.8
    },
    {
      "text": "world",
      "type": "word",
      "start_time": 1.9,
      "end_time": 2.3
    }
  ]
}

Word Entry Properties

Property	Type	Required	Description
`text`	string	Yes	The subtitle text
`type`	string	Yes	Entry type (use `"word"`)
`start_time`	number	Yes	Start time in seconds
`end_time`	number	Yes	End time in seconds
`emoji`	string	No	Emoji representation of the word
`important`	boolean	No	Mark word as important for highlighting
`fontId`	string	No	Custom font ID for this word

See the API Reference for the full list of word entry properties.

Export Settings

Control the output quality and format using exportSettings.

{
  "templateId": "a51c5222-47a7-4c37-b052-7b9853d66bf6",
  "autoApprove": true,
  "exportSettings": {
    "fps": 60,
    "quality": "quadHD",
    "speed": "fast"
  }
}

Properties

Property	Type	Options	Default	Description
`fps`	number	`30`, `60`	`30`	Frames per second
`quality`	string	`"standard"`, `"quadHD"`, `"ultraHD"`	`"standard"`	Output resolution (1080p, 2K, 4K)
`speed`	string	`"standard"`, `"fast"`, `"ultraFast"`	`"standard"`	Export speed (each level ~2x faster)
`greenScreen`	boolean	`true`, `false`	`false`	Export with green screen background for compositing

Quality Levels

standard: 1080p resolution
quadHD: 2K resolution
ultraHD: 4K resolution

Cost Impact

Higher quality and FPS settings increase processing costs. See Billing for pricing details.

Complete Example

Here's an example combining multiple advanced options:

POST /videos/{videoId}/task
Content-Type: application/json
x-api-key: YOUR_API_KEY

{
  "templateId": "a51c5222-47a7-4c37-b052-7b9853d66bf6",
  "autoApprove": true,
  "language": "en",
  "translateTo": "es",
  "dictionary": ["ZapCap", "API"],
  "exportSettings": {
    "fps": 60,
    "quality": "quadHD",
    "speed": "fast"
  }
}

This creates a task that:

Transcribes the video in English
Uses the dictionary to improve recognition of "ZapCap" and "API"
Translates the subtitles to Spanish
Exports at 60fps in 2K resolution with faster processing

⚙️ Advanced Options

Translation​

Supported Languages​

Dictionary​

Reference Transcript​

Bring Your Own Transcript (BYOT)​

Word Entry Properties​

Export Settings​

Properties​

Quality Levels​

Complete Example​