Job Parameters and examples

Parameters

File parameters

files

list

required

List of local input audio file name(s).

files:
- filename_XYZ.mp3
- filename_XYZ.wav

prefix

string

default:""

Apply the task to the files with mentioned prefix in their file name.

prefix: XYZ

Task, model and agent parameters

Below is a table summarizing the output files for each task mode.

Task	Output	Explanation
transcribe	`.json` and `.wav`	Contains the full text transcript and Original audio
phi	`.json` and `.wav`	Transcript plus PHI spans and Original audio with all PHI segments muted
biometric	`.json` and `.wav`	Transcript plus Audio with a new synthetic identity, gender, and age
phi-biometric	`.json` and `.wav`	Transcript plus PHI spans and Audio with both PHI removed and a new synthetic identity, gender, and age

task

string

default:"protect"

Task to apply to attached files. Available task’s:

transcribe (Transcription)
phi (transcribe + PHI redaction)
biometric (Speech to speech voice print redaction)
phi-biometric (transcribe + phi + biometric)

task: transcribe

Advanced vs. Mini

Choose the right model.

model

string

default:"mini"

Model tier to use for phi reduction. Pass advanced if you want to use advanced reasoning via our private-LLM. Available values:

mini
advanced Example:

model: mini

model: advanced

agents

list

default:"health-generic"

Agents with specififc domain dependency and purpose. Available values:

hipaa
health-generic
clinical

model: advanced
agents:
- hipaa

Available Agents

Front-desk health calls or clinical reports, our agents take care of most sensitive Lables.

Diarization and transcription parameters

language

string

default:"en"

Set the target language for transcription. Leave this parameter to detect automatically amoung supported languages else force target language.

language: en

code-switch

boolean

default:"false"

If automatic code-switching should be applied to your files. Available languages supported for code-switching transcription:

code-switch: true

diar

boolean

default:"false"

If speaker diarization should be applied to your files.

diar: true

Biometric parameters

biometric

string

default:"en"

If biometric voice print redaction should be applied to your files, and if so, which language. Available values:

biometric: en

biometric_gender

string

default:"random"

The gender which should be applied to the anonymised version. Available values:

random
same
opposite

biometric_gender: random

biometric_age

string

default:"middle-aged-adult"

The age group which should be applied to the anonymised version. Available values:

young-adult (18-39)
middle-aged-adult (40-69)
same
random

biometric_age: middle-aged-adult

Submit your Job using Voice Habor’s SDK

Speech to text is by default applied for the task protect. To have the transcription without any reduction use transcribe as task.

Define your parameter based on data sensitivity.

Choose the right model

Build Job File

Build your job as yaml file containing the parameters in your target programming language.Minimal job example:

files:
- filename1.mp3
- filename2.wav
model: mini

Submit Job File

Submit your job to trigger the task processing.

BASE_URL = "https://voiceharbor.ai"
usage_token = "USAGE_TOKEN"
# Create a new job on the server via the class method.
job_id = VoiceHarborClient.create_job(BASE_URL, usage_token)

client = VoiceHarborClient(
    base_url=BASE_URL,
    job_id=job_id,
    token=usage_token,
    inputs_dir="./inputs/tests"
)

# Submit input files and the job file. 
job_params = {"files": [], "model":"mini"}  
job_params = client.submit_files(job_params)
job_file = client.submit_job(job_params)
logger.info(f"Job file created: {job_file}")

Get Started with SDK

Start coding today using Python and integrate the Voice Harbor API into your workflows.

Python SDK

Start using our Python SDK to connect with Voice Harbor and implement voice solutions

Transcription, Translation, and Protection Use-Case examples

Show 1. Transcribe in 49 languages

Use this configuration to transcribe audio files in 49 languages.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: translate

Show 2. Transcribe in a Target Language

Use this configuration to transcribe audio files in a target language (e.g., English).

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: translate
language: en

Show 3. Separate Speaker and Translate in Target Language

Use this configuration to separate speakers and translate to a target language.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: translate
language: en
diar: true

Show 4. Separate Speaker, Translate in 49 Languages

Use this configuration to separate speakers and translate in 49 languages.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: translate
diar: true

Show 5. Separate Speaker, Translate for Code-Switch Data in 4 Languages

Use this configuration to separate speakers and translate for code-switch data in 4 languages.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: translate
diar: true
code-switch: true

Show 6. Separate Speaker, Translate for Code-Switch Data in 4 Languages and Redact PHI with Mini Model

Use this configuration to separate speakers, translate for code-switch data in 4 languages, and redact PHI using the mini model.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: phi
diar: true
code-switch: true
model: mini

Show 7. Separate Speaker, Translate in 49 Languages and Redact PHI with Mini Model

Use this configuration to separate speakers, translate in 49 languages, and redact PHI using the mini model.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: phi
diar: true
model: mini

Show 8. Separate Speaker, Translate for Code-Switch Data in 4 Languages and Redact PHI with Advanced Model

Use this configuration to separate speakers, translate for code-switch data in 4 languages, and redact PHI using the advanced model. Includes HIPAA protection.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
diar: true
code-switch: true
task: phi
model: advanced
agents:
  - hippa
  - health-generic

Show 9. Separate Speaker, Translate in Target Language and Redact Biometric Voice Print

Use this configuration to separate speakers, translate in a target language, and redact biometric voice prints.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
task: biometric
language: en
diar: true
biometric: en
biometric_gender: random
biometric_age: middle-aged-adult

Show 10. Separate Speaker, Translate in Target Language, Redact PHI, and Biometric Voice Print

Use this configuration to separate speakers, translate in a target language, redact PHI, and redact biometric voice prints using the advanced model.

files:
  - filename_XYZ.mp3
  - filename_XYZ.wav
  - filename_XYZ.mp3
  - filename_ZYX.mp3
prefix: XYZ
biometric: en
biometric_gender: random
biometric_age: middle-aged-adult
task: phi-biometric
model: advanced
agents:
  - hippa
  - health-generic
diar: true
code-switch: true

PHI and Biometric reduction

Build with Voice Harbor

Speech to Text

Speaker Diarization

Playground

Parameters

File parameters

Task, model and agent parameters

Advanced vs. Mini

Available Agents

Diarization and transcription parameters

Biometric parameters

Submit your Job using Voice Habor’s SDK

Get Started with SDK

Python SDK

Transcription, Translation, and Protection Use-Case examples

PHI and Biometric reduction

Build with Voice Harbor

Speech to Text

Speaker Diarization

Playground

​Parameters

​File parameters

​Task, model and agent parameters

Advanced vs. Mini

Available Agents

​Diarization and transcription parameters

​Biometric parameters

​Submit your Job using Voice Habor’s SDK

​Get Started with SDK

Python SDK

​Transcription, Translation, and Protection Use-Case examples

Parameters

File parameters

Task, model and agent parameters

Diarization and transcription parameters

Biometric parameters

Submit your Job using Voice Habor’s SDK

Get Started with SDK

Transcription, Translation, and Protection Use-Case examples