Redact PHI in audio and transcription using Python SDK
Learn how to transcribe a pre-recorded audio file using the Voice Harbor’s Python SDK, while removing PHI.
Protecting a pre-recorded audio file using the Voice Harbor’s Python SDK
Prerequisite: Ensure that you have Python 3.6+ installed and an active Voice Harbor account with API access.
Step 1: Install Required Libraries
Before you start, you’ll need to install the required libraries for interacting with the Voice Harbor API.
From Source
- Clone the Voice Harbor SDK repository from Github.
- Change into the project directory.
- Install the package in editable mode using pip:
Step 2: Set Up Your API Key
To authenticate with the Voice Harbor API, you’ll need to set your API key. Obtain your API key from your Voice Harbor dashboard.
To set up the developer token for authentication, use the following code:
Step 3: Set Up the Client and Create a Job
Once you have the necessary setup, you can create a job on the server and interact with the API. Here’s how to do it:
Step 4: Initialize the Client and Submit Job Files
Next, initialize the client and define the parameters for your job, including the files you want to submit for transcription.
Not sure which model best fits your data? Read more in Advanced vs. Mini.
Step 5: Submit Job and Download Results
Now, submit the job and wait for the transcription results. Here’s how to complete the process:
It’s recommended to use the a scheduled download approach if submitting larger amount of volume with advanced models. Read Documentation.
Results example
Troubleshooting
If you encounter any issues while transcribing the audio, check the following:
- Ensure that the audio file path is correct and the file is accessible.
- Check the format of the audio file. Voice Harbor currently supports formats like WAV, MP3, and FLAC.
Additional Resources
For more information, you can refer to the following:
Next Steps
If you’re interested in additional use cases or need help with more advanced features, don’t hesitate to explore our comprehensive Voice Harbor Documentation.