whisperx

pixeltable.ext.functions.whisperx

transcribe

transcribe(
    audio: Audio,
    *,
    model: String,
    compute_type: Optional[String] = None,
    language: Optional[String] = None,
    chunk_size: Int = 30
) -> Json

Transcribe an audio file using WhisperX.

This UDF runs a transcription model locally using the WhisperX library. It is equivalent to the WhisperX transcribe function, as described in the WhisperX library documentation.

WhisperX is part of the pixeltable.ext package: long-term support in Pixeltable is not guaranteed.

Requirements:

pip install whisperx

Parameters:

audio (Audio) – The audio file to transcribe.

model (String) – The name of the model to use for transcription.

See the WhisperX library documentation for details on the remaining parameters.

Returns:

Json – A dictionary containing the transcription and various other metadata.

Examples:

Add a computed column that applies the model tiny.en to an existing Pixeltable column tbl.audio of the table tbl:

>>> from pixeltable.ext.functions.whisperx import transcribe
>>> tbl['result'] = transcribe(tbl.audio, model='tiny.en')
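
The returned dictionary can be post-processed with ordinary Python. The sketch below is illustrative only: it assumes the result follows the layout commonly produced by WhisperX, with a 'segments' list whose entries carry 'start', 'end' and 'text' fields. That layout is not specified on this page, so check it against the WhisperX library documentation. The helper segments_to_text and the sample_result data are hypothetical and hand-written, not real model output.

```python
from typing import Any, Dict


def segments_to_text(result: Dict[str, Any]) -> str:
    """Join the text of every segment in a WhisperX-style result dictionary."""
    # Assumption: 'segments' is a list of dicts, each with a 'text' field.
    segments = result.get('segments', [])
    # Strip the padding around each segment and skip empty ones.
    return ' '.join(seg['text'].strip() for seg in segments if seg.get('text'))


# Hypothetical result, written by hand for illustration.
sample_result = {
    'language': 'en',
    'segments': [
        {'start': 0.0, 'end': 2.5, 'text': ' Hello there.'},
        {'start': 2.5, 'end': 5.0, 'text': ' How are you?'},
    ],
}

print(segments_to_text(sample_result))  # prints: Hello there. How are you?
```

Using result.get('segments', []) makes the helper return an empty string for a dictionary without segments rather than raising a KeyError.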