speechmaticstranscriber

Speech-to-text filter that transcribes audio using Speechmatics

Hierarchy

GObject
    ╰──GInitiallyUnowned
        ╰──GstObject
            ╰──GstElement
                ╰──speechmaticstranscriber

Implemented interfaces

Factory details

Authors – Mathieu Duponchelle

Classification – Audio/Text/Filter

Rank – none

Plugin – speechmatics

Package – gst-plugin-speechmatics

Pad Templates

sink

audio/x-raw:
           rate: [ 8000, 48000 ]
       channels: 1
         layout: { (string)interleaved, (string)non-interleaved }
         format: S16LE

Presence – always

Direction – sink

Object type – GstPad
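
The sink template only accepts mono S16LE audio at 8000 to 48000 Hz, so most sources need converting first. The following is a minimal sketch, assuming a gst-launch-1.0 style pipeline description and the standard audioconvert and audioresample elements. The helper names and the 16000 Hz default are illustrative choices, not part of the plugin.

```python
def sink_caps(rate: int = 16000) -> str:
    """Build a caps string that fits the sink pad template above.

    The 16000 Hz default is an arbitrary illustrative choice.
    """
    if not 8000 <= rate <= 48000:
        raise ValueError(f"rate {rate} is outside the template range [8000, 48000]")
    return f"audio/x-raw,format=S16LE,channels=1,layout=interleaved,rate={rate}"


def conversion_chain(rate: int = 16000) -> str:
    """Return a pipeline fragment that normalizes audio for the transcriber."""
    return f"audioconvert ! audioresample ! {sink_caps(rate)} ! speechmaticstranscriber"


print(conversion_chain())
```

The returned fragment can be placed after any decoded audio source in a pipeline description.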


src

text/x-raw:
         format: utf8

Presence – always

Direction – src

Object type – GstSpeechmaticsTranscriberSrcPad


translate_src_%u

text/x-raw:
         format: utf8

Presence – request

Direction – src

Object type – GstSpeechmaticsTranscriberSrcPad


unsynced_src

application/x-json:

Presence – always

Direction – src

Object type – GstPad


unsynced_translate_src_%u

application/x-json:

Presence – sometimes

Direction – src

Object type – GstPad
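
Because translate_src_%u is a request pad, an application has to ask for it explicitly. The following is a hedged sketch of how that might look with the Python bindings: the function name is hypothetical, it assumes GStreamer 1.20 or newer for request_pad_simple(), and it assumes the pad's language-code property can be set right after the pad is requested.

```python
def request_translation_pad(transcriber, language_code: str):
    """Request a translate_src_%u pad and set its target language.

    `transcriber` is expected to be a speechmaticstranscriber Gst.Element.
    """
    # request_pad_simple() needs GStreamer 1.20 or newer.
    pad = transcriber.request_pad_simple("translate_src_%u")
    if pad is None:
        raise RuntimeError("could not request a translate_src_%u pad")
    # "language-code" is the property documented on
    # GstSpeechmaticsTranscriberSrcPad.
    pad.set_property("language-code", language_code)
    return pad
```

With PyGObject, this would typically be called on an element created with Gst.ElementFactory.make("speechmaticstranscriber", None), and the returned pad then linked to a downstream text sink.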


Properties

additional-vocabulary

“additional-vocabulary” GstValueArray *

Additional vocabulary that Speechmatics should use

Flags : Read / Write


api-key

“api-key” gchararray

Speechmatics API Key

Flags : Read / Write

Default value : NULL


diarization

“diarization” GstSpeechmaticsTranscriberDiarization *

Defines how to separate speakers in the audio

Flags : Read / Write

Default value : none (0)


enable-late-punctuation-hack

“enable-late-punctuation-hack” gboolean

Pass a reduced max-delay to Speechmatics so that punctuation always arrives in time to be joined with the preceding word.

Flags : Read / Write

Default value : true


join-punctuation

“join-punctuation” gboolean

Whether punctuation should be joined with the preceding word

Flags : Read / Write

Default value : true


language-code

“language-code” gchararray

The language of the stream, as an ISO code

Flags : Read / Write

Default value : en


latency

“latency” guint

Number of milliseconds to allow for transcription

Flags : Read / Write

Default value : 8000


lateness

“lateness” guint

Number of milliseconds to introduce as lateness

Flags : Read / Write

Default value : 0


max-delay

“max-delay” guint

Max delay to pass to the Speechmatics API (0 = use latency)

Flags : Read / Write

Default value : 0
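
The "0 = use latency" rule can be sketched as a small helper. This is an illustration of the documented fallback only, not the element's actual code: the function name is hypothetical, and the reduction applied by enable-late-punctuation-hack is left out because its size is not stated here.

```python
def effective_max_delay(latency_ms: int = 8000, max_delay_ms: int = 0) -> int:
    """Resolve the delay value implied by the property descriptions.

    A max-delay of 0 means "use latency"; any other value is used as given.
    The enable-late-punctuation-hack reduction is deliberately not modelled.
    """
    if latency_ms < 0 or max_delay_ms < 0:
        raise ValueError("both properties are unsigned (guint)")
    return latency_ms if max_delay_ms == 0 else max_delay_ms


print(effective_max_delay())            # defaults: max-delay=0, latency=8000
print(effective_max_delay(8000, 2000))  # explicit max-delay
```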


max-speakers

“max-speakers” guint

The maximum number of speakers that may be detected with diarization=speaker

Flags : Read / Write

Default value : 50


url

“url” gchararray

URL of the transcription server

Flags : Read / Write

Default value : ws://0.0.0.0:9000


Named constants

GstSpeechmaticsTranscriberDiarization

Members

none (0) – None: no diarization
speaker (1) – Speaker: identify speakers by their voices
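
As an illustration of how these members relate to the diarization and max-speakers properties, the sketch below mirrors the enum locally and formats the matching arguments for a gst-launch-1.0 description. The class and function names are hypothetical. Only the member values and nicks come from the list above.

```python
from enum import IntEnum


class Diarization(IntEnum):
    """Local mirror of GstSpeechmaticsTranscriberDiarization (illustrative)."""
    NONE = 0
    SPEAKER = 1

    @property
    def nick(self) -> str:
        # gst-launch-1.0 accepts enum values by nick, e.g. diarization=speaker
        return self.name.lower()


def diarization_args(mode: Diarization, max_speakers: int = 50) -> str:
    """Format diarization properties for a gst-launch-1.0 description."""
    args = f"diarization={mode.nick}"
    if mode is Diarization.SPEAKER:
        # max-speakers is documented as applying to diarization=speaker
        args += f" max-speakers={max_speakers}"
    return args


print(diarization_args(Diarization.SPEAKER, max_speakers=4))
```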

GstSpeechmaticsTranscriberSrcPad

GObject
    ╰──GInitiallyUnowned
        ╰──GstObject
            ╰──GstPad
                ╰──GstSpeechmaticsTranscriberSrcPad

Properties

language-code

“language-code” gchararray

The language the stream must be translated to

Flags : Read / Write

Default value : NULL

