gst-plugins-rs/video/closedcaption/src/transcriberbin/mod.rs

// Copyright (C) 2021 Mathieu Duponchelle <mathieu@centricular.com>
//
// This Source Code Form is subject to the terms of the Mozilla Public License, v2.0.
// If a copy of the MPL was not distributed with this file, You can obtain one at
// <https://mozilla.org/MPL/2.0/>.
//
// SPDX-License-Identifier: MPL-2.0

use gst::glib;
use gst::prelude::*;

mod imp;

#[derive(Debug, Eq, PartialEq, Ord, PartialOrd, Hash, Clone, Copy, glib::Enum)]
#[repr(u32)]
#[enum_type(name = "GstTranscriberBinCaptionSource")]
pub enum CaptionSource {
    Both,
    Transcription,
    Inband,
}

#[derive(Debug, Copy, Clone, Default, PartialEq, Eq, glib::Enum)]
#[repr(u32)]
#[enum_type(name = "GstTranscriberBinMuxMethod")]
enum MuxMethod {
    #[default]
    Cea608,
    Cea708,
}

glib::wrapper! {
    pub struct TranscriberBin(ObjectSubclass<imp::TranscriberBin>) @extends gst::Bin, gst::Element, gst::Object, @implements gst::ChildProxy;
}

glib::wrapper! {
    pub struct TranscriberSinkPad(ObjectSubclass<imp::TranscriberSinkPad>) @extends gst::GhostPad, gst::ProxyPad, gst::Pad, gst::Object;
}

glib::wrapper! {
    pub struct TranscriberSrcPad(ObjectSubclass<imp::TranscriberSrcPad>) @extends gst::GhostPad, gst::ProxyPad, gst::Pad, gst::Object;
}

pub fn register(plugin: &gst::Plugin) -> Result<(), glib::BoolError> {
    #[cfg(feature = "doc")]
    {
        CaptionSource::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());
        MuxMethod::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());
        TranscriberSinkPad::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());
        TranscriberSrcPad::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());
    }

    gst::Element::register(
        Some(plugin),
        "transcriberbin",
        gst::Rank::NONE,
        TranscriberBin::static_type(),
    )
}
transcriberbin: new high-level bin for speech to Closed Caption This new element puts together some of the elements we've written in recent times (awstranscriber, tttocea608, textwrap, cccombiner) into a convenience high-level element. The design of the element is AV in -> AV (+ CC metas) out. The element exposes property to set and unset a "passthrough" mode, during which the transcriber element's state is set to NULL but kept in the bin, in order for the user to be able to set properties on sub elements no matter what the current mode is, using the GstChildProxy interface. In addition, the element ensures that the latency it reports stays fixed so that playback continues uninterrupted. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/528> 2021-06-22 20:08:08 +00:00			`// Copyright (C) 2021 Mathieu Duponchelle <mathieu@centricular.com>`
			`//`
Re-license LGPL-2.1 plugins to MPL-2 Fixes https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/issues/168 2022-01-15 18:40:12 +00:00			`// This Source Code Form is subject to the terms of the Mozilla Public License, v2.0.`
			`// If a copy of the MPL was not distributed with this file, You can obtain one at`
			`// <https://mozilla.org/MPL/2.0/>.`
transcriberbin: new high-level bin for speech to Closed Caption This new element puts together some of the elements we've written in recent times (awstranscriber, tttocea608, textwrap, cccombiner) into a convenience high-level element. The design of the element is AV in -> AV (+ CC metas) out. The element exposes property to set and unset a "passthrough" mode, during which the transcriber element's state is set to NULL but kept in the bin, in order for the user to be able to set properties on sub elements no matter what the current mode is, using the GstChildProxy interface. In addition, the element ensures that the latency it reports stays fixed so that playback continues uninterrupted. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/528> 2021-06-22 20:08:08 +00:00			`//`
Re-license LGPL-2.1 plugins to MPL-2 Fixes https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/issues/168 2022-01-15 18:40:12 +00:00			`// SPDX-License-Identifier: MPL-2.0`
transcriberbin: new high-level bin for speech to Closed Caption This new element puts together some of the elements we've written in recent times (awstranscriber, tttocea608, textwrap, cccombiner) into a convenience high-level element. The design of the element is AV in -> AV (+ CC metas) out. The element exposes property to set and unset a "passthrough" mode, during which the transcriber element's state is set to NULL but kept in the bin, in order for the user to be able to set properties on sub elements no matter what the current mode is, using the GstChildProxy interface. In addition, the element ensures that the latency it reports stays fixed so that playback continues uninterrupted. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/528> 2021-06-22 20:08:08 +00:00
			`use gst::glib;`
			`use gst::prelude::*;`

			`mod imp;`

transcriberbin: Add caption-source property By using this new property, application can select exclusive caption source. There are three source types - Both: Inband and transcription captions are combined if exist. This is default behavior. - Inband: Transcription buffers will be dropped - Transcription: Caption meta of each video buffer will be dropped In this version, transcriberbin doesn't provide any hint for application to help caption source decision. That can be done by application's strategy, passthrough status or probing inband caption meta for example. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/684> 2022-03-07 16:06:12 +00:00			`#[derive(Debug, Eq, PartialEq, Ord, PartialOrd, Hash, Clone, Copy, glib::Enum)]`
			`#[repr(u32)]`
			`#[enum_type(name = "GstTranscriberBinCaptionSource")]`
			`pub enum CaptionSource {`
			`Both,`
			`Transcription,`
			`Inband,`
			`}`

transcriberbin: also support 608 inside 708 Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/1406> 2023-12-12 03:20:43 +00:00			`#[derive(Debug, Copy, Clone, Default, PartialEq, Eq, glib::Enum)]`
			`#[repr(u32)]`
			`#[enum_type(name = "GstTranscriberBinMuxMethod")]`
			`enum MuxMethod {`
			`#[default]`
			`Cea608,`
			`Cea708,`
			`}`

transcriberbin: new high-level bin for speech to Closed Caption This new element puts together some of the elements we've written in recent times (awstranscriber, tttocea608, textwrap, cccombiner) into a convenience high-level element. The design of the element is AV in -> AV (+ CC metas) out. The element exposes property to set and unset a "passthrough" mode, during which the transcriber element's state is set to NULL but kept in the bin, in order for the user to be able to set properties on sub elements no matter what the current mode is, using the GstChildProxy interface. In addition, the element ensures that the latency it reports stays fixed so that playback continues uninterrupted. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/528> 2021-06-22 20:08:08 +00:00			`glib::wrapper! {`
transcriberbin: add support for consuming secondary audio streams In some situations, a translated alternate audio stream for a content might be available. Instead of going through transcription and translation of the original audio stream, it may be preferrable for accuracy purposes to simply transcribe the secondary audio stream. This MR adds support for doing just that: * Secondary audio sink pads can be requested as "sink_audio_%u" * Sometimes audio source pads are added at that point to pass through the audio, as "src_audio_%u" * The main transcription bin now contains per-input stream transcription bins. Those can be individually controlled through properties on the sink pads, for instance translation-languages can be dynamically set per audio stream * Some properties that originally existed on the main element still remain, but are now simply mapped to the always audio sink pad * Releasing of secondary sink pads is nominally implemented, but not tested in states other than NULL An example launch line for this would be: ``` $ gst-launch-1.0 transcriberbin name=transcriberbin latency=8000 accumulate-time=0 \ cc-caps="closedcaption/x-cea-708, format=cc_data" sink_audio_0::language-code="es-US" \ sink_audio_0::translation-languages="languages, transcript=cc3" uridecodebin uri=file:///home/meh/Music/chaplin.mkv name=d d. ! videoconvert ! transcriberbin.sink_video d. ! clocksync ! audioconvert ! transcriberbin.sink_audio transcriberbin.src_video ! cea608overlay field=1 ! videoconvert ! autovideosink \ transcriberbin.src_audio ! audioconvert ! fakesink \ uridecodebin uri=file:///home/meh/Music/chaplin-spanish.webm name=d2 \ d2. ! audioconvert ! transcriberbin.sink_audio_0 \ transcriberbin.src_audio_0 ! fakesink ``` Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/1546> 2024-04-19 18:12:46 +00:00			`pub struct TranscriberBin(ObjectSubclass<imp::TranscriberBin>) @extends gst::Bin, gst::Element, gst::Object, @implements gst::ChildProxy;`
			`}`

			`glib::wrapper! {`
			`pub struct TranscriberSinkPad(ObjectSubclass<imp::TranscriberSinkPad>) @extends gst::GhostPad, gst::ProxyPad, gst::Pad, gst::Object;`
			`}`

			`glib::wrapper! {`
			`pub struct TranscriberSrcPad(ObjectSubclass<imp::TranscriberSrcPad>) @extends gst::GhostPad, gst::ProxyPad, gst::Pad, gst::Object;`
transcriberbin: new high-level bin for speech to Closed Caption This new element puts together some of the elements we've written in recent times (awstranscriber, tttocea608, textwrap, cccombiner) into a convenience high-level element. The design of the element is AV in -> AV (+ CC metas) out. The element exposes property to set and unset a "passthrough" mode, during which the transcriber element's state is set to NULL but kept in the bin, in order for the user to be able to set properties on sub elements no matter what the current mode is, using the GstChildProxy interface. In addition, the element ensures that the latency it reports stays fixed so that playback continues uninterrupted. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/528> 2021-06-22 20:08:08 +00:00			`}`

			`pub fn register(plugin: &gst::Plugin) -> Result<(), glib::BoolError> {`
Generate plugins documentation using hotdoc Which will automatically be integrated in gstreamer documentation 2022-08-25 22:30:08 +00:00			`#[cfg(feature = "doc")]`
transcriberbin: also support 608 inside 708 Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/1406> 2023-12-12 03:20:43 +00:00			`{`
			`CaptionSource::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());`
			`MuxMethod::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());`
transcriberbin: add support for consuming secondary audio streams In some situations, a translated alternate audio stream for a content might be available. Instead of going through transcription and translation of the original audio stream, it may be preferrable for accuracy purposes to simply transcribe the secondary audio stream. This MR adds support for doing just that: * Secondary audio sink pads can be requested as "sink_audio_%u" * Sometimes audio source pads are added at that point to pass through the audio, as "src_audio_%u" * The main transcription bin now contains per-input stream transcription bins. Those can be individually controlled through properties on the sink pads, for instance translation-languages can be dynamically set per audio stream * Some properties that originally existed on the main element still remain, but are now simply mapped to the always audio sink pad * Releasing of secondary sink pads is nominally implemented, but not tested in states other than NULL An example launch line for this would be: ``` $ gst-launch-1.0 transcriberbin name=transcriberbin latency=8000 accumulate-time=0 \ cc-caps="closedcaption/x-cea-708, format=cc_data" sink_audio_0::language-code="es-US" \ sink_audio_0::translation-languages="languages, transcript=cc3" uridecodebin uri=file:///home/meh/Music/chaplin.mkv name=d d. ! videoconvert ! transcriberbin.sink_video d. ! clocksync ! audioconvert ! transcriberbin.sink_audio transcriberbin.src_video ! cea608overlay field=1 ! videoconvert ! autovideosink \ transcriberbin.src_audio ! audioconvert ! fakesink \ uridecodebin uri=file:///home/meh/Music/chaplin-spanish.webm name=d2 \ d2. ! audioconvert ! transcriberbin.sink_audio_0 \ transcriberbin.src_audio_0 ! fakesink ``` Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/1546> 2024-04-19 18:12:46 +00:00			`TranscriberSinkPad::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());`
			`TranscriberSrcPad::static_type().mark_as_plugin_api(gst::PluginAPIFlags::empty());`
transcriberbin: also support 608 inside 708 Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/1406> 2023-12-12 03:20:43 +00:00			`}`
Generate plugins documentation using hotdoc Which will automatically be integrated in gstreamer documentation 2022-08-25 22:30:08 +00:00
transcriberbin: new high-level bin for speech to Closed Caption This new element puts together some of the elements we've written in recent times (awstranscriber, tttocea608, textwrap, cccombiner) into a convenience high-level element. The design of the element is AV in -> AV (+ CC metas) out. The element exposes property to set and unset a "passthrough" mode, during which the transcriber element's state is set to NULL but kept in the bin, in order for the user to be able to set properties on sub elements no matter what the current mode is, using the GstChildProxy interface. In addition, the element ensures that the latency it reports stays fixed so that playback continues uninterrupted. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/528> 2021-06-22 20:08:08 +00:00			`gst::Element::register(`
			`Some(plugin),`
			`"transcriberbin",`
Update for `gst::Rank` API changes 2023-11-02 12:10:59 +00:00			`gst::Rank::NONE,`
transcriberbin: new high-level bin for speech to Closed Caption This new element puts together some of the elements we've written in recent times (awstranscriber, tttocea608, textwrap, cccombiner) into a convenience high-level element. The design of the element is AV in -> AV (+ CC metas) out. The element exposes property to set and unset a "passthrough" mode, during which the transcriber element's state is set to NULL but kept in the bin, in order for the user to be able to set properties on sub elements no matter what the current mode is, using the GstChildProxy interface. In addition, the element ensures that the latency it reports stays fixed so that playback continues uninterrupted. Part-of: <https://gitlab.freedesktop.org/gstreamer/gst-plugins-rs/-/merge_requests/528> 2021-06-22 20:08:08 +00:00			`TranscriberBin::static_type(),`
			`)`
			`}`