Writing a N-to-1 Element or Muxer

Writing a N-to-1 Element or Muxer N-to-1 elements have been previously mentioned and discussed in both and in . The main noteworthy thing about N-to-1 elements is that they should always, without any single exception, be _loop ()-based. Apart from that, there is not much general that you need to know. We will discuss one special type of N-to-1 elements here, these being muxers. The first two of these sections apply to N-to-1 elements in general, though. The Data Loop Function As previously mentioned in , N-to-1 elements generally try to have one buffer from each sink pad and then handle the one with the earliest timestamp. There's some exceptions to this rule, we will come to those later. This only works if all streams actually continuously provide input. There might be cases where this is not true, for example subtitles (there might be no subtitle for a while), overlay images and so forth. For this purpose, there is a _select () function in &GStreamer;. It checks whether input is available on a (list of) pad(s). In this way, you can skip over the pads that are 'non- continuous'. /* Pad selection is currently broken, FIXME some day */ Events in the Loop Function N-to-1 elements using a cache will sometimes receive events, and it is often unclear how to handle those. For example, how do you seek to a frame in an output file (and what's the point of it anyway)? So, do discontinuity or seek events make sense, and should you use them? Discontinuities and flushes Don't do anything. They specify a discontinuity in the output, and you should continue to playback as you would otherwise. You generally do not need to put a discontinuity in the output stream in muxers; you would have to manually start adapting timestamps of output frames (if appliccable) to match the previous timescale, though. Note that the output data stream should be continuous. For other types of N-to-1-elements, it is generally fine to forward the discontinuity once it has been received from all pads. This depends on the specific element. Seeks Depends on the element. Muxers would generally not implement this, because the concept of seeking in an output stream at frame level is not very useful. Seeking at byte level can be useful, but that is more generally done by muxers on sink elements. End-of-Stream Speaks for itself. Negotiation Most container formats will have a fair amount of issues with changing content on an elementary stream. Therefore, you should not allow caps to be changed once you've started using data from them. The easiest way to achieve this is by using explicit caps, which have been explained before. However, we're going to use them in a slightly different way then what you're used to, having the core do all the work for us. The idea is that, as long as the stream/file headers have not been written yet and no data has been processed yet, a stream is allowed to renegotiate. After that point, the caps should be fixed, because we can only use a stream once. Caps may then only change within an allowed range (think MPEG, where changes in FPS are allowed), or sometimes not at all (such as AVI audio). In order to do that, we will, after data retrieval and header writing, set an explicit caps on each sink pad, that is the minimal caps describing the properties of the format that may not change. As an example, for MPEG audio inside an MPEG system stream, this would mean a wide caps of audio/mpeg with mpegversion=1 and layer=[1,2]. For the same audio type in MPEG, though, the samplerate, bitrate, layer and number of channels would become static, too. Since the (request) pads will be removed when the stream ends, the static caps will cease to exist too, then. While the explicit caps exist, the _link ()- function will not be called, since the core will do all necessary checks for us. Note that the property of using explicit caps should be added along with the actual explicit caps, not any earlier. Below here follows the simple example of an AVI muxer's audio caps negotiation. The _link ()-function is fairly normal, but the -Loop ()-function does some of the tricks mentioned above. There is no _getcaps ()- function since the pad template contains all that information already (not shown). static GstPadLinkReturn gst_avi_mux_audio_link (GstPad *pad, const GstCaps *caps) { GstAviMux *mux = GST_AVI_MUX (gst_pad_get_parent (pad)); GstStructure *str = gst_caps_get_structure (caps, 0); const gchar *mime = gst_structure_get_name (str); if (!strcmp (str, "audio/mpeg")) { /* get version, make sure it's 1, get layer, make sure it's 1-3, * then create the 2-byte audio tag (0x0055) and fill an audio * stream structure (strh/strf). */ [..] return GST_PAD_LINK_OK; } else if !strcmp (str, "audio/x-raw-int")) { /* See above, but now with the raw audio tag (0x0001). */ [..] return GST_PAD_LINK_OK; } else [..] [..] } static void gst_avi_mux_loop (GstElement *element) { GstAviMux *mux = GST_AVI_MUX (element); [..] /* As we get here, we should have written the header if we hadn't done * that before yet, and we're supposed to have an internal buffer from * each pad, also from the audio one. So here, we check again whether * this is the first run and if so, we set static caps. */ if (mux->first_cycle) { const GList *padlist = gst_element_get_pad_list (element); GList *item; for (item = padlist; item != NULL; item = item->next) { GstPad *pad = item->data; GstCaps *caps; if (!GST_PAD_IS_SINK (pad)) continue; /* set static caps here */ if (!strncmp (gst_pad_get_name (pad), "audio_", 6)) { /* the strf is the struct you filled in the _link () function. */ switch (strf->format) { case 0x0055: /* mp3 */ caps = gst_caps_new_simple ("audio/mpeg", "mpegversion", G_TYPE_INT, 1, "layer", G_TYPE_INT, 3, "bitrate", G_TYPE_INT, strf->av_bps, "rate", G_TYPE_INT, strf->rate, "channels", G_TYPE_INT, strf->channels, NULL); break; case 0x0001: /* pcm */ caps = gst_caps_new_simple ("audio/x-raw-int", [..]); break; [..] } } else if (!strncmp (gst_pad_get_name (pad), "video_", 6)) { [..] } else { g_warning ("oi!"); continue; } /* set static caps */ gst_pad_use_explicit_caps (pad); gst_pad_set_explicit_caps (pad, caps); } } [..] /* Next runs will never be the first again. */ mux->first_cycle = FALSE; } Note that there are other ways to achieve that, which might be useful for more complex cases. This will do for the simple cases, though. This method is provided to simplify negotiation and renegotiation in muxers, it is not a complete solution, nor is it a pretty one. Markup vs. data processing As we noted on demuxers before, we love common programming paradigms such as clean, lean and mean code. To achieve that in muxers, it's generally a good idea to separate the actual data stream markup from the data processing. To illustrate, here's how AVI muxers should write out RIFF tag chunks: static void gst_avi_mux_write_chunk (GstAviMux *mux, guint32 id, GstBuffer *data) { GstBuffer *hdr; hdr = gst_buffer_new_and_alloc (8); ((guint32 *) GST_BUFFER_DATA (buf))[0] = GUINT32_TO_LE (id); ((guint32 *) GST_BUFFER_DATA (buf))[1] = GUINT32_TO_LE (GST_BUFFER_SIZE (data)); gst_pad_push (mux->srcpad, hdr); gst_pad_push (mux->srcpad, data); } static void gst_avi_mux_loop (GstElement *element) { GstAviMux *mux = GST_AVI_MUX (element); GstBuffer *buf; [..] buf = gst_pad_pull (mux->sinkpad[0]); [..] gst_avi_mux_write_chunk (GST_MAKE_FOURCC ('0','0','d','b'), buf); } In general, try to program clean code, that should cover pretty much everything.