docs/pwg/advanced-types.xml: Document typefinding.

Original commit message from CVS: * docs/pwg/advanced-types.xml: Document typefinding. * docs/pwg/other-oneton.xml: Document one-to-n elements, demuxers and parsers.
2025-01-03 14:08:56 +00:00 · 2004-03-26 01:53:06 +00:00 · 2004-03-26 01:53:06 +00:00 · 4d9b9b989e
commit 4d9b9b989e
parent 5710f325b3
3 changed files with 208 additions and 7 deletions
--- a/7
+++ b/7
@ -1,3 +1,10 @@
 2004-03-25  Ronald Bultje  <rbultje@ronald.bitfreak.net>
 	* docs/pwg/advanced-types.xml:
 	  Document typefinding.
 	* docs/pwg/other-oneton.xml:
 	  Document one-to-n elements, demuxers and parsers.
 2004-03-25  Tim-Philipp Müller <t.i.m@zen.co.uk>
 	reviewed by: David Schleef  <ds@schleef.org>
--- a/docs/pwg/advanced-types.xml
+++ b/docs/pwg/advanced-types.xml
@ -86,7 +86,66 @@
  <sect1 id="section-types-typefind" xreflabel="Typefind Functions and Autoplugging">
    <title>Typefind Functions and Autoplugging</title>
    <para>
-      WRITEME
+      With only <emphasis>defining</emphasis> the types, we're not yet there.
      In order for a random data file to be recognized and played back as
      such, we need a way of recognizing their type out of the blue. For this
      purpose, <quote>typefinding</quote> was introduced. Typefinding is the
      process of detecting the type of a datastream. Typefinding consists of
      two separate parts: first, there's an unlimited number of functions
      that we call <emphasis>typefind functions</emphasis>, which are each
      able to recognize one or more types from an input stream. Then,
      secondly, there's a small engine which registers and calls each of
      those functions. This is the typefind core. On top of this typefind
      core, you would normally write an autoplugger, which is able to use
      this type detection system to dynamically build a pipeline around an
      input stream. Here, we will focus only on typefind functions.
    </para>
    <para>
      A typefind function ususally lives in
      <filename>gst-plugins/gst/typefind/gsttypefindfunctions.c</filename>,
      unless there's a good reason (like library dependencies) to put it
      elsewhere. The reason for this centralization is to decreate the
      number of plugins that need to be loaded in order to detect a stream's
      type. Below is an example that will recognize AVI files, which start
      with a <quote>RIFF</quote> tag, then the size of the file and then an
      <quote>AVI </quote> tag:
    </para>
    <programlisting>
 static void
 gst_my_typefind_function (GstTypeFind *tf,
 			  gpointer     data)
 {
  guint8 *data = gst_type_find_peek (tf, 0, 12);
  if (data &amp;&amp;
      GUINT32_FROM_LE ((guint32 *) data)[0] == GST_MAKE_FOURCC ('R','I','F','F') &amp;&amp;
      GUINT32_FROM_LE ((guint32 *) data)[2] == GST_MAKE_FOURCC ('A','V','I',' ')) {
    gst_type_find_suggest (tf, GST_TYPE_FIND_MAXIMUM,
 			   gst_caps_new_simple ("video/x-msvideo", NULL));
  }
 }
 static gboolean
 plugin_init (GstPlugin *plugin)
 {
  static gchar *exts[] = { "avi", NULL };
  if (!gst_type_find_register (plugin, "", GST_RANK_PRIMARY,
 			       gst_my_typefind_function, exts,
 			       gst_caps_new_simple ("video/x-msvideo",
 						    NULL), NULL))
    return FALSE;
 }
    </programlisting>
    <para>
      Note that
      <filename>gst-plugins/gst/typefind/gsttypefindfunctions.c</filename>
      has some simplification macros to decrease the amount of code. Make
      good use of those if you want to submit typefinding patches with new
      typefind functions.
    </para>
    <para>
      Autoplugging will be discussed in great detail in the chapter called
      <xref linkend="chapter-other-autoplugger"/>.
    </para>
  </sect1>
--- a/docs/pwg/other-oneton.xml
+++ b/docs/pwg/other-oneton.xml
@ -1,16 +1,151 @@
 <!-- ############ chapter ############# -->
-<chapter id="chapter-other-oneton" xreflabel="Writing a 1-to-N Element">
+<chapter id="chapter-other-oneton" xreflabel="Writing a 1-to-N Element, Demuxer or Parser">
-  <title>Writing a 1-to-N element</title>
+  <title>Writing a 1-to-N Element, Demuxer or Parser</title>
  <para>
-    FIXME: write.
+    1-to-N elements don't have much special needs or requirements that
    haven't been discussed already. The most important thing to take care
    of in 1-to-N elements (things like <classname>tee</classname>-elements
    or so) is to use proper buffer refcounting and caps negotiation. If
    those two are taken care of (see the <classname>tee</classname> element
    if you need example code), there's little that can go wrong.
  </para>
  <para>
    Demuxers are the 1-to-N elements that need very special care, though.
    They are responsible for timestamping raw, unparsed data into
    elementary video or audio streams, and there are many things that you
    can optimize or do wrong. Here, several culprits will be mentioned
    and common solutions will be offered. Parsers are demuxers with only
    one source pad. Also, they only cut the stream into buffers, they
    don't touch the data otherwise.
  </para>
-  <sect1 id="section-other-demuxer" xreflabel="Writing a Demuxer">
+  <sect1 id="section-oneton-capsnego" xreflabel="Demuxer Caps Negotiation">
-    <title>Writing a Demuxer</title>
+    <title>Demuxer Caps Negotiation</title>
    <para>
-      WRITEME
+      Demuxers will usually contain several elementary streams, and each
      of those streams' properties will be defined in a stream header at
      the start of the file (or, rather, stream) that you're parsing.
      Since those are fixed and there is no possibility to negotiate
      stream properties with elements earlier in the pipeline, you should
      always use explicit caps on demuxer source pads. This prevents a
      whole lot of caps negotiation or re-negotiation errors.
    </para>
  </sect1>
  <sect1 id="section-oneton-data" xreflabel="Data processing and downstream events">
    <title>Data processing and downstream events</title>
    <para>
      Data parsing, pulling this into subbuffers and sending that to the
      source pads of the elementary streams is the one single most
      important task of demuxers and parsers. Usually, an element will
      have a <function>_loop ()</function> function using the
      <classname>bytestream</classname> object to read data. Try to have
      a single point of data reading from the bytestream object. In this
      single point, do <emphasis>proper</emphasis> event handling (in
      case there is any) and <emphasis>proper</emphasis> error handling
      in case that's needed. Make your element as fault-tolerant as
      possible, but do not go further than possible.
    </para>
  </sect1>
  <sect1 id="section-oneton-parsing" xreflabel="Parsing versus interpreting">
    <title>Parsing versus interpreting</title>
    <para>
      One particular convention that &GStreamer; demuxers follow is that
      of separation of parsing and interpreting. The reason for this is
      maintainability, clarity and code reuse. An easy example of this
      is something like RIFF, which has a chunk header of 4 bytes, then
      a length indicator of 4 bytes and then the actual data. We write
      special functions to read one chunk, to peek a chunk ID and all
      those; that's the <emphasis>parsing</emphasis> part of the demuxer.
      Then, somewhere else, we like to write the main data processing
      function, which calls this parse function, reads one chunk and
      then does with the data whatever it needs to do.
    </para>
    <para>
      Some example code for RIFF-reading to illustrate the above two points:
    </para>
    <programlisting>
 static gboolean
 gst_my_demuxer_peek (GstMyDemuxer *demux,
 		     guint32      *id,
 		     guint32      *size)
 {
  guint8 *data;
  while (gst_bytestream_peek_bytes (demux->bs, &amp;data, 4) != 4) {
    guint32 remaining;
    GstEvent *event;
    gst_bytestream_get_status (demux->bs, &amp;remaining, &amp;event);
    if (event) {
      GstEventType type = GST_EVENT_TYPE (event);
      /* or maybe custom event handling, up to you - we lose reference! */
      gst_pad_event_default (demux->sinkpad, event);
      if (type == GST_EVENT_EOS)
        return FALSE;
    } else {
      GST_ELEMENT_ERROR (demux, STREAM, READ, (NULL), (NULL));
      return FALSE;
    }
  }
  *id = GUINT32_FROM_LE (((guint32 *) data)[0]);
  *size = GUINT32_FROM_LE (((guint32 *) data)[0]);
  return TRUE;
 }
 static void
 gst_my_demuxer_loop (GstElement *element)
 {
  GstMyDemuxer *demux = GST_MY_DEMUXER (element);
  guint32 id, size;
  if (!gst_my_demuxer_peek (demux, &amp;id, &amp;size))
    return;
  switch (id) {
    [.. normal chunk handling ..]
  }
 }
    </programlisting>
    <para>
      Reason for this is that event handling is now centralized in one
      place and the <function>_loop ()</function> function is a lot
      cleaner and more readable. Those are common code practices, but
      since the mistake of <emphasis>not</emphasis> using such common
      code practices has been made too often, we explicitely mention
      this here.
    </para>
  </sect1>
  <sect1 id="section-oneton-seeking" xreflabel="Simple seeking and indexes">
    <title>Simple seeking and indexes</title>
    <para>
      Sources will generally receive a seek event in the exact supported
      format by the element. Demuxers, however, can not seek in
      themselves directly, but need to convert from one unit (e.g.
      time) to the other (e.g. bytes) and send a new event to its sink
      pad. Given this, the <function>_convert ()</function>-function (or,
      more general: unit conversion) is the most important function in a
      demuxer. Some demuxers (AVI, Matroska) and parsers will keep an
      index of all chunks in a stream, firstly to improve seeking
      precision and secondly so they won't lose sync. Some other demuxers
      will seek the stream directly without index (e.g. MPEG, Ogg) -
      usually based on something like a cumulative bitrate - and then
      find the closest next chunk from their new position. The best
      choice depends on the format.
    </para>
    <para>
      Note that it is recommended for demuxers to implement event,
      conversion and query handling functions (using time units or so),
      in addition to the ones (usually in byte units) provided by the
      pipeline source element.
    </para>
  </sect1>
 </chapter>