docs/pwg/advanced-types.xml: Document typefinding.

Original commit message from CVS: * docs/pwg/advanced-types.xml: Document typefinding. * docs/pwg/other-oneton.xml: Document one-to-n elements, demuxers and parsers.
2025-02-16 19:25:18 +00:00 · 2004-03-26 01:53:06 +00:00 · 2004-03-26 01:53:06 +00:00 · 4d9b9b989e
commit 4d9b9b989e
parent 5710f325b3
3 changed files with 208 additions and 7 deletions
--- a/7
+++ b/7
@ -1,3 +1,10 @@
+2004-03-25  Ronald Bultje  <rbultje@ronald.bitfreak.net>
+
+	* docs/pwg/advanced-types.xml:
+	  Document typefinding.
+	* docs/pwg/other-oneton.xml:
+	  Document one-to-n elements, demuxers and parsers.
+
 2004-03-25  Tim-Philipp Müller <t.i.m@zen.co.uk>

 	reviewed by: David Schleef  <ds@schleef.org>
--- a/docs/pwg/advanced-types.xml
+++ b/docs/pwg/advanced-types.xml
@ -86,7 +86,66 @@
  <sect1 id="section-types-typefind" xreflabel="Typefind Functions and Autoplugging">
    <title>Typefind Functions and Autoplugging</title>
    <para>
-      WRITEME
+      With only <emphasis>defining</emphasis> the types, we're not yet there.
+      In order for a random data file to be recognized and played back as
+      such, we need a way of recognizing their type out of the blue. For this
+      purpose, <quote>typefinding</quote> was introduced. Typefinding is the
+      process of detecting the type of a datastream. Typefinding consists of
+      two separate parts: first, there's an unlimited number of functions
+      that we call <emphasis>typefind functions</emphasis>, which are each
+      able to recognize one or more types from an input stream. Then,
+      secondly, there's a small engine which registers and calls each of
+      those functions. This is the typefind core. On top of this typefind
+      core, you would normally write an autoplugger, which is able to use
+      this type detection system to dynamically build a pipeline around an
+      input stream. Here, we will focus only on typefind functions.
+    </para>
+    <para>
+      A typefind function ususally lives in
+      <filename>gst-plugins/gst/typefind/gsttypefindfunctions.c</filename>,
+      unless there's a good reason (like library dependencies) to put it
+      elsewhere. The reason for this centralization is to decreate the
+      number of plugins that need to be loaded in order to detect a stream's
+      type. Below is an example that will recognize AVI files, which start
+      with a <quote>RIFF</quote> tag, then the size of the file and then an
+      <quote>AVI </quote> tag:
+    </para>
+    <programlisting>
+static void
+gst_my_typefind_function (GstTypeFind *tf,
+			  gpointer     data)
+{
+  guint8 *data = gst_type_find_peek (tf, 0, 12);
+
+  if (data &amp;&amp;
+      GUINT32_FROM_LE ((guint32 *) data)[0] == GST_MAKE_FOURCC ('R','I','F','F') &amp;&amp;
+      GUINT32_FROM_LE ((guint32 *) data)[2] == GST_MAKE_FOURCC ('A','V','I',' ')) {
+    gst_type_find_suggest (tf, GST_TYPE_FIND_MAXIMUM,
+			   gst_caps_new_simple ("video/x-msvideo", NULL));
+  }
+}
+
+static gboolean
+plugin_init (GstPlugin *plugin)
+{
+  static gchar *exts[] = { "avi", NULL };
+  if (!gst_type_find_register (plugin, "", GST_RANK_PRIMARY,
+			       gst_my_typefind_function, exts,
+			       gst_caps_new_simple ("video/x-msvideo",
+						    NULL), NULL))
+    return FALSE;
+}
+    </programlisting>
+    <para>
+      Note that
+      <filename>gst-plugins/gst/typefind/gsttypefindfunctions.c</filename>
+      has some simplification macros to decrease the amount of code. Make
+      good use of those if you want to submit typefinding patches with new
+      typefind functions.
+    </para>
+    <para>
+      Autoplugging will be discussed in great detail in the chapter called
+      <xref linkend="chapter-other-autoplugger"/>.
    </para>
  </sect1>

--- a/docs/pwg/other-oneton.xml
+++ b/docs/pwg/other-oneton.xml
@ -1,16 +1,151 @@

 <!-- ############ chapter ############# -->

-<chapter id="chapter-other-oneton" xreflabel="Writing a 1-to-N Element">
-  <title>Writing a 1-to-N element</title>
+<chapter id="chapter-other-oneton" xreflabel="Writing a 1-to-N Element, Demuxer or Parser">
+  <title>Writing a 1-to-N Element, Demuxer or Parser</title>
  <para>
-    FIXME: write.
+    1-to-N elements don't have much special needs or requirements that
+    haven't been discussed already. The most important thing to take care
+    of in 1-to-N elements (things like <classname>tee</classname>-elements
+    or so) is to use proper buffer refcounting and caps negotiation. If
+    those two are taken care of (see the <classname>tee</classname> element
+    if you need example code), there's little that can go wrong.
+  </para>
+  <para>
+    Demuxers are the 1-to-N elements that need very special care, though.
+    They are responsible for timestamping raw, unparsed data into
+    elementary video or audio streams, and there are many things that you
+    can optimize or do wrong. Here, several culprits will be mentioned
+    and common solutions will be offered. Parsers are demuxers with only
+    one source pad. Also, they only cut the stream into buffers, they
+    don't touch the data otherwise.
  </para>

-  <sect1 id="section-other-demuxer" xreflabel="Writing a Demuxer">
-    <title>Writing a Demuxer</title>
+  <sect1 id="section-oneton-capsnego" xreflabel="Demuxer Caps Negotiation">
+    <title>Demuxer Caps Negotiation</title>
    <para>
-      WRITEME
+      Demuxers will usually contain several elementary streams, and each
+      of those streams' properties will be defined in a stream header at
+      the start of the file (or, rather, stream) that you're parsing.
+      Since those are fixed and there is no possibility to negotiate
+      stream properties with elements earlier in the pipeline, you should
+      always use explicit caps on demuxer source pads. This prevents a
+      whole lot of caps negotiation or re-negotiation errors.
+    </para>
+  </sect1>
+
+  <sect1 id="section-oneton-data" xreflabel="Data processing and downstream events">
+    <title>Data processing and downstream events</title>
+    <para>
+      Data parsing, pulling this into subbuffers and sending that to the
+      source pads of the elementary streams is the one single most
+      important task of demuxers and parsers. Usually, an element will
+      have a <function>_loop ()</function> function using the
+      <classname>bytestream</classname> object to read data. Try to have
+      a single point of data reading from the bytestream object. In this
+      single point, do <emphasis>proper</emphasis> event handling (in
+      case there is any) and <emphasis>proper</emphasis> error handling
+      in case that's needed. Make your element as fault-tolerant as
+      possible, but do not go further than possible.
+    </para>
+  </sect1>
+
+  <sect1 id="section-oneton-parsing" xreflabel="Parsing versus interpreting">
+    <title>Parsing versus interpreting</title>
+    <para>
+      One particular convention that &GStreamer; demuxers follow is that
+      of separation of parsing and interpreting. The reason for this is
+      maintainability, clarity and code reuse. An easy example of this
+      is something like RIFF, which has a chunk header of 4 bytes, then
+      a length indicator of 4 bytes and then the actual data. We write
+      special functions to read one chunk, to peek a chunk ID and all
+      those; that's the <emphasis>parsing</emphasis> part of the demuxer.
+      Then, somewhere else, we like to write the main data processing
+      function, which calls this parse function, reads one chunk and
+      then does with the data whatever it needs to do.
+    </para>
+    <para>
+      Some example code for RIFF-reading to illustrate the above two points:
+    </para>
+    <programlisting>
+static gboolean
+gst_my_demuxer_peek (GstMyDemuxer *demux,
+		     guint32      *id,
+		     guint32      *size)
+{
+  guint8 *data;
+
+  while (gst_bytestream_peek_bytes (demux->bs, &amp;data, 4) != 4) {
+    guint32 remaining;
+    GstEvent *event;
+
+    gst_bytestream_get_status (demux->bs, &amp;remaining, &amp;event);
+    if (event) {
+      GstEventType type = GST_EVENT_TYPE (event);
+
+      /* or maybe custom event handling, up to you - we lose reference! */
+      gst_pad_event_default (demux->sinkpad, event);
+
+      if (type == GST_EVENT_EOS)
+        return FALSE;
+    } else {
+      GST_ELEMENT_ERROR (demux, STREAM, READ, (NULL), (NULL));
+      return FALSE;
+    }
+  }
+
+  *id = GUINT32_FROM_LE (((guint32 *) data)[0]);
+  *size = GUINT32_FROM_LE (((guint32 *) data)[0]);
+
+  return TRUE;
+}
+
+static void
+gst_my_demuxer_loop (GstElement *element)
+{
+  GstMyDemuxer *demux = GST_MY_DEMUXER (element);
+  guint32 id, size;
+
+  if (!gst_my_demuxer_peek (demux, &amp;id, &amp;size))
+    return;
+
+  switch (id) {
+    [.. normal chunk handling ..]
+  }
+}
+    </programlisting>
+    <para>
+      Reason for this is that event handling is now centralized in one
+      place and the <function>_loop ()</function> function is a lot
+      cleaner and more readable. Those are common code practices, but
+      since the mistake of <emphasis>not</emphasis> using such common
+      code practices has been made too often, we explicitely mention
+      this here.
+    </para>
+  </sect1>
+
+  <sect1 id="section-oneton-seeking" xreflabel="Simple seeking and indexes">
+    <title>Simple seeking and indexes</title>
+    <para>
+      Sources will generally receive a seek event in the exact supported
+      format by the element. Demuxers, however, can not seek in
+      themselves directly, but need to convert from one unit (e.g.
+      time) to the other (e.g. bytes) and send a new event to its sink
+      pad. Given this, the <function>_convert ()</function>-function (or,
+      more general: unit conversion) is the most important function in a
+      demuxer. Some demuxers (AVI, Matroska) and parsers will keep an
+      index of all chunks in a stream, firstly to improve seeking
+      precision and secondly so they won't lose sync. Some other demuxers
+      will seek the stream directly without index (e.g. MPEG, Ogg) -
+      usually based on something like a cumulative bitrate - and then
+      find the closest next chunk from their new position. The best
+      choice depends on the format.
+    </para>
+    <para>
+      Note that it is recommended for demuxers to implement event,
+      conversion and query handling functions (using time units or so),
+      in addition to the ones (usually in byte units) provided by the
+      pipeline source element.
    </para>
  </sect1>
 </chapter>