docs/pwg/advanced-types.xml: Document typefinding.

Original commit message from CVS:
* docs/pwg/advanced-types.xml:
Document typefinding.
* docs/pwg/other-oneton.xml:
Document one-to-n elements, demuxers and parsers.
This commit is contained in:
Ronald S. Bultje 2004-03-26 01:53:06 +00:00
parent 5710f325b3
commit 4d9b9b989e
3 changed files with 208 additions and 7 deletions

View file

@ -1,3 +1,10 @@
2004-03-25 Ronald Bultje <rbultje@ronald.bitfreak.net>
* docs/pwg/advanced-types.xml:
Document typefinding.
* docs/pwg/other-oneton.xml:
Document one-to-n elements, demuxers and parsers.
2004-03-25 Tim-Philipp Müller <t.i.m@zen.co.uk> 2004-03-25 Tim-Philipp Müller <t.i.m@zen.co.uk>
reviewed by: David Schleef <ds@schleef.org> reviewed by: David Schleef <ds@schleef.org>

View file

@ -86,7 +86,66 @@
<sect1 id="section-types-typefind" xreflabel="Typefind Functions and Autoplugging"> <sect1 id="section-types-typefind" xreflabel="Typefind Functions and Autoplugging">
<title>Typefind Functions and Autoplugging</title> <title>Typefind Functions and Autoplugging</title>
<para> <para>
WRITEME With only <emphasis>defining</emphasis> the types, we're not yet there.
In order for a random data file to be recognized and played back as
such, we need a way of recognizing their type out of the blue. For this
purpose, <quote>typefinding</quote> was introduced. Typefinding is the
process of detecting the type of a datastream. Typefinding consists of
two separate parts: first, there's an unlimited number of functions
that we call <emphasis>typefind functions</emphasis>, which are each
able to recognize one or more types from an input stream. Then,
secondly, there's a small engine which registers and calls each of
those functions. This is the typefind core. On top of this typefind
core, you would normally write an autoplugger, which is able to use
this type detection system to dynamically build a pipeline around an
input stream. Here, we will focus only on typefind functions.
</para>
<para>
A typefind function ususally lives in
<filename>gst-plugins/gst/typefind/gsttypefindfunctions.c</filename>,
unless there's a good reason (like library dependencies) to put it
elsewhere. The reason for this centralization is to decreate the
number of plugins that need to be loaded in order to detect a stream's
type. Below is an example that will recognize AVI files, which start
with a <quote>RIFF</quote> tag, then the size of the file and then an
<quote>AVI </quote> tag:
</para>
<programlisting>
static void
gst_my_typefind_function (GstTypeFind *tf,
gpointer data)
{
guint8 *data = gst_type_find_peek (tf, 0, 12);
if (data &amp;&amp;
GUINT32_FROM_LE ((guint32 *) data)[0] == GST_MAKE_FOURCC ('R','I','F','F') &amp;&amp;
GUINT32_FROM_LE ((guint32 *) data)[2] == GST_MAKE_FOURCC ('A','V','I',' ')) {
gst_type_find_suggest (tf, GST_TYPE_FIND_MAXIMUM,
gst_caps_new_simple ("video/x-msvideo", NULL));
}
}
static gboolean
plugin_init (GstPlugin *plugin)
{
static gchar *exts[] = { "avi", NULL };
if (!gst_type_find_register (plugin, "", GST_RANK_PRIMARY,
gst_my_typefind_function, exts,
gst_caps_new_simple ("video/x-msvideo",
NULL), NULL))
return FALSE;
}
</programlisting>
<para>
Note that
<filename>gst-plugins/gst/typefind/gsttypefindfunctions.c</filename>
has some simplification macros to decrease the amount of code. Make
good use of those if you want to submit typefinding patches with new
typefind functions.
</para>
<para>
Autoplugging will be discussed in great detail in the chapter called
<xref linkend="chapter-other-autoplugger"/>.
</para> </para>
</sect1> </sect1>

View file

@ -1,16 +1,151 @@
<!-- ############ chapter ############# --> <!-- ############ chapter ############# -->
<chapter id="chapter-other-oneton" xreflabel="Writing a 1-to-N Element"> <chapter id="chapter-other-oneton" xreflabel="Writing a 1-to-N Element, Demuxer or Parser">
<title>Writing a 1-to-N element</title> <title>Writing a 1-to-N Element, Demuxer or Parser</title>
<para> <para>
FIXME: write. 1-to-N elements don't have much special needs or requirements that
haven't been discussed already. The most important thing to take care
of in 1-to-N elements (things like <classname>tee</classname>-elements
or so) is to use proper buffer refcounting and caps negotiation. If
those two are taken care of (see the <classname>tee</classname> element
if you need example code), there's little that can go wrong.
</para>
<para>
Demuxers are the 1-to-N elements that need very special care, though.
They are responsible for timestamping raw, unparsed data into
elementary video or audio streams, and there are many things that you
can optimize or do wrong. Here, several culprits will be mentioned
and common solutions will be offered. Parsers are demuxers with only
one source pad. Also, they only cut the stream into buffers, they
don't touch the data otherwise.
</para> </para>
<sect1 id="section-other-demuxer" xreflabel="Writing a Demuxer"> <sect1 id="section-oneton-capsnego" xreflabel="Demuxer Caps Negotiation">
<title>Writing a Demuxer</title> <title>Demuxer Caps Negotiation</title>
<para> <para>
WRITEME Demuxers will usually contain several elementary streams, and each
of those streams' properties will be defined in a stream header at
the start of the file (or, rather, stream) that you're parsing.
Since those are fixed and there is no possibility to negotiate
stream properties with elements earlier in the pipeline, you should
always use explicit caps on demuxer source pads. This prevents a
whole lot of caps negotiation or re-negotiation errors.
</para>
</sect1>
<sect1 id="section-oneton-data" xreflabel="Data processing and downstream events">
<title>Data processing and downstream events</title>
<para>
Data parsing, pulling this into subbuffers and sending that to the
source pads of the elementary streams is the one single most
important task of demuxers and parsers. Usually, an element will
have a <function>_loop ()</function> function using the
<classname>bytestream</classname> object to read data. Try to have
a single point of data reading from the bytestream object. In this
single point, do <emphasis>proper</emphasis> event handling (in
case there is any) and <emphasis>proper</emphasis> error handling
in case that's needed. Make your element as fault-tolerant as
possible, but do not go further than possible.
</para>
</sect1>
<sect1 id="section-oneton-parsing" xreflabel="Parsing versus interpreting">
<title>Parsing versus interpreting</title>
<para>
One particular convention that &GStreamer; demuxers follow is that
of separation of parsing and interpreting. The reason for this is
maintainability, clarity and code reuse. An easy example of this
is something like RIFF, which has a chunk header of 4 bytes, then
a length indicator of 4 bytes and then the actual data. We write
special functions to read one chunk, to peek a chunk ID and all
those; that's the <emphasis>parsing</emphasis> part of the demuxer.
Then, somewhere else, we like to write the main data processing
function, which calls this parse function, reads one chunk and
then does with the data whatever it needs to do.
</para>
<para>
Some example code for RIFF-reading to illustrate the above two points:
</para>
<programlisting>
static gboolean
gst_my_demuxer_peek (GstMyDemuxer *demux,
guint32 *id,
guint32 *size)
{
guint8 *data;
while (gst_bytestream_peek_bytes (demux->bs, &amp;data, 4) != 4) {
guint32 remaining;
GstEvent *event;
gst_bytestream_get_status (demux->bs, &amp;remaining, &amp;event);
if (event) {
GstEventType type = GST_EVENT_TYPE (event);
/* or maybe custom event handling, up to you - we lose reference! */
gst_pad_event_default (demux->sinkpad, event);
if (type == GST_EVENT_EOS)
return FALSE;
} else {
GST_ELEMENT_ERROR (demux, STREAM, READ, (NULL), (NULL));
return FALSE;
}
}
*id = GUINT32_FROM_LE (((guint32 *) data)[0]);
*size = GUINT32_FROM_LE (((guint32 *) data)[0]);
return TRUE;
}
static void
gst_my_demuxer_loop (GstElement *element)
{
GstMyDemuxer *demux = GST_MY_DEMUXER (element);
guint32 id, size;
if (!gst_my_demuxer_peek (demux, &amp;id, &amp;size))
return;
switch (id) {
[.. normal chunk handling ..]
}
}
</programlisting>
<para>
Reason for this is that event handling is now centralized in one
place and the <function>_loop ()</function> function is a lot
cleaner and more readable. Those are common code practices, but
since the mistake of <emphasis>not</emphasis> using such common
code practices has been made too often, we explicitely mention
this here.
</para>
</sect1>
<sect1 id="section-oneton-seeking" xreflabel="Simple seeking and indexes">
<title>Simple seeking and indexes</title>
<para>
Sources will generally receive a seek event in the exact supported
format by the element. Demuxers, however, can not seek in
themselves directly, but need to convert from one unit (e.g.
time) to the other (e.g. bytes) and send a new event to its sink
pad. Given this, the <function>_convert ()</function>-function (or,
more general: unit conversion) is the most important function in a
demuxer. Some demuxers (AVI, Matroska) and parsers will keep an
index of all chunks in a stream, firstly to improve seeking
precision and secondly so they won't lose sync. Some other demuxers
will seek the stream directly without index (e.g. MPEG, Ogg) -
usually based on something like a cumulative bitrate - and then
find the closest next chunk from their new position. The best
choice depends on the format.
</para>
<para>
Note that it is recommended for demuxers to implement event,
conversion and query handling functions (using time units or so),
in addition to the ones (usually in byte units) provided by the
pipeline source element.
</para> </para>
</sect1> </sect1>
</chapter> </chapter>