gstreamer/subprojects/gst-docs/markdown/additional/design/rtp.md

12 KiB

RTP

These design docs detail some of the lower-level mechanism of certain parts of GStreamer's RTP stack. For a higher-level overview see the RTP and RTSP support section.

RTP auxiliary stream design

Auxiliary elements

There are two kind of auxiliary elements, sender and receiver. Let's call them rtpauxsend and rtpauxreceive.

rtpauxsend has always one sink pad and can have unlimited requested src pads. If only src pad then it works in SSRC-multiplexed mode, if several src pads then it works in session multiplexed mode.

rtpauxreceive has always one ssrc pad and can have unlimited requested sink pads. If only one sink pad then it works in SSRC-multiplexed mode, if several sink pads then it works in session multiplexed mode.

Rtpbin and auxiliary elements

Basic mechanism

rtpbin knows for which session ids the given auxiliary element belong to. It's done through "set-aux-send", for rtpauxsend kind, and through "set-aux-receive" for rtpauxreceive kind. You can call those signals as much as needed for each auxiliary element. So for aux elements that work in SSRC-multiplexed mode this signal action is called only one time.

The user has to call those action signals before to request the differents rtpbin pads. rtpbin is in charge to link those auxiliary elements with the sessions, and on receiver side, rtpbin has also to handle the link with ssrcdemux.

rtpbin never knows if the given rtpauxsend is actually a rtprtxsend element or another aux element. rtpbin never knows if the given rtpauxreceive is actually a rtprtxreceive element or another aux element. rtpbin has to be kept generic so that more aux elements can be added later without changing rtpbin.

It's currently not possible to use rtpbin with auxiliary stream from gst-launch. We can discuss about having the ability for rtpbin to instanciate itself the special aux elements rtprtxsend and rtprtxreceive but they need to be configured ("payload-type" and "payload-types" properties) to make retransmission work. So having several rtprtxsend and rtprtxreceive in a rtpbin would require a lot of properties to manage them form rtpbin. And for each auxiliary elements.

If you want to use rtprtxreceive and rtprtpsend from gst-launch you have to use rtpsession, ssrcdemux and rtpjitterbuffer elements yourself. See gtk-doc of rtprtxreceive for an example.

Requesting the rtpbin's pads on the pipeline receiver side

If rtpauxreceive is set for session, i, j, k then it has to call rtpbin::"set-aux-receive" 3 times giving those ids and this aux element. It has to be done before requesting the recv_rtp_sink_i, recv_rtp_sink_j, recv_rtp_sink_k. For a concrete case rtprtxreceive, if the user wants it for session i, then it has to call rtpbin::"set-aux-receive" one time giving i and this aux element. Then the user can request recv_rtp_sink_i pad.

Calling rtpbin::"set-aux-receive" does not create the session. It add the given session id and aux element to a hashtable(key:session id, value: aux element). Then when the user ask for rtpbin.recv_rtp_sink_i, rtpbin lookup if there is an aux element for this i session id. If yes it requests a sink pad to this aux element and links it with the recv_rtp_src pad of the new gstrtpsession. rtpbin also checks that this aux element is connected only one time to ssrcdemux. Because rtpauxreceive has only one source pad. Each call to request rtpbin.recv_rtp_sink_k will also creates rtpbin.recv_rtp_src_k_ssrc_pt as usual. So that the user have it when then it requests rtpbin. (from gst-launch) or using on_rtpbinreceive_pad_added callback from an application.

Requesting the rtpbin's pads on the pipeline sender side

For the sender this is similar but a bit more complicated to implement. When the user asks for rtpbin.send_rtp_sink_i, rtpbin will lookup in its second map (key:session id, value: aux send element). If there is one aux element, then it will set the sink pad of this aux sender element to be the ghost pad rtpbin.send_rtp_sink_i that the user asked. rtpbin will also request a src pad of this aux element to connect it to gstrtpsession_i. It will automatically create rtpbin.send_rtp_src_i the usuall way. Then if the user asks rtpbin.send_rtp_src_k, then rtpbin will also lookup in that map and request another source pad of the aux element and connect it to the new gstrtpsession_k.

RTP collision design

GstRTPCollision

Custon upstream event which contains the ssrc marked as collided.

This event is generated on both pipeline sender and receiver side by the gstrtpsession element when it detects a conflict between ssrc. (same session id and same ssrc)

It's an upstream event so that means this event is for now only useful on pipeline sender side. Because elements generating packets with the collided SSRC are placed upstream from the gstrtpsession.

rtppayloader

When handling a GstRTPCollision event, the rtppayloader has to choose another ssrc.

BYE only the corresponding source, not the whole session.

When a collision happens for the given ssrc, the associated source is marked bye. But we make sure that the whole session is not itself set bye. Because internally, gstrtpsession can manages several sources and all have their own distinct ssrc.

RTP retransmission design

GstRTPRetransmissionRequest

Custom upstream event which mainly contains the ssrc and the seqnum of the packet which is asked to be retransmitted.

On the pipeline receiver side this event is generated by the gstrtpjitterbuffer element. Then it is translated to a NACK to be sent over the network.

On the pipeline sender side, this event is generated by the gstrtpsession element when it receives a NACK from the network.

rtprtxsend element

Basic mechanism

rtprtxsend keeps a history of rtp packets that it has already sent. When it receives the event GstRTPRetransmissionRequest from the downstream gstrtpsession element, it looks up the requested seqnum in its stored packets. If the packet is present in its history, it will create an RTX packet according to RFC 4588. Then this rtx packet is pushed to its src pad like other packets.

rtprtxsend works in SSRC-multiplexed mode, so it always has one sink and src pad.

Building retransmission packet from original packet

An rtx packet is mostly the same as an orignal packet, except it has its own ssrc and its own seqnum. That's why rtprtxsend works in SSRC-multiplexed mode. It also means that the same session is used. Another difference between an rtx packet and its original is that it inserts the original seqnum (OSN: 2 bytes) at the beginning of the payload. Also rtprtxsend builds rtx packet without padding, to let other elements do that. The last difference is the payload type. For now the user has to set it through the rtx-payload-type property. Later it will automatically retreive this information from SDP. See fmtp field as specified in RFC 4588 (a=fmtp:99 apt=98): fmtp is the payload type of the retransmission stream and apt the payload type of its associated master stream.

Retransmission ssrc and seqnum

To choose rtx_ssrc it randomly selects a number between 0 and 2^32-1 until it is different from master_ssrc. rtx_seqnum is randomly selected between 0 and 2^16-1.

Deeper in the stored buffer history

For the history it uses a GSequence with 2^15-1 as its maximum size. Which is resonable as the default value is 100. It contains the packets in reverse order they have been sent (head:newest, tail:oldest). GSequence allows to add and remove an element in constant time (like a queue). Also GSequence allows to do a binary search when rtprtxsend does a lookup in its history. It's important if it receives a lot of requests or if the history is large.

Pending rtx packets

When looking up in its history, if seqnum is found then it pushes the buffer into a GQueue to its tail. Before sending the current master stream packet, rtprtxsend sends all the buffers which are in this GQueue, taking care of converting them to rtx packets. This way, rtx packets are sent in the same order they have been requested. (g_list_foreach traverses the queue from head to tail) The GQueue is cleared between sending 2 master stream packets. So when this GQueue contains more than one element, it means that rtprtxsend had received more than one rtx request between sending 2 master packets.

Collision

When handling a GstRTPCollision event, if the ssrc is its rtx ssrc then rtprtxsend clears its history and its pending retransmission queue. Then it chooses a rtx_ssrc until it's different than master ssrc. If the GstRTPCollision event does not contain its rtx ssrc, for example its master ssrc or other, then it just forwards the event upstream, so that it can be handled by the rtppayloader.

Rtprtxreceive element

Basic mechanism

The same rtprtxreceive instance can receive several master streams and several retransmission streams. So it will try to dynamically associate an rtx ssrc with its master ssrc, so that it can reconstruct the original from the proper rtx packet.

The algorithm is based on the fact that seqnums of different streams (considering all master and all rtx streams) evolve at a different rate. It means that the initial seqnum is random for each one and the offset could also be different. So that they are statistically all different at a given time. If bad luck then the association is delayed to the next rtx request.

The algorithm also needs to know if a given packet is an rtx packet or not. To know this information there is the rtx-payload-types property. For now the user has to configure it but later it will automatically retreive this information from SDP. It needs to know if the current packet is rtx or not in order to know if it can extract the OSN from the payload. Otherwise it would extract the OSN even on master streams which means nothing and so it could do bad things. In theory maybe it could work but we have this information in SDP so why not use it to avoid bad associations.

Note that it also means that several master streams can have the same payload type. And also several rtx streams can have the same payload type. So the information from SDP which gives us which rtx payload type belongs to a given master payload type is not enough to do the association between rtx ssrc and master ssrc.

rtprtxreceive works in SSRC-multiplexed mode, so it always has one sink and src pad.

Deeper in the association algorithm

When it receives a GstRTPRetransmissionRequest event it will remember the ssrc and the seqnum from this request.

On incoming packets, if the packet has its ssrc already associated then it knows if the ssrc is an rtx ssrc or a master stream ssrc. If this is a rtx packet then it recontructs the original and pushes the result to the src pad as if it was a master packet.

If the ssrc is not yet associated rtprtxreceive checks the payload type. if the packet has its payload type marked as rtx then it will extract the OSN (original seqnum number) and lookup in its stored requests if a seqnum matches. If found, then it associates the current ssrc to the master ssrc marked in the request. If not found it just drops the packet. Then it removes the request from the stored requests.

If there are 2 requests with the same seqnum and different ssrc, then the couple seqnum,ssrc is removed from the stored requests. A stored request actually means that actually the couple seqnum,ssrc is stored. If it happens the request is dropped but it avoids to do bad associations. In this case the association is just delayed to the next request.

Building original packet from rtx packet

Header, extensions, payload and padding are mostly the same. Except that the OSN is removed from the payload. Then ssrc, seqnum, and original payload type are correctly set. Original payload type is actually also stored when the rtx request is handled.