gstreamer

mirror of https://gitlab.freedesktop.org/gstreamer/gstreamer.git synced 2025-01-13 19:05:37 +00:00

Author	SHA1	Message	Date
Seungha Yang	b4efdeba11	nvdec: Don't hardcode DPB size Too many decode surface would waste GPU memory. Also it seems to be introducing additional latency depending on stream. Since nvcodec sdk version 9.0, CUVID parser API has been providing the minimum required number of surface. By using it, we can save GPU memory and reduce possible latency.	2020-04-09 16:30:58 +09:00
Seungha Yang	bd706edc52	nvh265enc: Update for video-hdr struct change See the change of -base https://gitlab.freedesktop.org/gstreamer/gst-plugins-base/-/merge_requests/594	2020-04-01 05:18:11 +00:00
Seungha Yang	22eab50907	nvdec: Add fallback for CUDA/OpenGL interop failure It happens when local OpenGL context belongs to non-nvidia GPU.	2020-03-19 13:58:09 +09:00
Nirbheek Chauhan	266dc41596	nvcodec: Mark class data as may-be-leaked to quiet the leaks tracer The class data with the caps in it will be leaked if the element is registered but never instantiated. There is no way around this. Mark the caps as such so that the leaks tracer does not warn about it. This is the same as pad template caps getting leaked, which are also marked as may-be-leaked. These objects are initialized exactly once, and are 'global' data.	2020-02-12 00:00:51 +05:30
Nirbheek Chauhan	3ca87d9988	nvcodec: Fix crash in decoder on 32-bit Windows Same fix as `1a7ea45ffd`, but I didn't test the decoder so I missed that the function pointers here weren't using the correct calling convention too.	2020-02-06 13:39:52 +00:00
Sebastian Dröge	5b8ff98f96	nvdec: Don't leak template caps when registering elements with old NVIDIA driver	2020-02-05 09:49:20 +00:00
Seungha Yang	3f4a84bd32	nvenc: Query maximum supported API version We've been using NvEncodeAPICreateInstance method to find the supported API version, but that seems to be insufficient since there is a case where plugin failed in opening encoding session even if NvEncodeAPICreateInstance succeeded. Asking driver about the version would be the most certain way.	2020-02-03 14:15:28 +00:00
Nicolas Dufresne	d393232bc2	nvdec: Do not map GStreamer discont to CUVid discont Setting the CUVID_PKT_DISCONTINUITY implies clearing any past information about the stream in the decoder. The GStreamer discont flag is used for discontinuity caused by a seek, for first buffer and if a buffer was dropped. In the first two cases, the parsers and demuxers should ensure we start from a synchronization point, so it's unlikely that delta will be matched against the wrong state. For packet lost, the discontinuity flag will prevent the decoder from doing any concealment, with a result that ca be much worst visually, or freeze the playback until an IDR is met. It's better to let the decoder handle that for us. Removing this flag, also workaround a but in NVidia parser that makes it ignore our ENDOFFRAME flag and increase the latency by one frame.	2020-01-25 13:39:03 +00:00
Nicolas Dufresne	a28ce16b3f	nvdec: Tell the parser we have complete pictures This sets the CUVID_PKT_ENDOFPICTURE flag in order to inform the decoder that we have a complete picture. This should remove one frame latency otherwise introduce by NVidia parser.	2020-01-25 13:39:03 +00:00
Seungha Yang	a10f26aa3a	nvenc: Do not access to broken encode session If an encode session failed in initializing, the encode session would be broken and the next nvenc API will cause crash. Fixes: https://gitlab.freedesktop.org/gstreamer/gst-plugins-bad/issues/1179	2020-01-21 16:34:41 +09:00
Nirbheek Chauhan	bda687344b	nvcodec: Print debug info when initializing nvenc We weren't printing the return value.	2020-01-20 17:15:55 +05:30
Nirbheek Chauhan	1a7ea45ffd	nvcodec: Fix crash on 32-bit Windows We weren't using the correct calling convention when calling CUDA and CUVID APIs. `CUDAAPI` is `__stdcall` on Windows. This was working fine on x64 because `__stdcall` is ignored and there's no special calling convention. However, on x86, we need to use `__stdcall`.	2020-01-20 17:15:55 +05:30
Nirbheek Chauhan	7e93ae0638	nvcodec: cuda.h only needs glib.h, not gst.h Just a nitpick. Also, force the compiler to use our stub header instead of searching for it in the include paths.	2020-01-20 15:10:51 +05:30
Seungha Yang	e8d527df93	nvenc: Query supported minimum resolution Hard-coded 16x16 resolution is likely to differ from the device's support in most cases. If we can use NV_ENC_CAPS_WIDTH_MIN and NV_ENC_CAPS_HEIGHT_MIN, update pad template with returned value.	2020-01-16 15:24:03 +00:00
Seungha Yang	58a663c1e5	nvcodec: Bump SDK header to version 9.1 Update header to query minimum resolution of encoder and to control the number of reference frame if it's supported	2020-01-16 15:24:03 +00:00
Seungha Yang	49bccf0433	nvcodec: Refactor plugin initialization Create CUDA context per device, instead of per codec and encoder/decoder. Allocating CUDA context is heavy operation so we should reuse it as much as possible. Fixes: https://gitlab.freedesktop.org/gstreamer/gst-plugins-bad/issues/1130	2019-12-24 08:10:14 +00:00
Seungha Yang	0cf67c3be7	nvenc: Fix crash when nvenc was reused then freed without encoding GstNvBaseEnc::n_bufs was set from the previous encoding session but it wasn't cleared after stop. That might result to invalid memory access at the next start (no encoded data) and then stop sequence. Instead of defining a variable for array length, use GArray::len directly to avoid such confusion.	2019-11-22 03:02:57 +00:00
Seungha Yang	aef414375a	nvenc: Remove unused code path refilling queue would not happen	2019-11-22 03:02:57 +00:00
Aaron Boxer	6d3429af34	documentation: fixed a heap o' typos	2019-11-05 09:11:25 -05:00
Tim-Philipp Müller	f218ec2794	Remove autotools build system	2019-10-14 13:54:27 +01:00
Seungha Yang	52dfbbe5da	nvenc: Early terminate handle_frame if the last flow was not GST_FLOW_OK If the last flow was not GST_FLOW_OK, the encoding thread is not running and there is nothing to pop from GAsyncQueue (this causes deadlock). To prevent deadlock, just return the handle_frame without further encoding process if the last flow was not GST_FLOW_OK. Note that the last flow will be cleared per FLUSH_STOP and STREAM_START event.	2019-09-11 15:21:03 +00:00
Seungha Yang	68a51abdcd	nvenc: Add support VUYA format The addition is very simple. Map NV_ENC_BUFFER_FORMAT_AYUV format to GST_VIDEO_FORMAT_VUYA and add a condition for the VUYA format.	2019-09-11 14:33:54 +00:00
Seungha Yang	14b9a1cffd	nvdec: Add support for mpeg4 video decoding with codec_data Decoder should handle codec_data of mpeg4 video which includes essential config data.	2019-09-11 14:00:48 +00:00
Seungha Yang	af77988b9f	nvenc: Reduce the number of pre-allocated device memory The hard-coded upper bound 32 (or 48 depending on resolution) might waste GPU memory and high resolution encoding causes OUT-OF-MEMORY allocation error quite easily. This commit calculates the number of required pre-allocated device memory based on encoding options and it can reduce the amount of device memory used by nvenc.	2019-09-11 11:44:03 +00:00
Seungha Yang	e3508a4f26	nvdec: Update plugin description and fix typo Use consistent description with nvenc, and fix typo s/devide/device/g	2019-09-11 15:16:45 +09:00
Seungha Yang	1cbb23cf79	nvenc: Adjust DTS when bframe is enabled NVDEC driver always uses input timestamp without adjustment even if bframe encoding was enabled. So DTS can be larger than PTS when bframe was enabled. To ensure PTS >= DTS, we should adjust the timestamp manually based on the PTS difference between the first encoded frame and the second one. That's also the maximum PTS/DTS difference.	2019-09-11 13:18:12 +09:00
Seungha Yang	83a1c7a9a6	nvenc: Add qp-{min,max,const}-{i,p,b} properties This new properties allows more detailed target QP value setting	2019-09-11 13:18:12 +09:00
Seungha Yang	d3a909ccdd	nvenc: Add properties to support bframe encoding if device supports it Note that bframe encoding capability varies with GPU architecture	2019-09-11 13:18:12 +09:00
Seungha Yang	94f2843774	nvenc: Refactoring internal buffer pool structure To support rc-lookahead and bframe encoding, nvenc needs one more staging queue, because NvEncEncodePicture can return NV_ENC_ERR_NEED_MORE_INPUT but which was not considered so far. As documented by NVENC programming guide, pending buffers should wait other inputs until NvEncEncodePicture returns success. New encoding flow is - Submit raw picture buffer to encoder with NvEncEncodePicture - The submitted input/output buffer pair will be queued to pending_queue - If NvEncEncodePicture returned success, then move all pair in pending_queue to final stage - Otherwise, wait more input raw pictures. Another change is dropping NV_ENC_LOCK_INPUT_BUFFER usage. So now nvenc always uses CUDA memory input buffer. As a result, both opengl and system memory handling are unified.	2019-09-11 13:18:12 +09:00
Seungha Yang	e73acbaa5c	nvenc: Remove pointless iteration and cleanup some code * The number of iteration is always one so the iteration is useless and that makes code complicated. * Also defining named structure can code mroe readable. * g_free is null safe	2019-09-11 13:18:12 +09:00
Seungha Yang	81272eaa82	nvenc: Add more rate-control options New rate-control modes are introduced (if device can support) * cbr-ld-hr: CBR low-delay high quality * cbr-hq: CBR high quality * vbr-hq: VBR high quality Also, various configurable rate-control related properties are added.	2019-09-11 13:18:12 +09:00
Seungha Yang	ea19a7c715	nvenc: Add support for weighted prediction option Note that this property will be exposed only if the device supports the weighted prediction.	2019-09-11 13:18:12 +09:00
Seungha Yang	d05cbdbd72	nvenc: Add property for AUD insertion Make AUD insertion configurable option	2019-09-11 13:18:12 +09:00
Seungha Yang	b3b723462e	nvenc: Refactor class hierarchy to handle device capability dependent options Introducing new dynamic class between GstNvBaseEncClass and each subclass to be able to access device specific properties and capabilities from each subclass implementation side.	2019-09-11 13:18:09 +09:00
Marc Leeman	3ef503346a	nvcodec: minor spell corrects in log messages	2019-09-10 23:13:17 +00:00
Seungha Yang	09fd34dbb0	nvenc: Add support runtime resolution change freely Do not restrict allowed maximum resolution depending on the initial resolution. If new resolution is larger than previous one, just re-init encode session.	2019-09-02 10:59:03 +09:00
Seungha Yang	fa83f086be	nvdec: Check flow return of the only current handle_frame() to fix seeking issue Due to uncleared last flow, decoding after seek was never possible (last_ret == GST_FLOW_FLUSHING). nvdec dose not need to keep track of the previous flow return, and actually the interest is data/even flow of the current handle_frame().	2019-08-30 11:27:27 +09:00
Seungha Yang	be3a3da829	nvdec: Fallback to system memory if OpenGL context could not support PBO memory If the environment could not support OpenGL PBO memory, nvdec will do negotiation with system memory as fallback.	2019-08-30 01:36:46 +09:00
Seungha Yang	069fe93452	nvdec: Add support dynamic output format change Implementing ::negotiate() method to support runtime output format change. If downstream was reconfigured, baseclass will invoke ::negotiate() method, and nvdec should update output memory type depending on downstream caps.	2019-08-30 01:36:46 +09:00
Seungha Yang	39f800c449	nvdec: Re-negotiate whenever output format is changed Input stream might be silently changed without ::set_format() call. Since nvdec has internal parser, nvdec element can figure out the format change by itself.	2019-08-30 01:36:41 +09:00
Seungha Yang	f4f8941a91	nvdec: Add support 4:4:4 and 4:2:0 12bit decoding Depending on GPU architecture, HEVC decoder can support 4:4:4 format up to 12 bitdepth. This commit covers VP9 4:2:0 12 bits decoding also.	2019-08-29 13:39:59 +00:00
Seungha Yang	ff9838fd3d	nvenc: Add support for old drivers which could not understand SDK version 9.0 Add helper functions to support old drivers with our previous SDK version 8.1	2019-08-29 13:39:59 +00:00
Seungha Yang	afebb15d99	nvenc: Use consistent snake case convention	2019-08-29 13:39:59 +00:00
Seungha Yang	1010b9f567	nvcodec: Bump SDK header to version 9.0 The latest Turing architecture (e.g., RTX serise) can support decoding HEVC 4:4:4 format up to 12bits.	2019-08-29 13:39:59 +00:00
Seungha Yang	338a32b672	nvenc: Port to GstCudaGraphicsResource Register openGL resource only once per memory. Also if upstream provides the registered information, reuse the information instead of doing it again. This can improve performance dramatically depending on system since the resource registration might cause high overhead.	2019-08-29 18:45:25 +09:00
Seungha Yang	d0846f8eab	nvdec: Port to GstCudaGraphicsResource Make it possible to share registered graphics resource among nvidia encoders and decoders.	2019-08-29 18:05:51 +09:00
Seungha Yang	da075b94a9	cudautils: Add GstCudaGraphicsResource structure for better openGL interoperability Introduce GstCudaGraphicsResource structure to represent registered CUDA graphics resources and to enable sharing the information among nvdec and nvenc. This structure can reduce the number of resource registration which cause high overhead.	2019-08-29 18:04:33 +09:00
Seungha Yang	8dc2b4a393	nvdec: Port to openGL PBO memory For openGL interoperability, nvdec uses cuGraphicsGLRegisterImage API which is to register openGL texture image. Meanwhile nvenc uses cuGraphicsGLRegisterBuffer API to registure openGL buffer object. That means two kinds of graphics resources are registered per memory when nvdec/nvenc are configured at the same time. The graphics resource registration brings possibly high overhead so the registration should be performed only once per resource from optimization point of view.	2019-08-29 18:04:33 +09:00
Seungha Yang	9bfd6d13e6	nvdec: Filter openGL API version to use To ensure PBO buffer, openGL API >= 3 is required.	2019-08-29 18:04:29 +09:00
Seungha Yang	807e311ae8	nvdec: Always response QUERY_CONTEXT even if openGL is unavailable on the system nvdec can response for the CUDA context type query regardless of openGL availability.	2019-08-21 14:14:07 +09:00

1 2

97 commits