gstreamer

mirror of https://gitlab.freedesktop.org/gstreamer/gstreamer.git synced 2024-12-17 05:46:36 +00:00

Author	SHA1	Message	Date
Seungha Yang	338a32b672	nvenc: Port to GstCudaGraphicsResource Register openGL resource only once per memory. Also if upstream provides the registered information, reuse the information instead of doing it again. This can improve performance dramatically depending on system since the resource registration might cause high overhead.	2019-08-29 18:45:25 +09:00
Seungha Yang	d0846f8eab	nvdec: Port to GstCudaGraphicsResource Make it possible to share registered graphics resource among nvidia encoders and decoders.	2019-08-29 18:05:51 +09:00
Seungha Yang	da075b94a9	cudautils: Add GstCudaGraphicsResource structure for better openGL interoperability Introduce GstCudaGraphicsResource structure to represent registered CUDA graphics resources and to enable sharing the information among nvdec and nvenc. This structure can reduce the number of resource registration which cause high overhead.	2019-08-29 18:04:33 +09:00
Seungha Yang	8dc2b4a393	nvdec: Port to openGL PBO memory For openGL interoperability, nvdec uses cuGraphicsGLRegisterImage API which is to register openGL texture image. Meanwhile nvenc uses cuGraphicsGLRegisterBuffer API to registure openGL buffer object. That means two kinds of graphics resources are registered per memory when nvdec/nvenc are configured at the same time. The graphics resource registration brings possibly high overhead so the registration should be performed only once per resource from optimization point of view.	2019-08-29 18:04:33 +09:00
Seungha Yang	9bfd6d13e6	nvdec: Filter openGL API version to use To ensure PBO buffer, openGL API >= 3 is required.	2019-08-29 18:04:29 +09:00
Seungha Yang	807e311ae8	nvdec: Always response QUERY_CONTEXT even if openGL is unavailable on the system nvdec can response for the CUDA context type query regardless of openGL availability.	2019-08-21 14:14:07 +09:00
Seungha Yang	4f60117db9	nvdec: Fix possible null object unref gst_query_get_n_allocation_pools > 0 does not guarantee that the N th internal array has GstBufferPool object. So users should check the returned GstBufferPool object from gst_query_parse_nth_allocation_pool.	2019-08-20 10:14:54 +09:00
Seungha Yang	eab564d857	nvcodec: Use default flag for CUDA stream creation Since nvdec/nvenc engine is running on default stream, non-default CUDA stream should be synchronized with default stream eventually.	2019-08-19 07:13:26 +00:00
Seungha Yang	ca6657367c	nvenc: Use non default CUDA stream and async operation Use CUDA async operation if possible with non default CUDA stream	2019-08-19 01:18:52 +00:00
Seungha Yang	5615e9258f	nvdec: Don't use default CUDA stream Async CUDA operation with default stream (NULL CUstream) is not much beneficial than blocking operation since all CUDA operations which belong to the CUDA context will be synchronized with the default stream's operation. Note that CUDA stream will share all resources of the corresponding CUDA context but which can help parallel operation similar to the relation between thread and process	2019-08-19 01:18:52 +00:00
Seungha Yang	20d8f54e63	nvdec: Push/Pop CUDA context around library API call	2019-08-19 01:18:52 +00:00
Seungha Yang	f7b2b1b99d	nvdec: Fix timestamp mismatch on draining frames The internal decoding state must be GST_NVDEC_STATE_PARSE before calling CuvidParseVideoData(). Otherwise, nvdec will be confused on decode callback as if the frame is decoding only frame and the input timestamp of corresponding frame will be ignored. Eventually one decoded frame will have non-increased PTS.	2019-08-18 15:52:32 +09:00
Seungha Yang	b64733972e	nvdec: Do not access nvdec object from destroy function of qdata The destroy callback can be called just before the fìnalization of GstMiniObject. So the nvdec object might be destroyed already. Instead, store the GstCudaContext with increased ref to safely unregister the CUDA resource.	2019-08-16 19:40:31 +09:00
Seungha Yang	e6d21d048a	nvenc: Add support YV12 format YV12 format is supported by Nvidia NVENC without manual conversion. So nvenc is exposing YV12 format at sinkpad template but there is some missing point around uploading the memory to GPU.	2019-08-09 11:43:22 +09:00
Seungha Yang	8dbaed0af7	nvh265enc: Enable HDR related SEI nal insertion If upstream provides the HDR related information, create SEI message nals and pass them to NVENC.	2019-08-08 23:18:14 +09:00
Seungha Yang	f3e12a0b56	nvh265enc: Add support YUV 444 10bits encoding Note that h264 encoder does not support the YUV 444 10bits format	2019-08-08 00:46:16 +09:00
Seungha Yang	fa5e6f546b	nvenc: Remove unnecessary constraint from YUV420 10bits capability decision YUV444 capability shouldn't be applied to YUV420 10 bits format	2019-08-08 00:46:12 +09:00
Seungha Yang	cc4d0e91e3	nvenc: Fix broken RGB format support Add missing format check introduced by the commit `7de4dbdeb2`	2019-08-07 07:27:36 +00:00
Seungha Yang	9d0545d1a2	nvcodec: Wrap CUDA API return check with gst_cuda_result The gst_cuda_result macro function is more helpful for debugging than previous cuda_OK because gst_cuda_result prints the function and line number. If the CUDA API return was not CUDA_SUCCESS, gst_cuda_result will print WARNING level debug message with error name, error text strings.	2019-08-07 00:59:36 +00:00
Seungha Yang	d69b590683	nvdec: Port to GstCUDAContext ... and drop CUvideoctxlock usage. The CUvideoctxlock basically has the identical role of cuda context push/pop but nvdec specific way. Since we can share the CUDA context among encoders and decoders, use CUDA context directly for accessing GPU API.	2019-08-07 00:59:36 +00:00
Seungha Yang	5cf0351418	nvenc: Port to GstCudaContext ... and add support CUDA context sharing similar to glcontext sharing. Multiple CUDA context per GPU is not the best practice. The context sharing method is very similar to that of glcontext. The difference is that there can be multiple context object on a pipeline since the CUDA context is created per GPU id. For example, a pipeline has nvh264dec (uses GPU #0) and nvh264device0dec (uses GPU #1), then two CUDA context will propagated to all pipeline.	2019-08-07 00:59:36 +00:00
Seungha Yang	094e4a9f5c	nvcodec: Introduce NVIDA CUDA helpers New object and helper functions can remove duplicated code from nvenc/nvdec. Also this is prework for CUDA device context sharing among nvdec(s)/nvenc(s).	2019-08-07 00:59:36 +00:00
Seungha Yang	7de4dbdeb2	nvenc: Return profile compatible input formats from GstVideoEncoder::getcaps Do not accept any input formats which could not be supported by downstream requested codec profiles.	2019-08-06 15:03:22 +00:00
Seungha Yang	9e81f8e700	nvenc: Fix caps negotiation failure on unspecified interlace-mode During GstVideoInfo conversion from GstCaps, interlace-mode is inferred to progressive so unspecified interlace-mode should not cause any negotiation issue. Simly set GST_PAD_FLAG_ACCEPT_INTERSECT flag on sinkpad to fix issue.	2019-08-06 15:03:22 +00:00
Seungha Yang	b43d0f785c	nvenc: Remove unused member variables Supported interlace-mode and codec profiles are checked during plugin init and those values are never used.	2019-08-06 15:03:22 +00:00
Seungha Yang	f7f9f327cd	nvdec: Respect upstream provided timestamp Decoder sometimes reports nonincreasing timestamp. Use input frame's timestamp like other decoder elements.	2019-08-05 20:32:39 +00:00
Seungha Yang	e68bfd7566	nvenc: Add support RGB 8/10bits formats BGRA/RGBA/RGB10A2/BGR10A2 formats can be supported by nvenc. Depending on device, supported format can be different. Fixes: https://gitlab.freedesktop.org/gstreamer/gst-plugins-bad/issues/1038	2019-08-05 18:55:28 +00:00
Seungha Yang	c99b160b50	nvdec: Use upstream framerate if possible Encoded bitstream might not have valid framerate. If upstream provided non-variable-framerate (i.e., fps_n > 0 and fps_d > 0) use upstream framerate instead of parsed one.	2019-08-05 15:32:43 +00:00
Seungha Yang	158b4d8649	nvenc: Fix crash with unspecified framerate Nvidia driver seems to calculating floating point framerate without validation. This causes crash both on linux and Windows. Fixes: https://gitlab.freedesktop.org/gstreamer/gst-plugins-bad/issues/1012	2019-08-05 15:32:43 +00:00
Seungha Yang	2a76807c9a	configure: Update for nvcodec dependency change nvcodec is compilable without external dependency	2019-07-31 15:36:04 +00:00
Seungha Yang	f1cbab7cfd	nvdec: Fix build warning error gstnvdec.c:1222:3: error: implicit declaration of function ‘memset’ [-Werror=implicit-function-declaration] memset (&type_info, 0, sizeof (type_info)); ^~~~~~	2019-07-31 15:36:04 +00:00
Seungha Yang	4fa5a82762	nvenc: Fix build error with x86 msvc __stdcall is accepted or ignored by the compiler on x64 but x86 is not the case. So the function definition should be consistent with declaration. Fixes: https://gitlab.freedesktop.org/gstreamer/gst-plugins-bad/issues/1039	2019-07-30 19:12:46 +09:00
Seungha Yang	0445ed6ba5	nvenc: Fix deadlock when pad_push return was not GST_FLOW_OK Encoding thread is terminated without any notification so upstream streaming thread is locked because there is nothing to pop from GAsyncQueue. If downstream returns error, we need put SHUTDOWN_COOKIE to GAsyncQueue for chain function can wakeup.	2019-07-30 17:49:25 +09:00
Seungha Yang	3faf439347	nvcodec: Fix broken ABI in cuda stub header to fix nvenc with opengl Fix the broken ABI introduced by the commit `367e742e5d` From CUDA Toolkit 3.2, size_t has been used in CUDA_MEMCPY2D structure instead of unsigned int.	2019-07-30 11:13:18 +09:00
Seungha Yang	694f91da88	nvdec: Make OpenGL dependency optional By adding system memory support for nvdec, both en/decoder in the nvcodec plugin are able to be usable regardless of OpenGL dependency. Besides, the direct use of system memory might have less overhead than OpenGL memory depending on use cases. (e.g., transcoding using S/W encoder)	2019-07-26 00:01:23 +00:00
Seungha Yang	733c109ce9	nvcodec: Clean up pointless return values around plugin init Any plugin which returned FALSE from plugin_init will be blacklisted so the plugin will be unusable even if an user install required runtime dependency next time. So that's the reason why nvcodec returns TRUE always. This commit is to remove possible misreading code.	2019-07-25 08:47:50 +00:00
Seungha Yang	7b9045d846	nvcodec: Change log level for g_module_open failure Since we build nvcodec plugin without external CUDA dependency, CUDA and en/decoder library loading failure can be natural behavior. Emit error only when the module was opend but required symbols are missing.	2019-07-25 08:47:50 +00:00
Seungha Yang	e5a98cf9d8	nvdec: Add support for 10bits 4:2:0 decoding This commit includes h265 main-10 profile support if the device can decode it. Note that since h264 10bits decoding is not supported by nvidia GPU for now, the additional code path for h264 high-10 profile is a preparation for the future Nvidia's enhancement.	2019-07-25 08:06:26 +00:00
Seungha Yang	d692350fc3	nvdec: Specify supported profiles of h264/h265 codec See more details about supported formats at nvidia codec sdk document "NVDEC_VideoDecoder_API_ProgGuide.pdf" Table 1. Hardware Video Decoder Capabilities. Fixes: https://gitlab.freedesktop.org/gstreamer/gst-plugins-bad/issues/926	2019-07-25 08:06:26 +00:00
Seungha Yang	c8640e23f4	nvdec: Skip draining before creating internal parser GstVideoDecoder::drain/flush can be called at very initial state with stream-start and flush-stop event, respectively. Draning with NULL CUvideoparser seems to unsafe and that eventually failed to handle it.	2019-07-25 07:11:04 +00:00
Seungha Yang	367e742e5d	nvcodec: Drop system installed cuda.h dependency ... and add our stub cuda header. Newly introduced stub cuda.h file is defining minimal types in order to build nvcodec plugin without system installed CUDA toolkit dependency. This will make cross-compile possible.	2019-07-23 16:32:31 +09:00
Seungha Yang	a2ada54265	nvcodec: Keep requested rank for default device Fix for default encoder and decoder element factory to make them have higher rank than the others.	2019-07-23 10:28:52 +09:00
Seungha Yang	92afa74939	nvenc: Register elements per GPU device with capability check * By this commit, if there are more than one device, nvenc element factory will be created per device like nvh264device{device-id}enc and nvh265device{device-id}enc in addition to nvh264enc and nvh265enc, so that the element factory can expose the exact capability of the device for the codec. * Each element factory will have fixed cuda-device-id which is determined during plugin initialization depending on the capability of corresponding device. (e.g., when only the second device can encode h265 among two GPU, then nvh265enc will choose "1" (zero-based numbering) as it's target cuda-device-id. As we have element factory per GPU device, "cuda-device-id" property is changed to read-only. * nvh265enc gains ability to encoding 4:4:4 8bits, 4:2:0 10 bits formats and up to 8K resolution depending on device capability. Additionally, I420 GLMemory input is supported by nvenc.	2019-07-22 21:01:41 +00:00
Seungha Yang	0239152bca	nvdec: Create CUDA context with registered device id Only the default device has been used by NVDEC so far. This commit make it possible to use registered device id. To simplify device id selection, GstNvDecCudaContext usage is removed.	2019-07-22 17:39:45 +00:00
Seungha Yang	1df2f13d0c	nvdec: Register elements per device/codec with capability check By this commit, each codec has its own element factory so the nvdec element factory is removed. Also, if there are more than one device, additional nvdec element factory will be created per device like nvh264device{device-id}dec, so that the element factory can expose the exact capability of the device for the codec.	2019-07-22 17:39:45 +00:00
Seungha Yang	afe3c7e3ef	nvcodec: Drop cudaGL.h dependency nvcodec does not use any type/define/enum in cudaGL.h.	2019-07-22 23:11:14 +09:00
Seungha Yang	48a6641717	nvdec: Fix video stuttering issue with VP9 Address nvidia driver specific behavior to avoid unexpected frame mismatch between GStreamer and NVDEC.	2019-07-19 18:44:32 +09:00
Seungha Yang	8018fa2526	nvdec: Drop async queue and handle data on callback of CUvideoparser Callbacks of CUvideoparser is called on the streaming thread. So the use of async queue has no benefit. Make control flow straightforward instead of long while/switch loop.	2019-07-19 18:44:32 +09:00
Seungha Yang	8753561015	nvdec: Port to color_{primaries,transfer,matrix}_to_iso ... and update the color information only when upstream was not provided the information.	2019-07-17 06:34:21 +00:00
Seungha Yang	e01c68524f	nvenc: Specify colorimetry related VUI parameters Set the colorimetry config for the information to be embedded in encodec bitstream.	2019-07-17 14:45:05 +09:00
Seungha Yang	8862abd7c6	nvdec: Fix possible frame drop on EOS On eos, baseclass videoencoder call finish() vfunc instead of drain()	2019-07-09 20:52:23 +09:00
Marc Leeman	489ff8604f	nvcodec: do a generic cuda tests before going into version specifics	2019-07-08 10:37:46 +00:00
Seungha Yang	c18fda03d9	nvdec,nvenc: Port to dynamic library loading ... and put them into new nvcodec plugin. * nvcodec plugin Now each nvenc and nvdec element is moved to be a part of nvcodec plugin for better interoperability. Additionally, cuda runtime API header dependencies (i.e., cuda_runtime_api.h and cuda_gl_interop.h) are removed. Note that cuda runtime APIs have prefix "cuda". Since 1.16 release with Windows support, only "cuda.h" and "cudaGL.h" dependent symbols have been used except for some defined types. However, those types could be replaced with other types which were defined by "cuda.h". * dynamic library loading CUDA library will be opened with g_module_open() instead of build-time linking. On Windows, nvcuda.dll is installed to system path by CUDA Toolkit installer, and on *nix, user should ensure that libcuda.so.1 can be loadable (i.e., via LD_LIBRARY_PATH or default dlopen path) Therefore, NVIDIA_VIDEO_CODEC_SDK_PATH env build time dependency for Windows is removed.	2019-07-08 10:37:46 +00:00

1 2 3 4

153 commits