Thanks Smorodov for your reply.
I have video streams that I want to process on the GPU, so my focus is on the GPU module of OpenCV. I create a separate thread (using pthreads) to process each file. With two files everything works and there is no crash. As soon as I run more than two threads, the program crashes inside the OpenCV module in the 3rd or 4th thread. I provided the error details here
My goal is to run as many parallel threads as possible, using pthreads or any other option. I mentioned CUDA streams because they are one of the options for running multiple kernels on the GPU.
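
To make the setup concrete, here is a minimal sketch of the one-thread-per-file layout described above. It is not my actual program: `FileJob`, `process_file` and `run_jobs` are hypothetical names used only for illustration, and the OpenCV GPU calls are left out, so it shows only the pthreads structure.

```cpp
#include <pthread.h>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical per-thread job: one video file plus a completion flag.
struct FileJob {
    std::string path;   // input file handled by this thread
    bool done = false;  // set by the worker when it finishes
};

// Placeholder for the real per-file work. The decoding and OpenCV GPU
// calls would go here; they are omitted in this sketch.
static void* process_file(void* arg) {
    FileJob* job = static_cast<FileJob*>(arg);
    job->done = true;
    return nullptr;
}

// Start one pthread per file, wait for all of them, and return how many
// jobs completed.
std::size_t run_jobs(const std::vector<std::string>& paths) {
    std::vector<FileJob> jobs(paths.size());
    std::vector<pthread_t> threads(paths.size());
    std::vector<bool> started(paths.size(), false);

    for (std::size_t i = 0; i < paths.size(); ++i) {
        jobs[i].path = paths[i];
        // pthread_create returns 0 on success.
        started[i] =
            pthread_create(&threads[i], nullptr, process_file, &jobs[i]) == 0;
    }

    std::size_t completed = 0;
    for (std::size_t i = 0; i < paths.size(); ++i) {
        if (!started[i]) continue;
        pthread_join(threads[i], nullptr);
        if (jobs[i].done) ++completed;
    }
    return completed;
}
```

Calling `run_jobs` with four file names starts four worker threads, which is the case where the crash shows up once the real OpenCV GPU work is inside `process_file`.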