GPU Parallelization for a Non-Deterministic Model

Hi, I'm looking to implement a membrane computing model, which is a non-deterministic model supporting maximal parallelism (in the sense that every 'rule' that can be applied will be applied). I'm new to GPU parallelization, which is what I have in mind, and also new to C++ parallelization techniques and libraries in general. I'd like to hear opinions on what options are available for accomplishing this in C++.
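
To make "maximal parallelism" concrete, here is a minimal sequential sketch of one step as I understand it, for a single membrane. Everything in it is an assumption made for illustration: the `Multiset`, `Rule`, `applicable`, and `step` names, the three-symbol alphabet, and the uniform random choice among competing rules. It is only meant as a reference for the semantics, not a GPU implementation.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <random>
#include <vector>

// Hypothetical representation: a multiset over K object types, stored as counts.
constexpr std::size_t K = 3;  // assumed alphabet: a, b, c
using Multiset = std::array<unsigned, K>;

struct Rule {
    Multiset lhs;  // objects consumed by one application
    Multiset rhs;  // objects produced by one application
};

// True if `pool` still holds every object that r.lhs needs.
bool applicable(const Rule& r, const Multiset& pool) {
    for (std::size_t i = 0; i < K; ++i)
        if (pool[i] < r.lhs[i]) return false;
    return true;
}

// One maximally parallel step: keep assigning objects to randomly chosen
// applicable rules until no rule can fire on what is left. Products are
// buffered and only added at the end, so they cannot be consumed in the
// same step.
Multiset step(Multiset pool, const std::vector<Rule>& rules, std::mt19937& rng) {
    for (const Rule& r : rules) {
        // A rule that consumes nothing would stay applicable forever.
        unsigned consumed = 0;
        for (unsigned n : r.lhs) consumed += n;
        assert(consumed > 0);
    }

    Multiset produced{};  // zero-initialized
    for (;;) {
        std::vector<std::size_t> candidates;
        for (std::size_t j = 0; j < rules.size(); ++j)
            if (applicable(rules[j], pool)) candidates.push_back(j);
        if (candidates.empty()) break;  // maximality reached

        // Non-deterministic choice among competing rules (assumed uniform).
        std::uniform_int_distribution<std::size_t> pick(0, candidates.size() - 1);
        const Rule& r = rules[candidates[pick(rng)]];
        for (std::size_t i = 0; i < K; ++i) {
            pool[i] -= r.lhs[i];
            produced[i] += r.rhs[i];
        }
    }

    for (std::size_t i = 0; i < K; ++i) pool[i] += produced[i];
    return pool;
}
```

For example, with the two competing rules a -> b and a -> c and four copies of a, one call to `step` consumes all four a's, and the split between b and c depends on the random choices. What I'm unsure about is how to map this selection-and-apply loop onto GPU threads.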

Thanks