Several simple examples for neural network toolkits (PyTorch, TensorFlow, etc.) calling custom CUDA operators. We provide several ways to compile the CUDA kernels and their cpp wrappers, including jit ...