The paper reports that "having an RoI pooling layer that is differentiable w.r.t the box coordinates is a nontrivial problem" and refers to "ROI Warping" (crops
I wish to create a custom pooling layer which can efficiently work on GPUs. For instance, I have following input tensor in = <tf.Tensor: shape=(4, 5), dtype=