Added comments
parent 38a16e75fe
commit bcff06f9c2
1 changed file with 17 additions and 0 deletions
@@ -26,6 +26,21 @@ class PerspectiveEstimator(nn.Module):
Input: Pre-processed, uniformly-sized image data
Output: Perspective factor

**Note**
--------
The loss input needs to be computed from the output of the **entire**
rev-perspective network. It therefore needs to compute:

- the effective pixel count of each row after the transformation;
- the feature density (count) along each row, summed over the columns.

The loss is computed as the variance over the row feature densities
(Ref. Sec. 3.2 of the paper; see the sketch after this hunk). After all,
it is reasonable to say that you see more when you look at faraway places.

This does imply that **we need to obtain a reasonably good feature
extractor from general images before training this submodule**. Hence,
for now, we should probably work on the transformer first.

:param input_shape: (N, C, H, W)
:param conv_kernel_shape: Oriented as (H, W)
:param conv_dilation: equidistant dilation factor along H and W
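Below is a minimal sketch of that variance-over-row-densities loss, assuming a PyTorch implementation. The function name `perspective_variance_loss`, the `row_pixel_counts` input, and the normalization by effective pixel count are assumptions drawn from the note above, not the repo's actual code:

```python
import torch

def perspective_variance_loss(feature_map: torch.Tensor,
                              row_pixel_counts: torch.Tensor,
                              eps: float = 1e-8) -> torch.Tensor:
    """Variance of per-row feature densities (cf. Sec. 3.2 of the paper).

    feature_map:      (N, C, H, W) activations taken after the full
                      rev-perspective network (hypothetical input).
    row_pixel_counts: (N, H) effective pixel count of each row after
                      the transformation (hypothetical input).
    """
    # Feature count per row: sum over channels and columns -> (N, H).
    row_counts = feature_map.sum(dim=(1, 3))
    # Density: features per effective pixel in each row.
    density = row_counts / (row_pixel_counts + eps)
    # Variance across rows, averaged over the batch.
    return density.var(dim=1).mean()
```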
@@ -96,3 +111,5 @@ class PerspectiveEstimator(nn.Module):
# Map to a strictly positive perspective factor; epsilon bounds it
# away from zero.
out = torch.exp(-out) + self.epsilon

return out
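For context, a hypothetical instantiation and forward pass, assuming the constructor takes the three documented parameters and that `forward` returns the perspective factor; the shapes and values are illustrative only:

```python
import torch

# Hypothetical usage; argument names follow the :param docs above.
model = PerspectiveEstimator(
    input_shape=(8, 3, 256, 256),  # (N, C, H, W)
    conv_kernel_shape=(3, 3),      # oriented as (H, W)
    conv_dilation=2,               # same dilation along H and W
)
factor = model(torch.rand(8, 3, 256, 256))
# exp(-out) + epsilon guarantees factor > epsilon, i.e. strictly positive.
```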