Testing Larger Spatial Contexts
Background: The computational cost of Transformer architectures, particularly those incorporating spatial context, increases quadratically with larger input sizes due to the spatial transformer stage.
Question / Future Work: The potential benefits of spatial input sizes larger than the tested 30 x 30 pixel extent remain untested due to the high computational cost, and future work should address the trade-off between increased context and computational feasibility.
Metadata & Links
- created_at
- 2026-03-26T07:10:46Z