SpatialJB Exploits Transformer Spatial Weakness To Bypass Guardrails

Authors Zhiyi Mou et al. submitted on Jan. 14, 2026 a paper introducing SpatialJB, a jailbreak method that redistributes tokens across rows, columns, and diagonals to exploit Transformer spatial weaknesses and disrupt LLM output generation. Experiments on leading models report nearly 100% attack success rates and sustain over 75% success against advanced filters including the OpenAI Moderation API; authors also evaluate baseline defenses and release code and a demo.
Scoring Rationale
High novelty, broad industry impact, and reproducible experiments with code; preprint status and limited external validation reduce verification certainty.
Practice with real Ad Tech data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Ad Tech problems

