EN
中文

Issue Explainers

Interactive visual explanations for community questions

#105
Clarification on how OCR annotations are used during training
How PaddleOCR tags become multi-label Partial FC training signals — with animated ArcFace visualization, KaTeX formulas, and full pipeline walkthrough.
👤 JerryPW 📅 2026-04-08 6 sections Interactive animations
#112
Clarification on Intra-frame (I-frame) interval in data preprocessing
Fixed GOP=16 is defined during HEVC encoding (Step 2). Step 3 only reads the existing bitstream and anchors the first sampled frame as I-frame. Includes animated GOP visualization, code references, and KaTeX formulas.
👤 RRooyyCChheenn 📅 2026-04-14 5 sections GOP animation
#113
Are all I-frame tokens intended to be preserved in the current implementation?
Analysis of paper vs. implementation mismatch. The DALI dataloader zeros out I-frame residuals, preventing them from being selected by Top-K. Includes code references and comparison with Compressed Video Reader's keep_first_full_frame option.
👤 Wenbo-Nie 📅 2026-04-14 7 sections Paper vs. Implementation
#116
Frame-wise Normalization & Global Top-K
Why we normalize per frame before selecting patches globally — with animated matrix demo, pipeline walkthrough, and code references.
👤 ThisisBillhe 📅 2026-04-16 8 sections Interactive demo