#105
Clarification on how OCR annotations are used during training
How PaddleOCR tags become multi-label Partial FC training signals, with an animated ArcFace visualization, KaTeX formulas, and a full pipeline walkthrough.
👤 JerryPW
📅 2026-04-08
6 sections
Interactive animations
#112
Clarification on Intra-frame (I-frame) interval in data preprocessing
A fixed GOP=16 is set during HEVC encoding (Step 2). Step 3 only reads the existing bitstream and anchors the first sampled frame as an I-frame. Includes an animated GOP visualization, code references, and KaTeX formulas.
👤 RRooyyCChheenn
📅 2026-04-14
5 sections
GOP animation
#113
Are all I-frame tokens intended to be preserved in the current implementation?
Analysis of a mismatch between the paper and the implementation. The DALI dataloader zeros out I-frame residuals, which prevents them from being selected by Top-K. Includes code references and a comparison with Compressed Video Reader's keep_first_full_frame option.
👤 Wenbo-Nie
📅 2026-04-14
7 sections
Paper vs. Implementation
#116
Frame-wise Normalization & Global Top-K
Why we normalize per frame before selecting patches globally, with an animated matrix demo, a pipeline walkthrough, and code references.
👤 ThisisBillhe
📅 2026-04-16
8 sections
Interactive demo