feat: single-image mode, 7 examples (discrimination, description, 3x localization, counting, OCR), bbox visualization
5195a24 verified jiang-cc committed on Apr 9