§20.8.42
How is Visual Question Answering (VQAv2 / GQA / OK-VQA) evaluated?
- §20.8 Crowd Counting: density map regression (MCNN / CSRNet / P2PNet / CLIP-EBC)?
- §20.8 Fine-grained classification (FGVC): Bilinear CNN / attention / part-based / CLIP-FGVC?
- §20.8 Image aesthetics assessment (AVA, TAD66K) with NIMA / VILA?
- §20.8 Image Captioning (Show-Attend-Tell → BLIP-2 → InternVL)?
- §20.8 AIGC detection and watermarking (DIRE, AIDE, Stable Signature, Tree-Ring, Gaussian Shading)?
- §20.1 Camera calibration: how does Zhang's method (checkerboard) solve for the intrinsics and distortion coefficients?