#48
summarized by : Anonymous
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

どんな論文か?

improving the training speed of detr variants; identified that instability of bipartite graph matching contributes to slow convergence of detr; introducing denoising principle into detection models
placeholder

新規性

noise gt queries are added to the decoder to skip the bipartite matching (reconstruction loss imposed to recover the gt); attention mask is designed to avoid accessing the gt information

結果

SOTA performance using Resnet50 backbone among DETR-like methods; best performance at 12 epochs, 50 epochs

その他(なぜ通ったか?等)