CellCentroidFormer: Combining Self-attention and Convolution for Cell Detection [2206.00338]