InViG: Benchmarking Interactive Visual Grounding with 500K Human-Robot Interactions [2310.12147]