Towards Testing and Evaluating Vision-Language-Action Models for Robotic Manipulation: An Empirical Study [2409.12894]