RGB Image

RGB images, representing color information in red, green, and blue channels, are fundamental to computer vision, serving as input for a wide range of tasks. Current research focuses on leveraging RGB data for diverse applications, including 3D object reconstruction (often employing transformer networks and Gaussian splatting), human pose estimation (using graph convolutional networks and privileged information), and robotic manipulation (through Sim2Real transfer and foundation models). These advancements significantly impact fields like robotics, medical imaging, and remote sensing by enabling more robust and efficient solutions for tasks ranging from automated object grasping to flood detection and surgical skill assessment.

Papers