Timestamp Supervision

Timestamp supervision is a machine learning technique that uses sparsely annotated timestamps instead of full frame-level labels to train models for tasks like action segmentation and automatic speech recognition (ASR). Current research focuses on developing robust algorithms, such as expectation-maximization methods and graph convolutional networks, to effectively utilize these limited annotations and mitigate the impact of missing or inaccurate timestamps. This approach significantly reduces annotation costs while achieving performance comparable to or even exceeding fully supervised methods in various applications, making it a valuable tool for efficient model training in time-series data analysis.

Papers