Transport-Oriented Feature Aggregation for Speaker Embedding Learning [2206.12857]