Learning from negative feedback, or positive feedback or both [2410.04166]