QGFN: Controllable Greediness with Action Values [2402.05234]