keywords Conservative Pricing Demand Learning Dynamic Pricing Multi-Armed Bandit Upper Confidence Bound