Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Q-Learning for Inventory Optimization
Live code walkthrough of Q‑Learning applied to inventory optimization, covering reward design, Q‑table mechanics, state transitions, and practical deployment considerations.
This hands-on technical session demonstrates the implementation and practical application of Q-Learning in solving real-world inventory management challenges. The presentation will feature:
- Live code walkthrough of a complete Q-Learning implementation for inventory optimization, examining the mathematical foundations and state-action-reward framework that drives the system
- Interactive demonstration showing how reinforcement learning outperforms traditional rule-based inventory systems across various scenarios (seasonal demand, supply chain disruptions, capacity constraints)
- Deep dive into the Q-table mechanics, demonstrating how the algorithm evaluates different states and actions, including visualization of the learning process over training epochs
- Step-by-step examination of the reward function design and its critical impact on model behavior, with live modification of parameters to demonstrate their effects
- Technical dissection of state transition functions, balancing exploration vs. exploitation, and hyperparameter optimization techniques specific to inventory applications
The session will demonstrate how to implement several advanced RL techniques within the inventory context:
- Temporal difference learning with multi-step lookahead
- Decay-based exploration strategies
- State discretization for continuous inventory spaces
- Lead time handling in RL decision systems
- Deployment considerations in production environments
I’ll conclude with a live demonstration of the model adapting to audience-suggested challenging scenarios, showing how it responds to unexpected demand patterns, capacity constraints, and supply chain disruptions.