Publications
Weakly Coupled Deep Q-Networks
Ibrahim El Shar, Daniel R. Jiang.
Advances in neural information processing systems 37 (NeurIPS 2023).
We propose Weakly Coupled Deep Q-Networks (WCDQN) a new deep RL algorithm for weakly coupled MDPs.
Lookahead-Bounded Q-Learning
Ibrahim El Shar, Daniel R. Jiang.
The International Conference on Machine Learning (ICML 2020).
We propose a new provably convergent variant of Q-learning that leverages upper and lower bounds derived using information relaxation techniques.
Multi-Objective Reinforcement Learning for Sustainable Supply Chain Optimization
Ibrahim El Shar, Haiyan Wang, Chetan Gupta.
IEEE International Conference on Automation Science and Engineering (CASE) 2023.
Deep Reinforcement Learning toward Robust Multi-echelon Supply Chain Inventory Optimization
Ibrahim El Shar, Wenhuan Sun, Haiyan Wang, Chetan Gupta.
IEEE International Conference on Automation Science and Engineering (CASE) 2022.
This paper is based on my work during my internship at Hitachi America, Ltd. R&D.
Spatial Dynamic Pricing for Shared-Resource Systems
Ibrahim El Shar, Daniel Jiang.
Work in Progress.
We study the problem of spatial dynamic Pricing for a fixed number of shared resources that circulate in a network. For the general network, we show that the optimal value function is concave and for a network composed of two locations, we show that the optimal policy enjoys certain monotonicity and bounded sensitivity properties. We use these results to propose a novel heuristic and a deep reinforcement learning (DRL) algorithm.
Continuous inventory control with stochastic and non-stationary Markovian demand
Walid W. Nasr, Ibrahim El Shar.
European Journal of Operational Research, 2018.
Projects
We work in an endless grid-world with the task of collecting valuable items and tools. Collecting rewards efficiently involves taking the shortest path to collect items and having a high level plan that guides the agent and prioritize where to go and what items to collect. We propose a novel greedy heuristic that we compare to Deep Q-networks (DQN), imitation learning and Deep Q-learning from Demonstrations (DQfD).
We propose a novel sentence-based text representation for language classification problems, which we evaluate using the IMDB Movie Review Dataset.
We study the problem of dynamic airline pricing when the customers can return their tickets for a posted price each period. The goal is to optimize the prices to set for flight tickets pt and the value of returned tickets prt at the beginning of each period t.
We study the case: A Prescription for Budget Woes at Gracious University Hospital. We develop a simulation model using Python 3.6 and SymPy 1.4 to come up with an automated dispensing cabinets (s, S) restocking policy that satisfies both patients and hospital personnel requirements.