I am a Senior Data Scientist at Walmart, based in California. I lead the design and deployment of reinforcement learning systems for real-world optimization problems at scale — including dynamic pricing for crowd-sourced delivery and task assignment for in-store operations. My work draws from both operations research and machine learning, with a focus on sequential decision-making, multi-agent coordination, and robust learning in high-stakes environments. Previously, I was a Machine Learning Scientist at Hitachi America, where I led initiatives involving reinforcement learning and Generative AI and Large Language Models (LLMs) for supply chain resilience. I hold a Ph.D. in Operations Research from the University of Pittsburgh and a master's degree in Financial Engineering and Operations Research from the American University of Beirut.
Publications in peer-reviewed conferences and journals. My research focuses on reinforcement learning, operations research, and supply chain optimization.
2024
Haiyan Wang, Atsuki Kiuchi, Hsiu-Khuern Tang, Chetan Gupta, EL-SHAR Ibrahim, SUN Wenhuan
US Patent App. 18/105,593
A novel system that combines bound-enhanced reinforcement learning techniques for optimizing distribution supply chain management, using upper and lower bounds of optimal inventory costs with epsilon-greedy exploration in multi-echelon supply chains.
2023
Ibrahim El Shar, Daniel R. Jiang
Advances in Neural Information Processing Systems 37 (NeurIPS 2023)
We propose Weakly Coupled Deep Q-Networks (WCDQN), a new deep RL algorithm for weakly coupled MDPs, where the state and action spaces can be decomposed into nearly independent components except for a linking constraint on the action space.
2023
Ibrahim El Shar, Haiyan Wang, Chetan Gupta
IEEE International Conference on Automation Science and Engineering (CASE) 2023
An approach to balance economic, environmental, and social objectives in supply chain optimization using multi-objective reinforcement learning.
2022
Ibrahim El Shar, Wenhuan Sun, Haiyan Wang, Chetan Gupta
IEEE International Conference on Automation Science and Engineering (CASE) 2022
This paper is based on my work during my internship at Hitachi America, Ltd. R&D, focusing on inventory optimization in complex supply chains.
2022
Ibrahim El Shar, Daniel Jiang
Work in Progress
We study the problem of spatial dynamic pricing for shared resources that circulate in a network. For the general network, we show that the optimal value function is concave and for a network composed of two locations, we show that the optimal policy enjoys certain monotonicity and bounded sensitivity properties. These results inform our novel heuristic and deep reinforcement learning algorithm.
2020
2018
Walid W. Nasr, Ibrahim El Shar
European Journal of Operational Research, 2018
This paper addresses inventory control optimization under complex demand patterns with Markovian dynamics and non-stationarity.
2019-2020
Developed reinforcement learning and imitation learning methods for an endless grid-world environment with complex reward collection tasks.
2018-2019
Developed a novel sentence-based text representation for language classification problems, evaluated using the IMDB Movie Review Dataset.
2018
Studied the problem of dynamic airline pricing with ticket return considerations, optimizing both initial ticket pricing and return value policies.
2018
Analyzed budget optimization for hospital inventory management, developing a simulation model for automated dispensing cabinets restocking policy.