DRL Outperforms MVO For Portfolio Optimization

A Feb 19, 2026 arXiv preprint by Srijan Sood compares model-free deep reinforcement learning (DRL) to Mean-Variance Optimization (MVO) for portfolio allocation. The authors describe practical adjustments for both methods and report backtests where the DRL agent outperforms MVO on Sharpe ratio, maximum drawdown, and absolute returns. Results suggest DRL can be a viable alternative for quantitatively driven portfolio management.
Scoring Rationale
Demonstrates meaningful DRL gains in thorough backtests, but remains a single arXiv preprint lacking peer review.
Practice with real FinTech & Trading data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all FinTech & Trading problems


