Amazon Bedrock Adds Reinforcement Fine-Tuning Support

Amazon Web Services in December 2025 launched Reinforcement Fine-Tuning (RFT) on Amazon Bedrock for Nova models and expanded support in February 2026 to open-weight models including OpenAI GPT-OSS 20B and Qwen 3 32B. The post provides an end-to-end walkthrough — using GSM8K and gpt-oss-20B — showing OpenAI-compatible SDK integration, Lambda reward deployment, GRPO-based optimization, and CloudWatch monitoring. This enables enterprises to fine-tune LLMs via iterative reward signals without exporting data.
Key Points
- 1Introduces RFT on Bedrock for Nova and open-weight models (gpt-oss-20B, Qwen 3 32B).
- 2Enables iterative learning from multi-response reward signals using GRPO, reducing need for large labeled datasets.
- 3Allows deploying Lambda-based reward functions, monitoring with CloudWatch, and fine-tuning via OpenAI-compatible SDK.
Scoring Rationale
Official AWS release with practical, end-to-end guidance; significant enterprise impact but builds on established RFT techniques.
Sources
Public references used for this report.
Practice with real Retail & eCommerce data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Retail & eCommerce problems

