+62 813-8532-9115 info@scirepid.com

 

Atul Sharma

Belum punya Author-ID ?


Judul Sitasi Tahun
PPO-based Reinforcement Learning with Human Feedback with Hybrid Oversight and Predictive Reward Evaluation for AGI (Atul Sharma)
DOI : 10.62411/faith.3048-3719-276 - Volume: 2, Issue: 3, Sitasi : 21
24-Oct-2025 | Abstrak | PDF File | Resource | Last.29-Jan-2026
21 2025
Artikel Per 5.Tahun
Sitasi Per Tahun
Co Authors