
Chanwoo Park
@chanwoopark20
Games, Multi-agent (gen) AI | @speedrun SR003 | @mit EECS Ph.D. Candidate
ID: 1457347791723069440
https://chanwoo-park-official.github.io/ 07-11-2021 14:04:11
702 Tweet
1,1K Followers
1,1K Following











That is the reason you need an evolving reward function. huggingface.co/papers/2504.20… Check out this paper. -- providing some answers about "curriculum learning / evolving reward / reward hacking" using evaluative thinking. Zae Myung Kim



