Adam Gleave | PhD Candidate @ UC Berkeley
Adam Gleave | PhD Candidate @ UC Berkeley
Home
Publications
Opinions
Contact
CV
Light
Dark
Automatic
3
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Inverse Reinforcement Learning (IRL) algorithms infer a reward function that explains demonstrations provided by an expert acting in …
Adam Gleave
,
Sam Toyer
PDF
Cite
Uncertainty Estimation for Language Reward Models
Language models can learn a range of capabilities from unsupervised training on text corpora. However, to solve a particular problem …
Adam Gleave
,
Geoffrey Irving
PDF
Cite
Invariance in Policy Optimisation and Partial Identifiability in Reward Learning
It’s challenging to design reward functions for complex, real-world tasks. Reward learning lets one instead infer reward …
Joar Skalse
,
Matthew Farrugia-Roberts
,
Stuart Russell
,
Alessandro Abate
,
Adam Gleave
PDF
Cite
Cite
×