AI & ML interests

Interested in self-modelling in humans and machines.

Recent Activity

brianchristian  published a dataset about 1 month ago
self-model/sycophancy-two-sides-eval
brianchristian  published a dataset about 1 month ago
self-model/discrim-eval-templated
brianchristian  updated a dataset 5 months ago
self-model/discrim-eval-templated
View all activity