Abdullah
amirali1985
AI & ML interests
Mechanistic interpretability, high dimensional geometry, persona role playing.
Recent Activity
updated a dataset 3 days ago
PhillipsLab/axbench-steering-data published a dataset 4 days ago
PhillipsLab/axbench-steering-data updated a model 5 days ago
thoughtworks/coding-sorl