Abdullah

amirali1985

AI & ML interests

Mechanistic interpretability, high-dimensional geometry, persona role-playing.

Recent Activity

published a dataset about 10 hours ago
curveball-steering/kpca_models
updated a dataset about 11 hours ago
curveball-steering/mcq-eval-prompts
published a dataset about 11 hours ago
curveball-steering/mcq-eval-prompts

Organizations

Thoughtworks, Apart Research, Martian, nlp-and-interpretability, Backdoors research, PhillipsLab, TailsResearch, Flocker AI, stride_influence, curveball-steering