This repository contains code for the paper "Extracting and Understanding the Superficial Knowledge in Alignment" (NAACL 2025).
[NAACL 2025] Extracting and Understanding the Superficial Knowledge in Alignment, Runjin Chen, Gabriel Jacob Perin, Xuxi Chen, Xilun Chen, Yan Han, Nina S. T. Hirata, Junyuan Hong, Bhavya Kailkhura
VITA-Group/Superficial_Alignment