Language Model Contains Personality Subnetworks

(arxiv.org)

35 points | by PaulHoule 6 hours ago

22 comments