Christoph_Winter

Thanks for this - I really enjoy your newsletter!

Re the new Chinese law, you say: "The model should refuse to answer at least 95% of questions that would violate the law, while answering at least 95% of questions that are not illegal."

Could you clarify whether illegality here refers to the question or the (potential) AI-generated response? I would assume that it relates to the response rather than the question but your statement seems to indicate the former.

The related twitter thread (where I assume you got the info from?) seems unclear to me.

Effective Altruism Forum
EA Forum

Posts
2

Comments
1

Christoph_Winter

Posts 2

Comments1

Posts
2

Comments
1