Effective Altruism Forum
AI Safety Newsletter
AI Safety Newsletter #1 [CAIS Linkpost]
Akash · 2y ago · 38 karma · 0 comments
AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media
Oliver Z, Dan H, Akash, aogara · 2y ago · 4m read · 56 karma · 1 comment
AI Safety Newsletter #3: AI policy proposals and a new challenger approaches
Oliver Z, Dan H, Akash, aogara · 2y ago · 5m read · 35 karma · 1 comment
AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks
Center for AI Safety, Dan H, Akash, aogara · 2y ago · 6m read · 35 karma · 2 comments
AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models
Center for AI Safety, Dan H, Akash, aogara · 2y ago · 5m read · 60 karma · 0 comments
AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control
Center for AI Safety, Dan H, Akash, aogara · 2y ago · 7m read · 32 karma · 1 comment
AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI
Center for AI Safety, Dan H, Akash, aogara · 2y ago · 8m read · 23 karma · 0 comments
AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI
Center for AI Safety, Dan H, Akash, aogara · 1y ago · 7m read · 16 karma · 3 comments
AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?
Center for AI Safety, Dan H, aogara · 1y ago · 9m read · 12 karma · 2 comments
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence
Center for AI Safety, Dan H, aogara · 1y ago · 8m read · 30 karma · 3 comments
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave
Center for AI Safety, Dan H, aogara · 1y ago · 10m read · 25 karma · 0 comments
AISN #14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use
Center for AI Safety, Dan H · 1y ago · 5m read · 26 karma · 0 comments
AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer
Center for AI Safety, Dan H, Corin Katzke, aogara · 1y ago · 7m read · 7 karma · 0 comments
AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight
Center for AI Safety, Dan H, aogara · 1y ago · 9m read · 15 karma · 0 comments
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety
Center for AI Safety, aogara, Dan H · 1y ago · 7m read · 12 karma · 0 comments
AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities
Center for AI Safety, aogara, Dan H · 1y ago · 10m read · 12 karma · 0 comments
AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy
Center for AI Safety, aogara, Dan H · 1y ago · 5m read · 13 karma · 0 comments
AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws
Center for AI Safety, aogara, Dan H · 1y ago · 6m read · 15 karma · 1 comment
AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering
Center for AI Safety, aogara, Dan H · 1y ago · 6m read · 7 karma · 0 comments
AISN #24: Kissinger Urges US-China Cooperation on AI, China's New AI Law, US Export Controls, International Institutions, and Open Source AI
Center for AI Safety, aogara, Dan H, Corin Katzke · 1y ago · 7m read · 16 karma · 1 comment
AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks
Center for AI Safety, aogara, Dan H · 1y ago · 7m read · 21 karma · 0 comments
AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI
Center for AI Safety, aogara, Corin Katzke, allisoncyhuang, Dan H · 1y ago · 7m read · 11 karma · 0 comments
AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar
Center for AI Safety, aogara, Dan H, Corin Katzke, allisoncyhuang · 1y ago · 7m read · 10 karma · 0 comments
AISN #28: Center for AI Safety 2023 Year in Review
Center for AI Safety, aogara, Dan H · 11mo ago · 6m read · 17 karma · 1 comment
AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety
Center for AI Safety, aogara, Dan H, Corin Katzke · 11mo ago · 7m read · 5 karma · 0 comments
AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes
Center for AI Safety, aogara, Dan H, Corin Katzke · 10mo ago · 7m read · 7 karma · 1 comment
AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office
Center for AI Safety, aogara, Dan H · 9mo ago · 8m read · 27 karma · 0 comments
AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets
Center for AI Safety, aogara, Corin Katzke, Dan H · 9mo ago · 10m read · 15 karma · 2 comments
AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI
Center for AI Safety, aogara, Corin Katzke, AlexaPanYue, Dan H · 7mo ago · 11m read · 19 karma · 0 comments
AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate
Center for AI Safety, aogara, Corin Katzke, Dan H · 7mo ago · 10m read · 21 karma · 5 comments
AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google, and Legal Regimes for Training on Copyrighted Data
Center for AI Safety, aogara, Corin Katzke, Dan H · 6mo ago · 7m read · 14 karma · 0 comments
AISN #36: Voluntary Commitments are Insufficient Plus, a Senate AI Policy Roadmap, and Chapter 1: An Overview of Catastrophic Risks
Center for AI Safety, Corin Katzke, Julius, Dan H · 6mo ago · 6m read · 6 karma · 0 comments
AI Safety Newsletter #37: US Launches Antitrust Investigations Plus, recent criticisms of OpenAI and Anthropic, and a summary of Situational Awareness
Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H · 5mo ago · 6m read · 15 karma · 0 comments
AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate AI Plus, “Circuit Breakers” for AI systems, and updates on China’s AI industry
Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H · 4mo ago · 5m read · 8 karma · 0 comments
AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?
Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, Dan H · 3mo ago · 7m read · 17 karma · 0 comments
AI Safety Newsletter #39: Implications of a Trump Administration for AI Policy Plus, Safety Engineering
Center for AI Safety, Corin Katzke, AlexaPanYue, Julius, Dan H · 4mo ago · 7m read · 6 karma · 0 comments
AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics
Center for AI Safety, Corin Katzke, Julius, andrewz, Dan H · 2mo ago · 6m read · 12 karma · 0 comments
AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary
Center for AI Safety, Corin Katzke, Julius, AlexaPanYue, andrewz, Dan H · 2mo ago · 7m read · 10 karma · 0 comments
AI Safety Newsletter #43: White House Issues First National Security Memo on AI Plus, AI and Job Displacement, and AI Takes Over the Nobels
Center for AI Safety, Corin Katzke, AlexaPanYue, Dan H · 24d ago · 7m read · 6 karma · 0 comments
AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems
Center for AI Safety, Corin Katzke, Julius, andrewz, Dan H · 2d ago · 6m read · 11 karma · 0 comments