Dr. Nahema Marchal (@nahema_marchal)'s Twitter Profile
Dr. Nahema Marchal

@nahema_marchal

research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | ~on mat leave~ views own

ID: 29987775

Joined: 09-04-2009 13:55:12

1.1K Tweets

1.1K Followers

1.1K Following

Mary L. Gray (@marylgray)'s Twitter Profile Photo

Deadline! Fri Dec 9, apps DUE for 2 yr postdoc w/ me, @tarleton, Nancy Baym, & danah boyd. We are looking for researchers (typically ABD or 1ish year out from their PhD) studying the intersections of tech & society—how we work, play, create, & govern. See socialmediacollective.org/2022/11/09/smc…

Laura Weidinger (@weidingerlaura)'s Twitter Profile Photo

🚨PAPER'S OUT! 🚨Very excited that today we’re releasing a new holistic framework for evaluating the safety of generative AI systems. Big evaluation gaps remain + we suggest steps to close these. Paper: arxiv.org/abs/2310.11986, blog: bit.ly/socialethicalG… (1/n)

Arianna Manzini (@arianna_manzini)'s Twitter Profile Photo

📢 New paper out! Super proud to share this great interdisciplinary effort (50+ amazing colleagues ❤️), where we investigate the ethical and societal implications of advanced AI assistants⭐️ 📜Looking for key insights? We have a blog post! deepmind.google/discover/blog/…

Séb Krier (@sebkrier)'s Twitter Profile Photo

🔮 New Google DeepMind paper exploring what persuasion and manipulation mean in the context of language models. 👀 Existing safeguard approaches often focus on the harmful outcomes of persuasion. This research argues for a deeper examination of the process of AI persuasion itself, not only its outcomes.

Laura Weidinger (@weidingerlaura)'s Twitter Profile Photo

📣 New report out! 🎉How do we know whether an AI is “safe”? We share learnings from developing safety evaluation of large-scale systems at Google DeepMind for a broad audience. Report: arxiv.org/abs/2404.14068 Key lessons: 🪡 (1/n)

Canfer Akbulut (@canfer_akbulut)'s Twitter Profile Photo

Have you been thinking about the implications of anthropomorphic AI quite a bit this week? 🤔 We explore the risks of anthropomorphic AI systems in our Ethics of Advanced AI Assistants report. Key insights in thread 💡deepmind.google/discover/blog/…

Laura Weidinger (@weidingerlaura)'s Twitter Profile Photo

New paper out! Very excited that we’re able to share STAR: SocioTechnical Approach to Red Teaming Language Models. We've made some methodological advancements focusing on human red teaming for ethical and social harms. 🧵Check out arxiv.org/abs/2406.11757

Geoffrey Irving (@geoffreyirving)'s Twitter Profile Photo

Job post! We are hiring ML Research Scientists at AISI to help explore the technical details of safety cases for advanced AI systems. Please apply if you enjoy mapping out the arguments and open research problems behind a variety of technical safety approaches! 🧵