Griffiths Computational Cognitive Science Lab (@cocosci_lab)'s Twitter Profile
Griffiths Computational Cognitive Science Lab

@cocosci_lab

Tom Griffiths' Computational Cognitive Science Lab. Studying the computational problems human minds have to solve.

ID: 1291487042921168898

Link: http://cocosci.princeton.edu/ · Joined: 06-08-2020 21:31:29

167 Tweets

5.5K Followers

130 Following

carlos g. correa (@_cgcorrea)'s Twitter Profile Photo

My paper on hierarchical plans is out in Cognition!🎉

tldr: We ask participants to generate hierarchical plans in a programming game. People prefer to reuse beyond what standard accounts predict, which we formalize as induction of a grammar over actions.

authors.elsevier.com/a/1kBQr2Hx2xLNA
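A toy sketch of the core idea (ours, not the paper's actual model): inducing a grammar over actions means a reused subroutine becomes a named rule, and reuse pays off when the description-length saving outweighs the cost of defining the rule. All names and costs below are hypothetical.

```python
# Minimal description-length comparison for a repetitive plan,
# with and without an induced subroutine rule.

def description_length(plan, grammar):
    """Symbols to write down: the plan itself, plus one symbol per
    rule-body element and one for each rule's name."""
    rule_cost = sum(len(body) + 1 for body in grammar.values())
    return rule_cost + len(plan)

# A repetitive plan written flat, with no reusable subroutine.
flat_plan = ["up", "grab", "down"] * 3
flat_cost = description_length(flat_plan, {})          # 9 symbols

# The same plan after inducing a rule F -> up grab down and reusing it.
grammar = {"F": ["up", "grab", "down"]}
compressed_plan = ["F", "F", "F"]
compressed_cost = description_length(compressed_plan, grammar)  # 4 + 3 = 7

print(flat_cost, compressed_cost)  # 9 7: reuse wins
```

Under this kind of scoring, a preference for reuse beyond what flat planning predicts falls out of favoring plans with shorter joint descriptions of grammar plus plan.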
Kaiqu Liang (@kaiqu_liang)'s Twitter Profile Photo

Think your RLHF-trained AI is aligned with your goals?

⚠️ We found that RLHF can induce significant misalignment when humans provide feedback by predicting future outcomes 🤔, creating incentives for LLM deception 😱

Introducing ✨RLHS (Hindsight Simulation)✨: By simulating…
Gianluca Bencomo (@gianlucabencomo)'s Twitter Profile Photo

New pre-print! In this work, we explore the extent to which different inductive biases can be instantiated among disparate neural architectures, specifically Transformers, CNNs, MLPs, and LSTMs. Link: arxiv.org/abs/2502.20237 (1/4)

Griffiths Computational Cognitive Science Lab (@cocosci_lab)'s Twitter Profile Photo

New preprint reveals that large language models blend two distinct representations of numbers -- as strings and as integers -- which can lead to some surprising errors. This work shows how methods from cognitive science can be useful for understanding AI systems.
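The kind of error in play can be illustrated with plain Python (our illustration, not an example from the preprint): the string view and the integer view of the same numbers order them differently.

```python
# The same numbers under two representations:
as_strings = sorted(["2", "10", "9"])   # lexicographic (string) order
as_integers = sorted([2, 10, 9])        # numeric (integer) order

print(as_strings)   # ['10', '2', '9']
print(as_integers)  # [2, 9, 10]

# Lexicographically "10" < "9" because '1' < '9' at the first
# character, but numerically 10 > 9.
print("10" < "9", 10 < 9)  # True False
```

A model that blends the two representations can answer correctly on some comparisons and fail surprisingly on others, depending on which view dominates.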

Lance Ying (@lance_ying42)'s Twitter Profile Photo

Many studies suggest AI has achieved human-like performance on various cognitive tasks. But what is “human-like” performance? 

Our new paper conducted a human re-labeling of several popular AI benchmarks and found widespread biases and flaws in task and label designs. We make 5…
Max David Gupta (@maxdavidgupta1)'s Twitter Profile Photo

Happy to share my first first-authored work at Griffiths Computational Cognitive Science Lab. Determining sameness or difference between objects is utterly trivial to humans, but surprisingly inaccessible to AI. Meta-learning can help neural networks overcome this barrier. Link: arxiv.org/abs/2503.23212 (1/5)

Veniamin Veselovsky (@vminvsky)'s Twitter Profile Photo

New paper: Language models have “universal” concept representation – but can they capture cultural nuance? 🌏

If someone from Japan asks an LLM what color a pumpkin is, will it correctly say green (as they are in Japan)?

Or does cultural nuance require more than just language?
Sev Harootonian (@harootonian)'s Twitter Profile Photo

🚨 New preprint alert! 🚨

Thrilled to share new research on teaching! 
Work supervised by Griffiths Computational Cognitive Science Lab, Yael Niv @yaelniv.bsky.social, and Mark Ho.

This project asks: 
When do people teach by mentalizing vs with heuristics? 1/3

osf.io/preprints/osf/…
Alexander Ku (@alex_y_ku)'s Twitter Profile Photo

(1/11) Evolutionary biology offers a powerful lens into Transformers' learning dynamics! Two learning modes in Transformers (in-weights & in-context) mirror adaptive strategies in evolution. Crucially, environmental predictability shapes both systems similarly.

Griffiths Computational Cognitive Science Lab (@cocosci_lab)'s Twitter Profile Photo

New preprint! In-context and in-weights learning are two interacting forms of plasticity, like genetic evolution and phenotypic plasticity. We use ideas from evolutionary biology to predict when neural networks will use each kind of learning.

Griffiths Computational Cognitive Science Lab (@cocosci_lab)'s Twitter Profile Photo

This paper uses metalearning to distill a Bayesian prior into a set of initial weights for a neural network, providing a way to create networks with interpretable soft inductive biases. The resulting networks can learn just as quickly as a Bayesian model when applied to new data.
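A minimal sketch of the idea in one dimension (ours, assuming a Gaussian prior and Gaussian noise; not the paper's architecture or training procedure). In this toy setting the metalearned solution is known analytically: initialize at the prior mean and pick the step size and step count so the leftover weight on the initialization equals the Bayesian shrinkage factor; a few gradient steps then reproduce the exact posterior mean on new data.

```python
import numpy as np

rng = np.random.default_rng(0)
mu0, tau2, sigma2 = 2.0, 1.0, 4.0   # prior mean/variance, observation noise

def adapt(w0, x, lr, k):
    """k gradient steps on squared loss 0.5*(w - x)**2, starting from w0."""
    w = w0
    for _ in range(k):
        w -= lr * (w - x)
    return w

# After k steps, the estimate is alpha*w0 + (1-alpha)*x with
# alpha = (1 - lr)**k. Choosing alpha to equal the Bayesian shrinkage
# factor sigma^2 / (sigma^2 + tau^2) plays the role of the metalearned
# inductive bias (here we plug in the analytic solution directly).
alpha_target = sigma2 / (sigma2 + tau2)
k = 10
lr = 1 - alpha_target ** (1 / k)

# Sample tasks from the prior, observe noisy data, adapt from the
# prior mean, and compare to the exact Bayesian posterior mean.
theta = rng.normal(mu0, np.sqrt(tau2), size=1000)
x = theta + rng.normal(0.0, np.sqrt(sigma2), size=1000)
adapted = adapt(mu0, x, lr, k)                      # vectorized over tasks
posterior_mean = (tau2 * x + sigma2 * mu0) / (tau2 + sigma2)

print(np.max(np.abs(adapted - posterior_mean)))     # ~0: they coincide
```

In the full method, an outer metalearning loop would discover the initialization and adaptation schedule by optimizing post-adaptation loss over tasks sampled from the prior; this sketch just verifies that such a solution exists in the Gaussian case.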

Griffiths Computational Cognitive Science Lab (@cocosci_lab)'s Twitter Profile Photo

New preprint shows that training large language models to produce better chains of thought for predicting human decisions also results in them producing better psychological explanations.

Griffiths Computational Cognitive Science Lab (@cocosci_lab)'s Twitter Profile Photo

In this new preprint we use methods from cognitive science to explore how well large language models make inferences from observations and construct interventions to understand complex black-box systems, analogous to those that scientists seek to understand.

Griffiths Computational Cognitive Science Lab (@cocosci_lab)'s Twitter Profile Photo

Video games are a powerful tool for assessing the inductive biases of AI systems, as they are engineered around how humans perceive the world and pursue their goals. This new benchmark evaluates the abilities of vision language models on some challenging classic video games.