Chadi Helwe ๐Ÿ‡ฑ๐Ÿ‡ง ๐Ÿ‡ซ๐Ÿ‡ท (@chadi_helwe) 's Twitter Profile
Chadi Helwe ๐Ÿ‡ฑ๐Ÿ‡ง ๐Ÿ‡ซ๐Ÿ‡ท

@chadi_helwe

PhD in Artificial Intelligence from @IP_Paris_ Creator of logitorch.ai

ID: 363124507

linkhttp://chadihelwe.me calendar_today27-08-2011 15:29:46

188 Tweet

321 Followers

2,2K Following

Dima ุฏูŠู…ุง ุตุงุฏู‚ (@dimasadek) 's Twitter Profile Photo

ู‚ูุถูŠ ุงู„ุงู…ุฑ. ุงู†ู‡ุงุฑ ุงู„ู‚ุถุงุก ููŠ ู„ุจู†ุงู†.

Armin Ronacher โ‡Œ (@mitsuhiko) 's Twitter Profile Photo

Today in timezone madness: Lebanon had a last minute announcement of DST postponement. The Olson database complied. Now there is a dispute if it happens, and *parts* of the government reversed it. The country now has two timezones concurrently depending on who you ask.

Giselle Khoury (@gizou10) 's Twitter Profile Photo

ูŠุง ุนูŠุจ ุงู„ุดูˆู… ู„ุจู†ุงู† ุงู„ุฌุฏูŠุฏ ู…ู† ุงู†ุฌุงุฒุงุชู‡ ุงู„ุฃูˆู„ู‰ ู…ู†ุน ุงู†ุชุฎุงุจ ุฑุฆูŠุณ ู„ู„ุฌู…ู‡ูˆุฑูŠุฉ ุงู„ู„ุจู†ุงู†ูŠุฉ.

(๐Ÿ™๐Ÿ‡ต๐Ÿ‡ธ) ุนู…ุฑูˆ ู‚ู„ุฌ Amr Keleg (@amrkeleg) 's Twitter Profile Photo

ู‡ุฐู‡ ุฏุนูˆุฉ ู„ู„ู…ุดุงุฑูƒุฉ ููŠ ุงุณุชุทู„ุงุนู†ุง ุงู„ุฐูŠ ูŠุณุชู…ุฑ ุญุชู‰ ุงู„ุชุงุณุน ู…ู† ูŠูˆู„ูŠูˆ/ุชู…ูˆุฒ ูˆุงู„ู…ุชุนู„ู‚ ุจุงู„ุฃุฑุงุจูŠุฒูŠ (ูŠุดุงุฑ ุฅู„ูŠู‡ุง ุฃุญูŠุงู†ู‹ุง ุจุงุณู… ุฃุฑุงุจูŠุด ุฃูˆ ูุฑุงู†ูƒูˆ ุฃุฑุงุจ): edinburghinformatics.eu.qualtrics.com/jfe/form/SV_8pโ€ฆ (๐Ÿ™๐Ÿ‡ต๐Ÿ‡ธ) ุนู…ุฑูˆ ู‚ู„ุฌ Amr Keleg ุฃุญู…ุฏ ุฃู…ูŠู† Taha Yassine ุฅูŠู…ุงู† ฺจู„ูŠู„ Chadi Helwe ๐Ÿ‡ฑ๐Ÿ‡ง ๐Ÿ‡ซ๐Ÿ‡ท Nedjma Ousidhoum ู†ุฌู…ุฉ ุฃูˆุณูŠุฏู‡ู… (1/3) ๐Ÿงต

ู‡ุฐู‡ ุฏุนูˆุฉ ู„ู„ู…ุดุงุฑูƒุฉ ููŠ ุงุณุชุทู„ุงุนู†ุง ุงู„ุฐูŠ ูŠุณุชู…ุฑ ุญุชู‰ ุงู„ุชุงุณุน ู…ู† ูŠูˆู„ูŠูˆ/ุชู…ูˆุฒ ูˆุงู„ู…ุชุนู„ู‚ ุจุงู„ุฃุฑุงุจูŠุฒูŠ (ูŠุดุงุฑ ุฅู„ูŠู‡ุง ุฃุญูŠุงู†ู‹ุง ุจุงุณู… ุฃุฑุงุจูŠุด ุฃูˆ ูุฑุงู†ูƒูˆ ุฃุฑุงุจ): edinburghinformatics.eu.qualtrics.com/jfe/form/SV_8pโ€ฆ

<a href="/Amrkeleg/">(๐Ÿ™๐Ÿ‡ต๐Ÿ‡ธ) ุนู…ุฑูˆ ู‚ู„ุฌ Amr Keleg</a>  ุฃุญู…ุฏ ุฃู…ูŠู†
<a href="/taha_yssne/">Taha Yassine</a>  ุฅูŠู…ุงู† ฺจู„ูŠู„
<a href="/Chadi_Helwe/">Chadi Helwe ๐Ÿ‡ฑ๐Ÿ‡ง ๐Ÿ‡ซ๐Ÿ‡ท</a> <a href="/nedjmaou/">Nedjma Ousidhoum ู†ุฌู…ุฉ ุฃูˆุณูŠุฏู‡ู…</a>

(1/3) ๐Ÿงต
Francesco Orabona (@bremen79) 's Twitter Profile Photo

For people at #ICML2025, we have 2 papers in the Workshop on Assessing World Models 1) How are LLMs at playing trading card games (TCG)?Not good once you take care of the contamination! We create a new TGC evaluation task with publicly game engine but hidden card implementation

For people at #ICML2025, we have 2 papers in the Workshop on Assessing World Models

1) How are LLMs at playing trading card games (TCG)?Not good once you take care of the contamination! We create a new TGC evaluation task with publicly game engine but hidden card implementation
Francesco Orabona (@bremen79) 's Twitter Profile Photo

2) We introduce ReviseQA, a new benchmark that evaluates the ability to perform logical reasoning when information changes over multiple conversational turns. Our experiments show that current LLMs often fail to maintain logical consistency when updating beliefs.

2) We introduce ReviseQA, a new benchmark that evaluates the ability to perform logical reasoning when information changes over multiple conversational turns. Our experiments show that current LLMs often fail to maintain logical consistency when updating beliefs.