kstechly (@kayastechly) 's Twitter Profile
kstechly

@kayastechly

Linguistics M.A. at ASU working in the Yochan lab. Starting a Comp Sci PhD at Yale advised by Tom McCoy and Tyler Brooke-Wilson in Fall 2025.

ID: 1707282194576920576

linkhttp://kstechly.github.io calendar_today28-09-2023 06:32:58

11 Tweet

163 Followers

83 Following

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

One paper, lead by kstechly (w/ Matthew M), evaluated the claims over a suite of graph coloring problems. The setup allows for GPT4 guessing a valid coloring in stand alone and self-critiquing modes. There is an external sound verifier outside the self-critiquing loop. 2/

One paper, lead by <a href="/kayastechly/">kstechly</a> (w/ <a href="/mattdmarq/">Matthew M</a>), evaluated the claims over a suite of graph coloring problems. The setup allows for GPT4 guessing a valid coloring in stand alone and self-critiquing modes. There is an external sound verifier outside the self-critiquing loop. 2/
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

📢 Check out these posters on LLM Self-Critiquing (in)abilities in reasoning and planning tasks, being presented at the #NeurIPS2023 "Foundation Models for Decision Making" workshop today (12/15) by yochanites Karthik Valmeekam and kstechly in Hall E2.

📢 Check out these posters on LLM Self-Critiquing (in)abilities in reasoning and planning tasks, being presented at the #NeurIPS2023 "Foundation Models for Decision Making" workshop today (12/15) by yochanites <a href="/karthikv792/">Karthik Valmeekam</a> and <a href="/kayastechly/">kstechly</a> in Hall E2.
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

📢 "On the self-verification limitations of LLMs in Reasoning and Planning Tasks" arxiv.org/abs/2402.08115 (lead by Karthik Valmeekam and kstechly) 👇 Investigates LLM self-verification in three formal benchmarks--Game of 24, Graph Coloring and Planning, and shows that accuracy

📢 "On the self-verification limitations of LLMs in Reasoning and Planning Tasks" arxiv.org/abs/2402.08115  (lead by <a href="/karthikv792/">Karthik Valmeekam</a> and <a href="/kayastechly/">kstechly</a>) 👇

Investigates LLM  self-verification in three formal benchmarks--Game of 24, Graph Coloring and Planning, and shows that accuracy
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

A research note describing our evaluation of the planning capabilities of o1 🍓 is now on arXiv.org arxiv.org/abs/2409.13373 (thanks to Karthik Valmeekam & kstechly). As promised, here is a summary (..although you should read the whole thing..) 🧵 1/

A research note describing our evaluation of the planning capabilities of o1 🍓 is now on <a href="/arxiv/">arXiv.org</a> arxiv.org/abs/2409.13373 (thanks to <a href="/karthikv792/">Karthik Valmeekam</a> &amp; <a href="/kayastechly/">kstechly</a>). As promised, here is a summary (..although you should read the whole thing..) 🧵 1/
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

So Karthik Valmeekam, kstechly, Lucas Saldyt and I will be at #NeurIPS2024 starting Tuesday--to present a bunch of things👇. Easiest to catch us at 11th 11AM poster session at our "Chain of thoughtlessness" poster (East hall #3010). (I am around 10th-13th--and am happy to chat

So <a href="/karthikv792/">Karthik Valmeekam</a>, <a href="/kayastechly/">kstechly</a>, <a href="/SaldytLucas/">Lucas Saldyt</a>  and I will be at #NeurIPS2024 starting Tuesday--to present a bunch of things👇.  Easiest to catch us at 11th 11AM poster session  at our "Chain of thoughtlessness" poster (East hall #3010).

(I am around 10th-13th--and am happy to chat
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

👉 Our #NeurIPS2024 paper on Chain of Thoughtlessness @ 11AM poster session today (East Hall #3010, 11AM-2pm...). All three of us are here and looking forward to chat/answer qns.. 🙏

👉 Our #NeurIPS2024 paper on Chain of Thoughtlessness @ 11AM poster session today  (East Hall #3010, 11AM-2pm...). All three of us are here and looking forward to chat/answer qns.. 🙏
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

📢"On the Self-Verification Limitations of LLMs in Reasoning and Planning Tasks" arxiv.org/abs/2402.08115 with kstechly and Karthik Valmeekam apparently made it to #ICLR2025.. Swimming🏊 to Singapore.. 😎 [The 🧵s below give the details.. ]

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

PSA for #ICLR2025 authors frantically making posters: Stop worrying! Just prompt o3 and GPT-4o and you will have an AGI-ready poster in seconds! Here is one kstechly and Karthik Valmeekam cooked up--and it looks fully legit from poster distance!

PSA for #ICLR2025 authors frantically making posters: Stop worrying! Just prompt o3 and GPT-4o and you will have an AGI-ready poster in seconds! Here is one <a href="/kayastechly/">kstechly</a> and <a href="/karthikv792/">Karthik Valmeekam</a> cooked up--and it looks fully legit from poster distance!
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

So kstechly 🎓 from Arizona State University yesterday. Kaya was a rather unusual Yochanite-- didn't take any courses or do theses with me; and wasn't even in a CS degree! She was ever present in lab, and lunches though--and a force in all group meetings and many papers. She will be missed..

So <a href="/kayastechly/">kstechly</a> 🎓 from <a href="/ASU/">Arizona State University</a> yesterday. Kaya was a rather unusual Yochanite-- didn't take any courses or do theses with me; and wasn't even in a CS degree!

She was ever present in lab, and lunches though--and a force in all group meetings and many papers. 

She will be missed..