kstechly (@kayastechly) Twitter Tweets • TwiCopy

kstechly

@kayastechly


Linguistics M.A. at ASU working in the Yochan lab. Starting a Comp Sci PhD at Yale advised by Tom McCoy and Tyler Brooke-Wilson in Fall 2025.

ID: 1707282194576920576

Link: http://kstechly.github.io

Joined: 28-09-2023 06:32:58

11 Tweets

163 Followers

83 Following

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

2 years ago

One paper, led by kstechly (w/ Matthew M), evaluated the claims over a suite of graph coloring problems. The setup allows GPT-4 to guess a valid coloring in standalone and self-critiquing modes. An external sound verifier sits outside the self-critiquing loop. 2/


Likes: 34 · Replies: 1 · Retweets: 3

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

2 years ago

📢 Check out these posters on LLM Self-Critiquing (in)abilities in reasoning and planning tasks, being presented at the #NeurIPS2023 "Foundation Models for Decision Making" workshop today (12/15) by yochanites Karthik Valmeekam and kstechly in Hall E2.

Likes: 17 · Replies: 0 · Retweets: 6

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

2 years ago

📢 "On the self-verification limitations of LLMs in Reasoning and Planning Tasks" arxiv.org/abs/2402.08115 (led by Karthik Valmeekam and kstechly) 👇 Investigates LLM self-verification in three formal benchmarks--Game of 24, Graph Coloring and Planning, and shows that accuracy


Likes: 109 · Replies: 5 · Retweets: 36

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

2 years ago

Two Yochanites drove 15 hours to Austin city limits and saw this.. 🤗

Likes: 9 · Replies: 0 · Retweets: 1

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

a year ago

A research note describing our evaluation of the planning capabilities of o1 🍓 is now on arXiv.org arxiv.org/abs/2409.13373 (thanks to Karthik Valmeekam & kstechly). As promised, here is a summary (..although you should read the whole thing..) 🧵 1/


Likes: 682 · Replies: 17 · Retweets: 119

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

a year ago

Woo hoo.. Chain of Thoughtlessness paper will be showing up at #NeurIPS2024 🤗 Congrats to Karthik Valmeekam and kstechly [details below 👇]

Likes: 89 · Replies: 3 · Retweets: 13

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

a year ago

So Karthik Valmeekam, kstechly, Lucas Saldyt and I will be at #NeurIPS2024 starting Tuesday--to present a bunch of things👇. Easiest to catch us at the 11AM poster session on the 11th, at our "Chain of Thoughtlessness" poster (East Hall #3010). (I am around 10th-13th--and am happy to chat


Likes: 36 · Replies: 1 · Retweets: 7

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

a year ago

👉 Our #NeurIPS2024 paper on Chain of Thoughtlessness is at the 11AM poster session today (East Hall #3010, 11AM-2PM). All three of us are here and looking forward to chatting and answering questions.. 🙏

Likes: 35 · Replies: 0 · Retweets: 6

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

10 months ago

📢"On the Self-Verification Limitations of LLMs in Reasoning and Planning Tasks" arxiv.org/abs/2402.08115 with kstechly and Karthik Valmeekam apparently made it to #ICLR2025.. Swimming🏊 to Singapore.. 😎 [The 🧵s below give the details.. ]

Likes: 24 · Replies: 1 · Retweets: 6

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

8 months ago

📢Delighted to share that our analysis of the planning abilities of 🍓 o1 has now been accepted to #TMLR (Transactions on Machine Learning Research). Joint work with Karthik Valmeekam, kstechly and Atharva Gundawar. Final version, including DeepSeek R1 results, appearing soon..

Likes: 22 · Replies: 1 · Retweets: 4

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

7 months ago

PSA for #ICLR2025 authors frantically making posters: Stop worrying! Just prompt o3 and GPT-4o and you will have an AGI-ready poster in seconds! Here is one kstechly and Karthik Valmeekam cooked up--and it looks fully legit from poster distance!

Likes: 81 · Replies: 7 · Retweets: 12

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

6 months ago

So kstechly 🎓 from Arizona State University yesterday. Kaya was a rather unusual Yochanite-- didn't take any courses or do theses with me; and wasn't even in a CS degree! She was ever present in lab, and lunches though--and a force in all group meetings and many papers. She will be missed..


Likes: 17 · Replies: 2 · Retweets: 3