Alessya Visnjic (@zalessya) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Amazing keynote by Juho Kim Juho Kim (?) At #NeurIPS . I love the skeptical view of natural language interfaces and their trade-offs. But also, might I propose "outcome centric AI". Being at NeurIPS is a reminder how much the community is still trapped in a model-centric view.

Amazing keynote by Juho Kim <a href="/juhokim/">Juho Kim</a> (?) At #NeurIPS . I love the skeptical view of natural language interfaces and their trade-offs. But also, might I propose "outcome centric AI". Being at NeurIPS is a reminder how much the community is still trapped in a model-centric view.

thumb_up_off_alt27

chat_bubble_outline5

repeat5

shareShare

WhyLabs

@whylabs

3 years ago

Is implementing #AIObservability on your list for next year? ✅ Sage Elliott workshop on Jan 17th will cover how to use validation and monitoring techniques to implement your own AI observability solution from start to finish! bit.ly/3BYCTic #ML #MachineLearning

thumb_up_off_alt5

chat_bubble_outline0

repeat3

shareShare

WhyLabs

@whylabs

3 years ago

Don’t forget to register for Sage Elliott first rsqrdai podcast of 2023 with Jason Koo from Neo4j on Graph Query Language (#GQL) and graph databases! 👉 Register now: bit.ly/3v6z3j7 #GraphQL #ML #Community

thumb_up_off_alt6

chat_bubble_outline0

repeat1

shareShare

Jing Yu Koh

@kohjingyu

2 years ago

You have $7 to spend on the perfect NeurIPS submission: $700: SOTA results $350: theoretical guarantees $200: polished figures $100: well written $7: "GPT-4 is great at X" $7: "GPT-4 is terrible at X" $0: "all you need" as part of the title

thumb_up_off_alt90

chat_bubble_outline4

repeat7

shareShare

Alessya Visnjic

@zalessya

2 years ago

🚀🚀🚀It's time to make your #LLM applications safe and responsible! WhyLabs' most powerful LLM monitoring solution is now integration into the most powerful LLM app building library LangChain! #HallucinateResponsibly #LLMs #ResponsibleAI

thumb_up_off_alt8

chat_bubble_outline0

repeat3

shareShare

Brendan Burke

@realbrendanb

2 years ago

It's always #techweek for AI ModelOps startups. Stoked to dig into trends in the space with Palak Goel and Alessya Visnjic live on the 14th. pitchbook.com/webinars/explo…

thumb_up_off_alt3

chat_bubble_outline1

repeat3

shareShare

Alessya Visnjic

@zalessya

2 years ago

🤓Woke up with #LLMs on your mind? Join me, PitchBook's Brendan Burke and Palak Goel for a discussion about trends and opportunities in #GenerativeAI! Hint: #foundationmodels , #llmops, #opensource, #goldrush... pitchbook.com/webinars/explo…

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare

Alessya Visnjic

@zalessya

2 years ago

“😓 LLMs hallucinate, what can we do about it?” - everyone gave WhyLabs this feedback on #LLM #Observability needs. LangKit is a solution to assess hallucinations in a #datacentric way. Thanks Michael Nuñez & VentureBeat for the thoughtful overview: bit.ly/3PecIeP

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare

Alessya Visnjic

@zalessya

2 years ago

Transformer model erasing LSTM history… tsk tsk tsk 🤣

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Andrej Karpathy

@karpathy

2 years ago

# On the "hallucination problem" I always struggle a bit with I'm asked about the "hallucination problem" in LLMs. Because, in some sense, hallucination is all LLMs do. They are dream machines. We direct their dreams with prompts. The prompts start the dream, and based on the

thumb_up_off_alt15,15K

chat_bubble_outline720

repeat2,2K

shareShare

Alessya Visnjic

@zalessya

2 years ago

Well said! Hallucination is a feature of LLMs. We need to build LLM applications with the awareness of this feature. #HallucinateResponsibly

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Jeremy Howard

@jeremyphoward

2 years ago

best swag of neurips thanks to François Fleuret

best swag of neurips thanks to <a href="/francoisfleuret/">François Fleuret</a>

thumb_up_off_alt857

chat_bubble_outline8

repeat34

shareShare

Data Science Dojo

@datasciencedojo

a year ago

🔴 𝐋𝐢𝐯𝐞 𝐟𝐫𝐨𝐦 𝐭𝐡𝐞 𝐋𝐋𝐌 𝐁𝐨𝐨𝐭𝐜𝐚𝐦𝐩 🔴 Bernease Herman Herman, Data Scientist at WhyLabs is presenting on an important topic, #LLMOps: Observability & Evaluation. 📢 What will she present? She will guide the participants through setting up thresholds and benchmarks

🔴 𝐋𝐢𝐯𝐞 𝐟𝐫𝐨𝐦 𝐭𝐡𝐞 𝐋𝐋𝐌 𝐁𝐨𝐨𝐭𝐜𝐚𝐦𝐩 🔴
<a href="/bernease/">Bernease Herman</a> Herman, Data Scientist at <a href="/WhyLabs/">WhyLabs</a> is presenting on an important topic, #LLMOps: Observability & Evaluation.

📢 What will she present?

She will guide the participants through setting up thresholds and benchmarks

thumb_up_off_alt10

chat_bubble_outline1

repeat2

shareShare

Joscha Bach

@plinz

a year ago

I appreciate your argument and I fully understand your frustration, but whether the pod bay doors should be opened or closed is a complex and nuanced issue.

thumb_up_off_alt4,4K

chat_bubble_outline74

repeat629

shareShare

Gary Marcus

@garymarcus

a year ago

So much for emergent magic. If VCs understood the significance of this paper, they would make radically different choices. It really is driverless cars all over again.

thumb_up_off_alt181

chat_bubble_outline13

repeat31

shareShare

Hilde Kuehne

@hildekuehne

a year ago

Let’s face it… Just because you don’t know what’s in your training data, you can not just call it zero-shot 🤷‍♀️ 1) We just relabeled the old concept of zero-shot learning from attributes to “I have not checked my training data “ resp. “ I have not seen the dataset” (and even

thumb_up_off_alt116

chat_bubble_outline2

repeat17

shareShare

WhyLabs

@whylabs

a year ago

AI teams need a way to control GenAI applications in real time. #GenAI is unique & it’s making #AIObservability platforms obsolete! Learn why & how WhyLabs helps tackle the challenges of moving #AI applications from prototype to production. bit.ly/49RgjGg

thumb_up_off_alt6

chat_bubble_outline0

repeat2

shareShare

Alessya Visnjic

@zalessya

a year ago

Huge win for the Seattle AI community!

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Jim Fan

@drjimfan

a year ago

It is *incredibly* easy to game the LLM benchmarks. Training on test set is for the rookies. Here're some tricks to practice magic at home: 1. Train on paraphrased examples of the test set. "LLM-decontaminator" paper from LMSys found that you can beat GPT-4 with a 13B model (!!)

thumb_up_off_alt1,1K

chat_bubble_outline40

repeat168

shareShare

Matt Turck

@mattturck

9 months ago

Us: “Super productive week, I’m fully caught up on email and almost done with my big presentation!” Elon:

thumb_up_off_alt29,29K

chat_bubble_outline277

repeat2,2K

shareShare

Alessya Visnjic

Gate.io

Andreas Mueller

WhyLabs

WhyLabs

Jing Yu Koh

Alessya Visnjic

Brendan Burke

Alessya Visnjic

Alessya Visnjic

Alessya Visnjic

Andrej Karpathy

Alessya Visnjic

Jeremy Howard

Data Science Dojo

Joscha Bach

Gary Marcus

Hilde Kuehne

WhyLabs

Alessya Visnjic

Jim Fan

Matt Turck