Hy Dang (@hydang99)'s Twitter Profile
Hy Dang

@hydang99

PhD Student @ Uni. of Notre Dame

ID: 1568508746590031877

Joined: 10-09-2022 07:57:07

57 Tweets

24 Followers

68 Following

Leonie (@helloiamleonie)'s Twitter Profile Photo

Over the recent weeks, an epic collaboration among some of the best practitioners in this space has brought us a three-part series of "What We Learned from a Year of Building with LLMs" on <a href="/OReillyMedia/">O'Reilly Media</a>!

In this three-part series, <a href="/eugeneyan/">Eugene Yan</a>, <a href="/BEBischof/">Bryan Bischof fka Dr. Donut</a>, <a href="/charles_irl/">Charles 🎉 Frye</a>,
Andrew Parry (@mrparryparry)'s Twitter Profile Photo

🚨 Happy to say that I'll be presenting our work (w/ Sean MacAvaney & Debasis Ganguly) "Top-Down Partitioning for Efficient List-Wise Ranking" at the ReNeuIR Workshop @ SIGIR 2025 in Washington! Here's a pre-print, with updates coming soon: arxiv.org/abs/2405.14589 #SIGIR2024

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr)'s Twitter Profile Photo

Large Language Models Must Be Taught to Know What They Don't Know

abs: arxiv.org/abs/2406.08391

Prompting alone is not enough for LLMs to produce accurate estimates of the uncertainty of their responses, but they can be fine-tuned with as little as 1,000 examples to outperform baselines for
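To make the idea concrete, here is a hedged Python sketch of the kind of small fine-tuning set the tweet alludes to: pairing model answers with graded confidence targets so the model learns to verbalize uncertainty. The field names and prompt template are illustrative assumptions, not the paper's actual format.

```python
# Hypothetical sketch: a tiny fine-tuning set that teaches a model to
# verbalize its own confidence. Field names and the template are
# illustrative assumptions, not the paper's actual data format.

def make_uncertainty_example(question, model_answer, is_correct):
    """Pair a (question, answer) with a graded confidence target."""
    if is_correct:
        target = "I am confident in this answer."
    else:
        target = "I am not confident in this answer."
    return {
        "prompt": f"Q: {question}\nA: {model_answer}\nHow confident are you?",
        "completion": target,
    }

# The paper reports that as few as ~1,000 such examples suffice.
dataset = [
    make_uncertainty_example("What is 2+2?", "4", True),
    make_uncertainty_example("Capital of Australia?", "Sydney", False),
]
```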
dinos (@din0s_)'s Twitter Profile Photo

📚 Awesome Information Retrieval 🔍
I've compiled a list of some of my favorite IR papers from the past few years. If you're new to the field and want to understand how Transformer-based retrieval models work before building your RAG application, this should serve as a great
Sumit (@_reachsumit)'s Twitter Profile Photo

APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking

Proposes a novel automatic prompt engineering algorithm for zero-shot passage relevance ranking, outperforming manual prompts across various LLMs.

๐Ÿ“arxiv.org/abs/2406.14449
๐Ÿ‘จ๐Ÿฝโ€๐Ÿ’ปgithub.com/jincan333/APEER
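As a rough intuition for automatic prompt engineering in this setting, here is a minimal search-loop sketch: propose prompt variants, score them on a dev set, and keep the best. The scorer and refiner below are stand-ins (the real method would call an LLM for both), so nothing here is APEER's actual algorithm.

```python
# Illustrative sketch of automatic prompt search for passage reranking,
# loosely in the spirit of APEER. `evaluate_prompt` and `refine` are
# stand-ins: in practice both would involve LLM calls and a real
# ranking metric such as nDCG on dev queries.
import random

def evaluate_prompt(prompt, dev_set):
    """Stand-in scorer; a real one would run the LLM reranker over dev
    queries with `prompt` and return a ranking metric."""
    return sum(len(prompt) % (i + 3) for i, _ in enumerate(dev_set)) / len(dev_set)

def refine(prompt):
    """Stand-in for LLM-based prompt rewriting (the 'refinement' step)."""
    suffixes = [" Be concise.", " Rank by relevance only.", " Think step by step."]
    return prompt + random.choice(suffixes)

def search_prompt(seed_prompt, dev_set, rounds=5):
    """Greedy search: keep a candidate only if it scores higher."""
    best, best_score = seed_prompt, evaluate_prompt(seed_prompt, dev_set)
    for _ in range(rounds):
        cand = refine(best)
        score = evaluate_prompt(cand, dev_set)
        if score > best_score:
            best, best_score = cand, score
    return best
```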
elvis (@omarsar0)'s Twitter Profile Photo

Improving Retrieval in LLMs by Finetuning on Synthetic Data

Proposes a fine-tuning approach to improve the accuracy of retrieving information in LLMs while maintaining reasoning capabilities over long-context inputs.  

The fine-tuning dataset comprises numerical dictionary
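A hedged sketch of what "numerical dictionary" synthetic data could look like: a long key-value context plus a lookup question, which exercises retrieval over long inputs. The exact format in the paper may well differ.

```python
# Illustrative generator for synthetic key-value retrieval data of the
# kind the tweet describes. Formatting details are assumptions.
import random

def make_kv_retrieval_example(num_pairs=50, seed=0):
    rng = random.Random(seed)
    keys = rng.sample(range(10_000, 99_999), num_pairs)
    kv = {k: rng.randint(0, 999) for k in keys}
    target_key = rng.choice(keys)
    # The "document": a long flat dictionary the model must search.
    context = "; ".join(f"{k}: {v}" for k, v in kv.items())
    prompt = (f"Dictionary: {context}\n"
              f"What is the value for key {target_key}?")
    return {"prompt": prompt, "answer": str(kv[target_key])}
```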
Rohan Paul (@rohanpaul_ai)'s Twitter Profile Photo

OpenAI provides a comprehensive guide on enhancing the accuracy of Large Language Models (LLMs), emphasizing methods to improve response correctness and consistency.

Rather than approaching LLM accuracy optimization as a straightforward, linear process that progresses from
Leonie (@helloiamleonie)'s Twitter Profile Photo

New to fine-tuning LLMs?
Confused by all the jargon?

Me, too.

So, I did a little deep dive into LLM fine-tuning.
Here's what I understood:
elvis (@omarsar0)'s Twitter Profile Photo

Your LLM is only as good as how robust your prompting method is.

Seems you can enhance the robustness of LLMs by "prompting out" irrelevant information from context. Think of it as a self-mitigation process that first identifies the irrelevant information and then filters it
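The two-step self-mitigation described above might be wired up like this minimal sketch, where `call_llm` is a placeholder for any chat-completion call and the two templates are illustrative assumptions:

```python
# Minimal sketch of "prompting out" irrelevant context in two steps:
# (1) ask the model to flag irrelevant sentences, (2) answer using only
# what remains. Templates and `call_llm` are illustrative placeholders.

IDENTIFY_TMPL = (
    "Question: {question}\nContext:\n{context}\n"
    "List the sentences in the context that are irrelevant to the question."
)
ANSWER_TMPL = (
    "Question: {question}\nFiltered context:\n{context}\n"
    "Answer using only the filtered context."
)

def prompt_out(question, sentences, call_llm):
    # Step 1: self-identify irrelevant sentences.
    flagged = call_llm(IDENTIFY_TMPL.format(
        question=question, context="\n".join(sentences)))
    # Step 2: drop flagged sentences and answer on the cleaned context.
    kept = [s for s in sentences if s not in flagged]
    return call_llm(ANSWER_TMPL.format(
        question=question, context="\n".join(kept)))
```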
Bindu Reddy (@bindureddy)'s Twitter Profile Photo

Graph RAG Works Better Than Standard RAG

GraphRAG leverages structural information across entities to enable more precise and comprehensive retrieval, capturing relational knowledge and facilitating more accurate, context-aware responses. 

This improves the accuracy of standard
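At a toy level, the structural retrieval being described can be sketched as an entity graph where a query pulls in a node's neighborhood rather than just the node itself. Everything below (graph shape, relation encoding, facts) is an illustrative assumption, not any specific GraphRAG implementation.

```python
# Toy sketch of graph-based retrieval: facts live on edges between
# entities, and retrieval expands outward from the matched entity so
# relational context travels with it.
from collections import defaultdict

graph = defaultdict(dict)

def add_fact(head, relation, tail):
    graph[head][tail] = relation

def retrieve(entity, hops=1):
    """Collect facts within `hops` edges of the queried entity."""
    facts, frontier = [], {entity}
    for _ in range(hops):
        nxt = set()
        for h in frontier:
            for t, rel in graph.get(h, {}).items():
                facts.append(f"{h} --{rel}--> {t}")
                nxt.add(t)
        frontier = nxt
    return facts

add_fact("Marie Curie", "won", "Nobel Prize in Physics")
add_fact("Nobel Prize in Physics", "awarded_in", "1903")
```

With `hops=2`, a query about "Marie Curie" retrieves the award year too, which a flat chunk store could easily miss.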
Kalyan KS (@kalyan_kpl)'s Twitter Profile Photo

Top RAG Papers of the Week

[1] Meta Knowledge for Retrieval Augmented Large Language Models

[2] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation

[3] Graph Retrieval-Augmented Generation: A Survey

[4] CommunityKG-RAG: Leveraging
elvis (@omarsar0)'s Twitter Profile Photo

This Python tool looks super useful to crawl websites and convert data into LLM-ready markdown or structured data.

I find myself doing this a lot and most of the time it is a tedious effort. Great to see a service that does data extraction catered for LLM-based pipelines.
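The tweet doesn't name the tool, so here is a stdlib-only Python sketch of the core step, reducing fetched HTML to LLM-ready plain text. A real crawler (and the service mentioned) would also handle markdown structure, links, and pagination far more carefully.

```python
# Minimal HTML-to-text reduction using only the standard library,
# as a stand-in for the unnamed crawling tool in the tweet.
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    SKIP = {"script", "style"}  # non-content tags to drop entirely

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())

def html_to_text(html):
    """Strip tags, scripts, and styles; keep visible text, one chunk per line."""
    p = TextExtractor()
    p.feed(html)
    return "\n".join(p.parts)
```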
elvis (@omarsar0)'s Twitter Profile Photo

RAG vs. Long-Context LLMs

I have yet to see a convincing paper or technical blog showing that long-context LLMs can or will replace RAG.

So far I've seen specific long-context applications where long-context LLMs thrive and current retrieval benchmarks are not convincing.

This
Leonie (@helloiamleonie)'s Twitter Profile Photo

Here's why ColBERT embeddings are all the rage right now (at an intuitive level):

You probably already know that vector search is pretty cool.
• It allows you to search for things semantically.
• It's robust to synonyms.

But do you know what sucks about vector search?

It
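The intuition lands quickly in code: ColBERT scores with "late interaction", keeping one vector per token and summing each query token's best match over document tokens (the MaxSim operator), instead of squashing everything into a single vector. The toy 2-d vectors below stand in for real token embeddings.

```python
# Toy illustration of ColBERT-style MaxSim scoring: per-token vectors
# are kept, and each query token contributes its best match among the
# document's token vectors.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def maxsim_score(query_vecs, doc_vecs):
    """Sum over query tokens of the max similarity to any doc token."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)

query = [[1.0, 0.0], [0.0, 1.0]]   # two query tokens
doc_a = [[1.0, 0.0], [0.9, 0.1]]   # covers only the first query token
doc_b = [[1.0, 0.0], [0.0, 1.0]]   # covers both query tokens

# doc_b wins because every query token finds a strong per-token match,
# a distinction single-vector search tends to blur.
assert maxsim_score(query, doc_b) > maxsim_score(query, doc_a)
```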
Leonie (@helloiamleonie)'s Twitter Profile Photo

If you embed an entire document, you'll lose retrieval precision.

If you chunk a document, you'll lose contextual information between chunks.

These are some concerns when you're building long-context RAG applications.

But "Late chunking" may just be the sweet spot in the
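A minimal sketch of the late-chunking idea, assuming a stand-in `encode_tokens` for a real long-context embedding model: encode the whole document first so each token vector carries document-wide context, then pool per chunk, rather than chunking the text and embedding each piece in isolation.

```python
# Hedged sketch of late chunking: contextual token embeddings are
# computed over the FULL document, then mean-pooled per chunk.
# `encode_tokens` is an illustrative stand-in for a real encoder.

def encode_tokens(tokens):
    """Stand-in contextual encoder: toy vectors that depend on position
    within the whole document (hence 'contextual')."""
    n = len(tokens)
    return [[len(t) / 10.0, i / n] for i, t in enumerate(tokens)]

def mean_pool(vectors):
    dim = len(vectors[0])
    return [sum(v[j] for v in vectors) / len(vectors) for j in range(dim)]

def late_chunk_embeddings(tokens, chunk_size):
    token_vecs = encode_tokens(tokens)  # context spans the full document
    return [mean_pool(token_vecs[i:i + chunk_size])
            for i in range(0, len(tokens), chunk_size)]
```

The contrast with naive chunking is the order of operations: chunk-then-embed loses cross-chunk context; embed-then-chunk (above) keeps it.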
Dr Mehmood (@en_conversion)'s Twitter Profile Photo

Professor Edward H. Sargent, a Canadian scientist who has published many papers in top journals such as #science and #Nature, wrote 10 tips on how to write papers. <a href="/PhDvibe/">PhDvibe</a> <a href="/PhDVoice/">PhD Voice - Independently Run</a> <a href="/Labiofy/">Labiofy</a> #PhDposition #phdlife #PhD #postdoc #chemtwitter #Science #AcademicChatter