Hansi Zeng (@hansizeng) 's Twitter Profile
Hansi Zeng

@hansizeng

CS PhD @ UMass Amherst CIIR | Prev Intern @GoogleDeepMind, @Amazon @Lowes

ID: 1103796483382497282

linkhttps://hansizeng.github.io/ calendar_today07-03-2019 23:16:03

45 Tweet

298 Followers

359 Following

Tianxin Wei (@wei_tianxin) 's Twitter Profile Photo

๐Ÿš€ Excited to share our latest work at #ICLR2024! ๐Ÿ“„ "Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond", conducted during my internship at Amazon! ๐ŸŒLeveraging multi-modal data for various personalized tasks.

๐Ÿš€ Excited to share our latest work at #ICLR2024! 

๐Ÿ“„ "Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond", conducted during my internship at <a href="/amazon/">Amazon</a>!

๐ŸŒLeveraging multi-modal data for various personalized tasks.
Bowen Jin (@bowenjin13) 's Twitter Profile Photo

๐Ÿš€Excited to share "Language Models as Semantic Indexers" is accepted to ICML 2024! โญ๏ธWe propose to learn document semantic IDs with large language models in a self-supervised fashion. โญ๏ธThe learned semantic IDs can benefit LLM generative recommendation and retrieval. #LLM #IR

๐Ÿš€Excited to share "Language Models as Semantic Indexers" is accepted to ICML 2024!

โญ๏ธWe propose to learn document semantic IDs with large language models in a self-supervised fashion.
โญ๏ธThe learned semantic IDs can benefit LLM generative recommendation and retrieval.
#LLM #IR
The IRLab at the University of Amsterdam (@irlab_amsterdam) 's Twitter Profile Photo

Join us next Friday, June 28th, for our last SEA meet-up before the summer break! ๐ŸŽ‰ @snbruch (Pinecone) and Hansi Zeng (UMass Amherst) will discuss recent innovations in search indexing for dense retrieval and in scaling generative IR. Sign up here ๐Ÿ‘‰ meetup.com/de-DE/sea-searโ€ฆ

Hansi Zeng (@hansizeng) 's Twitter Profile Photo

Excited to be in DC for SIGIR! My first time for an in-person conference! I'll present my paper, "Planning-Ahead in Generative Retrieval" tomorrow at 3:05 pm (Monday). Don't miss the M2.1 GenIR and future of LLMs for search session. Looking forward to chatting with IR folks!

Bowen Jin (@bowenjin13) 's Twitter Profile Photo

Not arrived at Vienna!๐Ÿ˜… Sad that cannot make it this time. #ICML2024 However, if you are interested in ๐—Ÿ๐—Ÿ๐—  ๐—ฎ๐—ป๐—ฑ ๐—ฟ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฒ๐˜ƒ๐—ฎ๐—น, come and check out our paper tomorrow afternoow at Hall C 4-9 #404! paper link: arxiv.org/pdf/2310.07815

Zhenrui Yue (@yueeeeeeee2837) 's Twitter Profile Photo

๐Ÿ”ฅ Unleashing the Power of Inference Scaling in Long-Context RAG ๐Ÿ”ฅ Excited to share our latest research Google DeepMind "Inference Scaling for Long-Context Retrieval Augmented Generation (RAG)" ๐ŸŽ“, in collaboration with Honglei Zhuang Aijun Bai Kai Hui Rolf Jagerman Hansi Zeng

๐Ÿ”ฅ Unleashing the Power of Inference Scaling in Long-Context RAG ๐Ÿ”ฅ
Excited to share our latest research <a href="/GoogleDeepMind/">Google DeepMind</a> "Inference Scaling for Long-Context Retrieval Augmented Generation (RAG)" ๐ŸŽ“, in collaboration with <a href="/HongleiZhuang/">Honglei Zhuang</a> Aijun Bai <a href="/kaihuibj/">Kai Hui</a> <a href="/RolfJagerman/">Rolf Jagerman</a> <a href="/HansiZeng/">Hansi Zeng</a>
Hamed Zamani (@hamedzamani) 's Twitter Profile Photo

๐Ÿ“ข An excellent opportunity for PhD students in IR and NLP: The Center for Intelligent Information Retrieval (CIIR) at the UMass Amherst is initiating an exciting Research Internship program for Summer 2025. See the thread for more info. ๐Ÿ‘‡ #SIGIR #NLProc

Bowen Jin (@bowenjin13) 's Twitter Profile Photo

LLM Alignment as Retriever Optimization: An Information Retrieval Perspective arxiv.org/abs/2502.03699 We introduce a comprehensive framework that connects LLM alignment techniques with the established IR principles, providing a new perspective on LLM alignment.

LLM Alignment as Retriever Optimization: An Information Retrieval Perspective
arxiv.org/abs/2502.03699
We introduce a comprehensive framework that connects LLM alignment techniques with the established IR principles, providing a new perspective on LLM alignment.
Julian Killingback (@julian_a42f9a) 's Twitter Profile Photo

๐ŸŽ‰I'm happy to announce my first PhD paper: "Hypencoder: Hypernetworks for Information Retrieval" ๐ŸŽ‰ We investigate a way to model relevance beyond inner-products. Instead of using a query vector, we use a query-specific neural net produced by a hypernetwork encoder (Hypencoder)

๐ŸŽ‰I'm happy to announce my first PhD paper: "Hypencoder: Hypernetworks for Information Retrieval" ๐ŸŽ‰

We investigate a way to model relevance beyond inner-products. Instead of using a query vector, we use a query-specific neural net produced by a hypernetwork encoder (Hypencoder)
Sumit (@_reachsumit) 's Twitter Profile Photo

Scaling Sparse and Dense Retrieval in Decoder-Only LLMs Hansi Zeng et al investigate how different retrieval paradigms scale with larger models, showing sparse retrieval consistently outperforms dense retrieval while demonstrating better generalization ๐Ÿ“arxiv.org/abs/2502.15526

Bowen Jin (@bowenjin13) 's Twitter Profile Photo

๐Ÿš€ Introducing ๐—ฆ๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต-๐—ฅ๐Ÿญ โ€“ the first ๐—ฟ๐—ฒ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—ผ๐—ณ ๐——๐—ฒ๐—ฒ๐—ฝ๐˜€๐—ฒ๐—ฒ๐—ธ-๐—ฅ๐Ÿญ (๐˜‡๐—ฒ๐—ฟ๐—ผ) for training reasoning and search-augmented LLM agents with reinforcement learning! This is a step towards training an ๐—ผ๐—ฝ๐—ฒ๐—ป-๐˜€๐—ผ๐˜‚๐—ฟ๐—ฐ๐—ฒ ๐—ข๐—ฝ๐—ฒ๐—ป๐—”๐—œ โ€œ๐——๐—ฒ๐—ฒ๐—ฝ

Bowen Jin (@bowenjin13) 's Twitter Profile Photo

๐Ÿš€ Excited to announce that our paper ๐’๐ž๐š๐ซ๐œ๐ก-๐‘๐Ÿ is now live! ๐Ÿ“„ We introduce an RL framework (an extension of ๐ƒ๐ž๐ž๐ฉ๐ฌ๐ž๐ž๐ค-๐‘๐Ÿ) for training reasoning-and-retrieval interleaved LLMs. Weโ€™re also open-sourcing all resourcesโ€”models, data, and more! ๐Ÿ“œ Paper:

๐Ÿš€ Excited to announce that our paper ๐’๐ž๐š๐ซ๐œ๐ก-๐‘๐Ÿ
 is now live! ๐Ÿ“„

We introduce an RL framework (an extension of ๐ƒ๐ž๐ž๐ฉ๐ฌ๐ž๐ž๐ค-๐‘๐Ÿ) for training reasoning-and-retrieval interleaved LLMs. Weโ€™re also open-sourcing all resourcesโ€”models, data, and more!

๐Ÿ“œ Paper:
Hansi Zeng (@hansizeng) 's Twitter Profile Photo

Can LLMs learn to search and reason interleavedly for complex QA tasks without predefined rules? Yes! ๐Ÿš€ Check out Search-R1, our framework that leverages the DeepSeek-R1-style RL approach to achieve this.

Julian Killingback (@julian_a42f9a) 's Twitter Profile Photo

Iโ€™m thrilled to share that two of my papers were accepted to SIGIR 2025: โ€œHypencoder: Hypernetworks for Information Retrievalโ€ and โ€œScaling Sparse and Dense Retrieval in Decoder-Only LLMsโ€ with Hansi Zeng Hamed Zamani

Bowen Jin (@bowenjin13) 's Twitter Profile Photo

๐Ÿšจ Big updates to ๐—ฆ๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต-๐—ฅ๐Ÿญ! ๐Ÿš€ ๐Ÿง  Now supports ๐—บ๐˜‚๐—น๐˜๐—ถ-๐—ป๐—ผ๐—ฑ๐—ฒ ๐˜๐—ฟ๐—ฎ๐—ถ๐—ป๐—ถ๐—ป๐—ด โ€” train 32B+ LLMs with search + reasoning: github.com/PeterGriffinJiโ€ฆ ๐Ÿ” Added support for ๐—น๐—ผ๐—ฐ๐—ฎ๐—น ๐˜€๐—ฝ๐—ฎ๐—ฟ๐˜€๐—ฒ/๐—ฑ๐—ฒ๐—ป๐˜€๐—ฒ ๐—ฟ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฒ๐˜ƒ๐—ฒ๐—ฟ๐˜€ & ๐—ผ๐—ป๐—น๐—ถ๐—ป๐—ฒ ๐˜€๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต