Yizhi Li (@yizhilll)'s Twitter Profile
Yizhi Li

@yizhilll

PhD Student @Manchester_NLP; Multimodal Art Projection research community (m-a-p.ai)

ID: 715820659021062145

Website: http://yizhilll.github.io
Joined: 01-04-2016 08:38:39

211 Tweets

347 Followers

497 Following

Manchester NLP (@manchester_nlp):

We’re pleased to share that the Manchester NLP Group will be presenting *11 papers* at #EMNLP2024. Feel free to drop by and chat with our students and colleagues during their poster sessions and presentations! 🐝🍻
Computer Science @ The University of Manchester @manchester_nlp
Yizhi Li (@yizhilll):

Hi community, I am running for student member of @siggen_acl 🤝 Vote for me to build bridges and boost our research community together - "students, industry & academia". I am here to serve YOU. Every vote matters! #NLG #SIGGEN2024 Vote at bitl.to/3IQk
Wenhu Chen (@wenhuchen):

**SoTA-VLM**
Vision-language models have been known to be weak at reasoning. Many open-source models are inclined to produce very short phrase answers without any intermediate reasoning.

One of the major reasons is the lack of CoT-rich instruction datasets. In MAmmoTH-VL, we
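
The contrast the tweet draws, short-answer data versus CoT-rich data, can be illustrated with two made-up samples; the field names and contents below are assumptions, not MAmmoTH-VL's actual schema:

```python
# Hypothetical contrast between a short-answer VQA sample and a CoT-rich
# instruction sample; everything here is illustrative only.
short_answer_sample = {
    "image": "chart.png",
    "question": "Which year had the highest revenue?",
    "answer": "2021",  # no intermediate reasoning for the model to learn from
}

cot_sample = {
    "image": "chart.png",
    "question": "Which year had the highest revenue?",
    "answer": (
        "The bars show revenue of 3.1M (2019), 4.7M (2020), and 5.2M (2021). "
        "5.2M is the largest value, so the answer is 2021."  # step-by-step rationale
    ),
}
```
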
Ge Zhang (@gezhang86038849):

[2/n]
1. Deduplicate: FineWeb undergoes exact deduplication and MinHash deduplication (see the sketch after this list).

2. URL Label: Count all the root URLs in FineWeb, and use GPT-4 to label the 1 million most frequent root URLs.

3. Broad Recall: Down-sample and recall the data of each domain from
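
A minimal sketch of what the MinHash step (1) could look like, using the `datasketch` library; the shingle size and similarity threshold here are illustrative choices, not the values used for FineWeb:

```python
from datasketch import MinHash, MinHashLSH

def minhash_of(text: str, num_perm: int = 128) -> MinHash:
    """Build a MinHash signature from 5-word shingles of a document."""
    m = MinHash(num_perm=num_perm)
    words = text.split()
    for i in range(max(1, len(words) - 4)):
        m.update(" ".join(words[i:i + 5]).encode("utf8"))
    return m

def near_dedup(docs: list[str], threshold: float = 0.8) -> list[str]:
    """Keep only the first document of each near-duplicate cluster."""
    lsh = MinHashLSH(threshold=threshold, num_perm=128)
    kept = []
    for idx, doc in enumerate(docs):
        m = minhash_of(doc)
        if not lsh.query(m):        # no near-duplicate indexed yet
            lsh.insert(str(idx), m)
            kept.append(doc)
    return kept
```
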
Qian Liu (@sivil_taram):

🎉 Announcing the first Open Science for Foundation Models (SCI-FM) Workshop at #ICLR2025! Join us in advancing transparency and reproducibility in AI through open foundation models.

🤝 Looking to contribute? Join our Program Committee: bit.ly/4acBBjF

🔍 Learn more at:
Ge Zhang (@gezhang86038849):

[1/n]

SuperExcited to announce SuperGPQA!!!
We spent more than half a year to finally get it done!
SuperGPQA is a comprehensive benchmark that evaluates graduate-level knowledge and reasoning capabilities across 285 disciplines.
It also provides the largest human-LLM
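
For context, a multiple-choice benchmark like this is typically scored with a loop of the following shape; the item schema and `model.generate` interface are assumptions for illustration, not SuperGPQA's actual harness:

```python
def score_item(model, item: dict) -> bool:
    """Ask the model a multiple-choice question and check the letter answer."""
    letters = "ABCDEFGHIJ"[: len(item["options"])]
    prompt = (
        item["question"] + "\n"
        + "\n".join(f"{l}. {opt}" for l, opt in zip(letters, item["options"]))
        + "\nAnswer with a single letter."
    )
    prediction = model.generate(prompt).strip()[:1].upper()
    return prediction == item["answer"]

# accuracy = sum(score_item(model, it) for it in dataset) / len(dataset)
```
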
Haibin (@eric_haibin_lin):

❗️Open-source MoE kernels alert❗️ Introducing COMET, a computation/communication library for MoE models from ByteDance. Battle-tested in our 10k+ GPU clusters, COMET shows promising efficiency gains and significant GPU-hour savings (millions 💰💰💰). Integration of DualPipe &
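
The general technique COMET optimizes, overlapping MoE computation with communication, can be sketched in plain PyTorch; this is an illustration of the idea under assumed buffers and experts, not COMET's API or kernels:

```python
import torch
import torch.distributed as dist

# Assumes dist.init_process_group(...) has already been called.
comm_stream = torch.cuda.Stream()

def overlapped_moe_step(local_tokens, send_buf, recv_buf,
                        local_expert, remote_expert):
    # Launch the all-to-all token exchange on a side stream...
    with torch.cuda.stream(comm_stream):
        dist.all_to_all_single(recv_buf, send_buf)
    # ...while the default stream keeps computing on tokens already local.
    local_out = local_expert(local_tokens)
    # Only wait for the exchange when its result is actually needed.
    torch.cuda.current_stream().wait_stream(comm_stream)
    remote_out = remote_expert(recv_buf)
    return local_out, remote_out
```
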

Yizhi Li (@yizhilll):

Amazing!! I tried it with a very concise prompt to build a website for "predict your recent fortune by your date of birth", and it finished the job perfectly well.

Siwei Wu (吴思为) (@siweiwu7):

[1/n] Delighted to share our new work "COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values".
Paper: arxiv.org/abs/2504.05535
HF Daily Paper: huggingface.co/papers/2504.05…
Code: github.com/multimodal-art…
Data: huggingface.co/collections/m-…
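
For readers unfamiliar with preference data: an alignment dataset like this typically stores (prompt, chosen, rejected) triples. The record below is a made-up illustration, not COIG-P's actual schema:

```python
# Hypothetical shape of one preference record; field names are assumptions.
example = {
    "prompt":   "请用一句话解释什么是模型对齐？",  # "Explain model alignment in one sentence."
    "chosen":   "模型对齐是让模型的行为符合人类价值观和意图的过程。",  # helpful, on-topic answer
    "rejected": "不知道。",  # "I don't know." -- the dispreferred response
}

# Such (prompt, chosen, rejected) triples are the standard input for
# preference-optimization methods such as DPO or reward-model training.
```
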
Aran Komatsuzaki (@arankomatsuzaki):

Scaling Laws for Native Multimodal Models

- Early fusion exhibits stronger performance at lower parameter counts, is more efficient to train, and is easier to deploy, compared with late fusion (see the sketch below).
- Incorporating MoEs allows for models that learn modality-specific weights, significantly
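
The early/late distinction in the first bullet can be sketched in PyTorch-style code; the module names are illustrative, not the paper's implementation:

```python
import torch
import torch.nn as nn

class EarlyFusion(nn.Module):
    """One shared transformer consumes interleaved image and text tokens."""
    def __init__(self, backbone, img_tokenizer, txt_tokenizer):
        super().__init__()
        self.backbone, self.img_tok, self.txt_tok = backbone, img_tokenizer, txt_tokenizer

    def forward(self, image, text):
        tokens = torch.cat([self.img_tok(image), self.txt_tok(text)], dim=1)
        return self.backbone(tokens)  # every parameter sees both modalities

class LateFusion(nn.Module):
    """A separate vision encoder is projected into a language model."""
    def __init__(self, vision_encoder, projector, lm):
        super().__init__()
        self.vision, self.proj, self.lm = vision_encoder, projector, lm

    def forward(self, image, text_emb):
        img_emb = self.proj(self.vision(image))  # adapt vision features to LM space
        return self.lm(torch.cat([img_emb, text_emb], dim=1))
```
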
Ge Zhang (@gezhang86038849):

[1/n]
🚨 Game On for LLM Reasoning—Meet KORGym! 🎮✨

Ever wondered how to truly assess an LLM’s reasoning ability beyond memorized knowledge? 

Meet our latest breakthrough: KORGym—a dynamic, multi-turn game platform built to reveal the real reasoning skills of language models!
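
A dynamic, multi-turn game evaluation generally boils down to a loop like the one below; the `env`/`model` interfaces are assumptions for illustration, not KORGym's actual API:

```python
def play_episode(env, model, max_turns: int = 20) -> float:
    """Run one game episode and return the model's final score."""
    observation = env.reset()
    history = []
    for _ in range(max_turns):
        action = model.act(observation, history)      # LLM proposes the next move
        observation, reward, done = env.step(action)  # game state advances
        history.append((action, observation))
        if done:
            return reward  # rewards probe reasoning, not memorized facts
    return 0.0  # ran out of turns
```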