Manli Shu (@manlishu) 's Twitter Profile
Manli Shu

@manlishu

Multimodal LLMs @SFResearch | PhD @umdcs. Prev @Google @Nvidia Words are my own.

ID: 1325125504622604296

linkhttps://azshue.github.io/ calendar_today07-11-2020 17:18:44

38 Tweet

446 Followers

403 Following

Furong Huang (@furongh) 's Twitter Profile Photo

🚨 Breaking Research Discovery! 🚨 Large Vision Language Models (#VLMs) amaze with coherence but hide risks. 🤯🗡️ 🔍 Meet "Shadowcast"🥷: A stealthy, mind-bending AI data poisoning method. 🕵️‍♂️💻 Project page 🔗: vlm-poison.github.io #LLMs #DataSecurity A 🧵 👇

🚨 Breaking Research Discovery! 🚨
Large Vision Language Models (#VLMs) amaze with coherence but hide risks. 🤯🗡️

🔍 Meet "Shadowcast"🥷: A stealthy, mind-bending AI data poisoning method. 🕵️‍♂️💻

Project page 🔗: vlm-poison.github.io

#LLMs #DataSecurity 

A 🧵 👇
Silvio Savarese (@silviocinguetta) 's Twitter Profile Photo

Excited to share that our newest multi modal foundation model xGen-mm is out and it’s open source! It’s small (<5B models) and shining in both pre-trained and fine-tuned benchmarks. Check it out 👉Hugging Face: tinyurl.com/ybdmf5zs Salesforce AI Research #SalesforceAI #AI #ML

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens abs: arxiv.org/abs/2406.11271 A new interleaved multimodal pretraining dataset, consists of one trillion text tokens and three billion images, a 10x scale-up from existing

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

abs: arxiv.org/abs/2406.11271

A new interleaved multimodal pretraining dataset, consists of one trillion text tokens and three billion images, a 10x scale-up from existing
Manli Shu (@manlishu) 's Twitter Profile Photo

MINT-1T is now available on 🤗 huggingface.co/collections/ml…. A large-scale (1T tokens), open-source, interleaved image-text dataset with diverse data sources (HTML, PDFs, and ArXiv papers).

Anas Awadalla (@anas_awadalla) 's Twitter Profile Photo

We are excited to release🍃MINT-1T, the first one trillion token multimodal interleaved dataset with 3.4 billion images, built in collaboration with Salesforce AI Research! Dataset: github.com/mlfoundations/… Paper: arxiv.org/abs/2406.11271 Blog: blog.salesforceairesearch.com/mint-1t/ 🧵

We are excited to release🍃MINT-1T, the first one trillion token multimodal interleaved dataset with 3.4 billion images, built in collaboration with <a href="/SFResearch/">Salesforce AI Research</a>!

Dataset: github.com/mlfoundations/…
Paper: arxiv.org/abs/2406.11271
Blog: blog.salesforceairesearch.com/mint-1t/

🧵
Juan Carlos Niebles (@jcniebles) 's Twitter Profile Photo

We just open sourced TACO 🌮 ! arxiv: arxiv.org/abs/2412.05479 github: github.com/SalesforceAIRe… See this thread to learn more! ⬇️🧵