Daan van Esch (@daanvanesch) 's Twitter Profile
Daan van Esch

@daanvanesch

I work on speech and language technologies at Google. I like languages, history, maps, traveling, cycling, and buying way too many books.

ID: 443148287

linkhttp://www.daanvanesch.nl calendar_today21-12-2011 21:34:54

4,4K Tweet

1,1K Followers

907 Following

Jason Riesa (@jasonriesa) 's Twitter Profile Photo

🚀 Join the Gemini Multilinguality team Google DeepMind 🌐 We’re looking for researchers passionate about making LLMs helpful for all. Dramatically improve model quality, coverage, and cultural relevance across hundreds of languages. #NLProc #MultilingualAI #i18n #LLMs

Helsinki-NLP (@helsinkinlp) 's Twitter Profile Photo

The 18th MT marathon will be organized in beautiful Helsinki in the end of August, 2025. We invite you to a week-long gathering of researchers, developers and students with lectures, labs and hacking projects. More information will come - stay tuned!

Antoine Bosselut (@abosselut) 's Twitter Profile Photo

Translating MMLU is great, but global users of multilingual #LLMs don't care all that much about LLM's understanding of US Law! Our new #NLProc work centers multilingual #LLM evaluations toward regional knowledge. A community work led by Angelika Romanou in collab with Cohere For AI !

Martijn Bartelds (@barteldsmartijn) 's Twitter Profile Photo

Excited to announce the launch of our ML-SUPERB 2.0 challenge INTERSPEECH 2025 2025! Join us in pushing the boundaries of multilingual ASR and LID! 🚀 💻 multilingual.superbbenchmark.org

Aida Nematzadeh 🦋 (@aidanematzadeh) 's Twitter Profile Photo

I am hiring for RS/RE positions! If you are interested in language-flavored multimodal learning, evaluation, or post-training apply here 🦎 boards.greenhouse.io/deepmind/jobs/… I will also be #NeurIPS2024 so come say hi! (Please email me to find time to chat)

Daan van Esch (@daanvanesch) 's Twitter Profile Photo

Awesome to see another massively multilingual text data set, after GlotCC also became available recently! Looking forward to reading more about how the challenge of language identification was tackled here (it's one of my favorite problems: aclanthology.org/2020.coling-ma…)

Guilherme Penedo (@gui_penedo) 's Twitter Profile Photo

We have recently released 🥂 FineWeb2: a large pretraining dataset with 1000s of languages. Today, we're launching a large community initiative to make it better: an annotation effort to classify the educational content of data in these languages, inspired by 📚 FineWeb-Edu.

We have recently released 🥂 FineWeb2: a large pretraining dataset with 1000s of languages.

Today, we're launching a large community initiative to make it better: an annotation effort to classify the educational content of data in these languages, inspired by 📚 FineWeb-Edu.
toonsutraofficial (@toonsutra) 's Twitter Profile Photo

📢 Working with Google DeepMind to revolutionize comic translation using Gemini 2.0 Flash. 🎯 Making global stories accessible in Indian languages 📚 3,100+ titles and growing 🤖 AI-powered translations Learn more: ai.google.dev/showcase/toons…

Amir H. Kargaran (@amir_nlp) 's Twitter Profile Photo

If you are at #NeurIPS2024 and wondering how we can secure clean document-level pretraining data for ~1,000 languages, come to our poster this evening in EAST Hall, poster 3102. I'm wearing an Munich Center for Machine Learning t-shirt.

If you are at #NeurIPS2024 and wondering how we can secure clean document-level pretraining data for ~1,000 languages, come to our poster this evening in EAST Hall, poster 3102. I'm wearing an <a href="/MunichCenterML/">Munich Center for Machine Learning</a> t-shirt.
Google AI (@googleai) 's Twitter Profile Photo

Applications open on Dec 20 for the Research Scholar program, which aims to strengthen long-term collaboration with the academic community by supporting early-career professors pursuing research in fields relevant to Google. Learn more & apply by Jan 27 ↓goo.gle/RS

Jeff Dean (@jeffdean) 's Twitter Profile Photo

If you're a researcher and think access to our Gemini models could accelerate or enable your research, check out our Gemini for Research program! ai.google.dev/gemini-api/doc… Read more below as well ⬇️

Wikimedia Foundation (@wikimedia) 's Twitter Profile Photo

Five new languages are now live on Wikipedia, thanks to the Future of Language Incubation initiative! Southern Ndebele, Pannonian Rusyn, Iban, Obolo, and Tai Nüa are now part of the platform, making it easier for diverse communities to share knowledge ➡️ w.wiki/C9fX

Five new languages are now live on Wikipedia, thanks to the Future of Language Incubation initiative!

Southern Ndebele, Pannonian Rusyn, Iban, Obolo, and Tai Nüa are now part of the platform, making it easier for diverse communities to share knowledge ➡️ w.wiki/C9fX
Heiga Zen (全 炳河) (@heiga_zen) 's Twitter Profile Photo

Calling researchers! Google.org Scientific Advancement team is accepting applications for Research Scholar Program. It provides funding & support to researchers working on projects that have the potential to make positive impacts on the world research.google/programs-and-e…

Ankur Bapna (@ankurbpn) 's Twitter Profile Photo

Happy to see the first feature powered by Gemini native audio outputs ship out to public - especially since it's MASSIVELY multilingual. Lots more coming soon 😉

Simon (@tokumin) 's Twitter Profile Photo

Time for another NotebookLM Ship! 🚢 Today we're launching NotebookLM Audio Overviews in over 50 languages (actually closer to 75!). This was a HUGE lift from across Labs, GDM, Speech, Core and all our international Googlers who tuned in and gave feedback leading up to launch.

Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

We’re bringing 50+ languages to NotebookLM’s Audio Overviews, including French, Hindi, Japanese + Portuguese, with more on the way. 🌍 Get podcast conversations based on the sources you give it, even if they are in different languages. Our Audio Overview hosts demonstrate: