Manos Tsagkias (@samanos) 's Twitter Profile
Manos Tsagkias

@samanos

Siri@Apple. X-Founder @904labs, @SolumbroLeisure, @MyYardSoftware

ID: 49287656

Link: http://manostsagkias.com

Joined: 21-06-2009 10:40:13

839 Tweets

731 Followers

666 Following

Manos Tsagkias (@samanos) 's Twitter Profile Photo

Happy to share the preprint for our #icassp2021 paper "Error-driven Pruning of Language Models for Virtual Assistants" w/ the Siri Speech team at Apple – arxiv.org/abs/2102.07219

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

Andrej Karpathy We've built an openly available, massive text dataset with 800GB: pile.eleuther.ai We're also building a massive image and image-text dataset that are open with each image having either CC0, CC-BY or CC-BY-SA license. I'm truly excited for the future of open datasets!

Chip Huyen (@chipro) 's Twitter Profile Photo

An early draft of the machine learning interviews book is out 🥳 The book is open-sourced and free. Job search is a stressful process, and I hope that this effort can help in some way. Contributions and feedback are appreciated! huyenchip.com/ml-interviews-…

Max Welling (@wellingmax) 's Twitter Profile Photo

Proud to share the webpage for the brand-new Microsoft Research Lab Amsterdam where we will work on molecular simulation for green technology and healthcare. We are hiring deep learning for computational chemistry researchers and engineers. microsoft.com/en-us/research…

Jimmy Lin (@lintool) 's Twitter Profile Photo

Most of the #trec2021 baselines on the MS MARCO V2 data are now available in Anserini as reproducible regression tests. Enjoy! github.com/castorini/anse…

Guido van Rossum (@gvanrossum) 's Twitter Profile Photo

Just discovered Hedy, a gradual programming language. It's a new idea on how to teach programming to beginners. Very cool! hedycode.com Hedy

Steph from OpenVC (@stephnass) 's Twitter Profile Photo

We've built THE list of VC lists. 23 databases. 220k+ investors. 100% free. Please like + RT this tweet to help a founder out. Aaaand... here we go 👇

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

1) What is LaMDA and What Does it Want? cajundiscordian.medium.com/what-is-lamda-… 2) Interview cajundiscordian.medium.com/is-lamda-senti… What can be said with confidence imo is that things are about to get a lot weirder because models appear to follow smooth scaling laws and data+model size can still plenty grow.

Manos Tsagkias (@samanos) 's Twitter Profile Photo

The times are a-changin’? Google for general search, and TikTok for lifestyle search? A more elaborate version of this thread would make an interesting IR paper.

NASA Exoplanets (@nasaexoplanets) 's Twitter Profile Photo

The misconception that there is no sound in space originates because most space is a ~vacuum, providing no way for sound waves to travel. A galaxy cluster has so much gas that we've picked up actual sound. Here it's amplified, and mixed with other data, to hear a black hole!

Claudia Hauff 🇪🇺 🇺🇦 🇩🇪 🇳🇱 (@charlottehase) 's Twitter Profile Photo

Still going through the #SIGIR2022 proceedings. Here is another gem: a neat workflow of which metric to choose for which ranking problem. From "Ranking Interruptus: When Truncated Rankings Are Better and How to Measure That" by Enrico Amigó et al. PDF: damianospina.com/publication/am…

Manos Tsagkias (@samanos) 's Twitter Profile Photo

Code search seems an interesting area for IR w/ a mix of challenges from (micro-)blog and desktop search: rapid updates, and little internal structure (e.g., links). Do we have datasets for code search? theregister.com/2023/02/07/git…

Awni Hannun (@awnihannun) 's Twitter Profile Photo

Just in time for the holidays, we are releasing some new software today from Apple machine learning research. MLX is an efficient machine learning framework specifically designed for Apple silicon (i.e. your laptop!) Code: github.com/ml-explore/mlx Docs: ml-explore.github.io/mlx/build/html…

apenwarr (@apenwarr) 's Twitter Profile Photo

This is the nerdiest article on org design I have ever read. Inject it straight into my brain! codahale.com//work-is-work/

John Langford (@johnclangford) 's Twitter Profile Photo

New reqs for low to high level researcher positions: jobs.careers.microsoft.com/global/en/job/… , jobs.careers.microsoft.com/global/en/job/…, jobs.careers.microsoft.com/global/en/job/…, jobs.careers.microsoft.com/global/en/job/…, with postdocs from Akshay and Miro Dudik x.com/MiroDudik/stat… . Please apply or pass to those who may :-)

Sebastian Raschka (@rasbt) 's Twitter Profile Photo

"What Matters In Transformers?" is an interesting paper (arxiv.org/abs/2406.15786) that finds you can actually remove half of the attention layers in LLMs like Llama without noticeably reducing modeling performance. The concept is relatively simple. The authors delete attention

Jacob Austin (@jacobaustin132) 's Twitter Profile Photo

Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n

Katja Hofmann (@katjahofmann) 's Twitter Profile Photo

Today in Nature: our research on world and human action models (WHAM) - generative ai models of video games, aimed towards supporting game creatives in gameplay ideation : nature.com/articles/s4158… - huge congrats to everyone who made this happen, I couldn't be more proud 🥳