Christophe Cerisara (@ccerisara) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Jacob Eisenstein

@jacobeisenstein

2 years ago

Curious about co2 emissions associated with conference travel? Roy Schwartz and i ran the numbers! (just in time for #EMNLP2023 and #NeurIPS2023) gist.github.com/jacobeisenstei…

thumb_up_off_alt42

chat_bubble_outline3

repeat13

shareShare

With Benoît de Courson and Benjamin Azoulay from Gallicagram we are releasing on Hugging Face what is probably the largest open corpus in French: 85 billon words in the public domain. huggingface.co/datasets/PleIA… huggingface.co/datasets/PleIA…

thumb_up_off_alt351

chat_bubble_outline15

repeat102

shareShare

Stéphane Bortzmeyer

@bortzmeyer

a year ago

Comme disait la CNIL : il faut utiliser Microsoft Azure pour les données de santé, il n'y a pas d'hébergeur européen sérieux en sécurité techreport.com/news/microsoft…

thumb_up_off_alt315

chat_bubble_outline9

repeat175

shareShare

Alexander Doria

@dorialexander

a year ago

Announcing today in @Wired the release of Common Corpus, the largest collection of fully open corpus on HuggingFace: nearly 500b words (600-700b tokens) in public domain. wired.com/story/proof-yo…

thumb_up_off_alt648

chat_bubble_outline21

repeat157

shareShare

Nando de Freitas

@nandodf

a year ago

There appears to be a mismatch between publishing criteria in AI conferences and "what actually works". It is easy to publish new mathematical constructs (e.g. new models, new layers, new modules, new losses), but as Apple's MM1 paper concludes: 1. Encoder Lesson: Image

thumb_up_off_alt1,1K

chat_bubble_outline15

repeat196

shareShare

Thomas Wolf

@thom_wolf

a year ago

[75min talk] i finally recorded this lecture I gave two weeks ago because people kept asking me for a video so here it is, enjoy "The Little guide to building Large Language Models in 2024" tried to keep it short and comprehensive – focusing on concepts that are crucial for

thumb_up_off_alt1,1K

chat_bubble_outline14

repeat241

shareShare

Yixin Wan @EMNLP2024

@yixin_wan_

a year ago

I call it “social awareness” 🤣

thumb_up_off_alt270

chat_bubble_outline6

repeat16

shareShare

Alexander Doria

@dorialexander

a year ago

Big announcement: pleias releases a massive open corpus of 2 million Youtube videos in Creative Commons (CC-By) on Hugging Face. Youtube-Commons features 30 billion words of audio transcriptions in multiple languages, and soon other modalities huggingface.co/datasets/PleIA…

Big announcement: <a href="/pleiasfr/">pleias</a> releases a massive open corpus of 2 million Youtube videos in Creative Commons (CC-By) on <a href="/huggingface/">Hugging Face</a>. Youtube-Commons features 30 billion words of audio transcriptions in multiple languages, and soon other modalities huggingface.co/datasets/PleIA…

thumb_up_off_alt565

chat_bubble_outline21

repeat126

shareShare

Emile Marzolf

@emile_marzolf

a year ago

🤖Je vois que l’IA générative “Albert” de l’Etat fait beaucoup parler, intéresse et est aussi critiquée/moquée. On retrace la genèse de ce projet, aujourd’hui testé à petite échelle auprès d’une soixantaine de conseillers des maisons France Services ⤵️

thumb_up_off_alt296

chat_bubble_outline14

repeat139

shareShare

Alexander Doria

@dorialexander

a year ago

Il y a 10 ans, nous avions fait fuiter l'accord de licence nationale avec Elsevier avec Rayna ¯\_(ツ)_/¯ 😷🤓🧬🇪🇺👩‍💻📚✍️ dans Rue89. Depuis la science ouverte a énormément avancé (le plan S, mandat sur HAL, baromètre du MESRI), et on verse toujours des millions à Elsevier.

$Il y a 10 ans, nous avions fait fuiter l'accord de licence nationale avec Elsevier avec <a href="/MaliciaRogue/">Rayna ¯\_(ツ)_/¯ 😷🤓🧬🇪🇺👩‍💻📚✍️</a> dans Rue89. Depuis la science ouverte a énormément avancé (le plan S, mandat sur HAL, baromètre du MESRI), et on verse toujours des millions à Elsevier.$

thumb_up_off_alt41

chat_bubble_outline3

repeat27

shareShare

Stephen Mayhew

@mayhewsw

a year ago

How to solve the peer review crisis? Promise 5 citations to each reviewer.

thumb_up_off_alt18

chat_bubble_outline1

repeat1

shareShare

CNRS 🌍

@cnrs

a year ago

Naïo Technologies LACTIPS AldoriaSpace Vect-Horus HEPHAISTOS-Pharma Biomemory Ministère Enseignement supérieur et Recherche Viva Technology Réseau SATT CNRS Innovation #CNRStalks autour de l'#IA générative sur l'espace CNRS 🌍 (J49) à #Vivatech avec Adeline Nazarenko, directrice de CNRS Sciences informatiques, Christophe Cerisara, chercheur CNRS 🌍 et Romuald Elie, responsable de l'équipe de recherche en IA à Google DeepMind

<a href="/naiotech/">Naïo Technologies</a> <a href="/LACTIPS_SA/">LACTIPS</a> <a href="/AldoriaSpace/">AldoriaSpace</a> <a href="/VectHorus/">Vect-Horus</a> <a href="/HEPHAISTOSPhar1/">HEPHAISTOS-Pharma</a> <a href="/BiomemoryLabs/">Biomemory</a> <a href="/sup_recherche/">Ministère Enseignement supérieur et Recherche</a> <a href="/VivaTech/">Viva Technology</a> <a href="/ReseauSATT/">Réseau SATT</a> <a href="/cnrsinnovation/">CNRS Innovation</a> #CNRStalks autour de l'#IA générative sur l'espace <a href="/CNRS/">CNRS 🌍</a> (J49) à #Vivatech avec Adeline Nazarenko, directrice de <a href="/CNRSinformatics/">CNRS Sciences informatiques</a>, Christophe Cerisara, chercheur <a href="/CNRS/">CNRS 🌍</a> et Romuald Elie, responsable de l'équipe de recherche en IA à <a href="/GoogleDeepMind/">Google DeepMind</a>

thumb_up_off_alt6

chat_bubble_outline1

repeat1

shareShare

hubert guillaud

@hubertguillaud

a year ago

La question de la souveraineté technologique est souvent mal posée. En Europe et en France particulièrement, l'opinion commune veut que la réglementation européenne soit la cause de tous nos problèmes et notamment que nous n'ayons pas de grandes entreprises de la tech. 1/10

thumb_up_off_alt24

chat_bubble_outline1

repeat17

shareShare

yobibyte

@y0b1byte

a year ago

New blog! Notebooks are McDonalds of Code. You can come to McDonalds and order a salad, but you won't. Same with notebooks, you can write NASA-production-grade software in a notebook, but most likely you won't. Notebooks make you lazy, and encourage bad practices. **common

thumb_up_off_alt326

chat_bubble_outline30

repeat41

shareShare

Stéphane Bortzmeyer

@bortzmeyer

a year ago

L'Internet des Objets, c'est quand on ne peut plus allumer la lumière car le contrôleur a les résolveurs #DNS d'#OpenDNS en dur et qu'on ne peut pas les changer : x.com/seheyah/status…

thumb_up_off_alt487

chat_bubble_outline28

repeat188

shareShare

Loria

@labo_loria

a year ago

👏 Toutes nos félicitations à Alaaeddine Chaoub (Synalp) pour sa soutenance de thèse ! 📚"Deep learning representations for prognostics and health management" Lorraine CNRS Centre-Est CNRS Sciences informatiques Centre Inria de l'Université de Lorraine

thumb_up_off_alt3

chat_bubble_outline0

repeat2

shareShare

maxime amblard

@maximeamblard

a year ago

Vous entrez en L3 et vous voulez faire de #IA, du #nlp, des #llm bref vous préparer à entrer dans le master international de TAL à l’IDMC ! Nous avons encore des places ouvertes pour la rentrée.

thumb_up_off_alt5

chat_bubble_outline0

repeat4

shareShare

Brice Le Borgne

@briceleborgne

a year ago

Ce sont des données passionnantes que le ministère refuse de publier. Obtenues par franceinfo, les dotations horaires des établissements révèlent de fortes inégalités : les lycées privés sont souvent mieux dotés que ceux du public.

Ce sont des données passionnantes que le ministère refuse de publier. Obtenues par <a href="/franceinfo/">franceinfo</a>, les dotations horaires des établissements révèlent de fortes inégalités : les lycées privés sont souvent mieux dotés que ceux du public.

thumb_up_off_alt1,1K

chat_bubble_outline38

repeat1,1K

shareShare

MT Group at FBK

@fbk_mt

10 months ago

Now it's our Sara Papi presenting "Mosel: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages" soon to be published at #EMNLP2024 #LT2024FBK

Now it's our <a href="/sarapapi/">Sara Papi</a> presenting "Mosel: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages" soon to be published at #EMNLP2024

#LT2024FBK

thumb_up_off_alt31

chat_bubble_outline0

repeat7

shareShare

David Picard

@david_picard

10 months ago

Avec le prix Nobel de physique pour l'apprentissage et les réseaux de neurones, il est clair qu'on est face à un changement majeur (type électricité, nucléaire, télécom). Il nous faut une politique nationale plus ambitieuse! Jean Zay est déjà trop petit! Genci Ministère Enseignement supérieur et Recherche

thumb_up_off_alt22

chat_bubble_outline1

repeat5

shareShare