BigScience Research Workshop (@bigsciencew)'s Twitter Profile
BigScience Research Workshop

@bigsciencew

A research workshop on large language models, gathering 1000+ researchers around the world

Follow the training of the 176B multilingual model live @BigScienceLLM

ID: 1385228159604465667

Website: http://bigscience.huggingface.co | Joined: 22-04-2021 13:45:29

354 Tweets

14.14K Followers

1 Following

BigCode (@bigcodeproject)'s Twitter Profile Photo

print("Hello world! 🎉")

Excited to announce the BigCode project, led by ServiceNow Research and Hugging Face! In the spirit of BigScience, we aim to develop large language models for code in an open and responsible way.

Join here: bigcode-project.org/docs/about/joi…

A thread with our goals🧵
Sasha Luccioni, PhD 🦋🌎✨🤗 (@sashamtl)'s Twitter Profile Photo

The BigScience Research Workshop carbon footprint paper is live!! 🎉 Check it out to see how we calculated BLOOM's carbon footprint, covering all steps from the manufacturing of equipment 💻 to deployment! 🚀 arxiv.org/abs/2211.02001
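For context, the operational (training-time) part of such an estimate can be measured with a runtime tracker. Here is a minimal sketch using the codecarbon library; note that the paper's accounting goes further, also covering embodied emissions from equipment manufacturing, which no runtime tracker can capture, and codecarbon is shown here as an illustration rather than as the paper's exact tooling:

```python
# Minimal sketch: measuring the operational carbon footprint of a workload
# with the codecarbon library. This only covers energy used while the code
# runs; the BLOOM paper also accounts for embodied (manufacturing) emissions,
# which a runtime tracker cannot see.
from codecarbon import EmissionsTracker

tracker = EmissionsTracker(project_name="my_training_run")
tracker.start()
try:
    sum(i * i for i in range(10_000_000))  # stand-in for a real training loop
finally:
    emissions_kg = tracker.stop()  # estimated kg of CO2-equivalent
print(f"Estimated emissions: {emissions_kg:.6f} kg CO2eq")
```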

Niklas Muennighoff (@muennighoff)'s Twitter Profile Photo

Crosslingual Generalization through Multitask Finetuning 🌸

Demo: huggingface.co/bigscience/blo…
📜 arxiv.org/abs/2211.01786
💻 github.com/bigscience-wor…

We present BLOOMZ & mT0, a family of models w/ up to 176B params that follow human instructions in >100 languages zero-shot. 1/7
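If you want to try this yourself, here is a minimal sketch using the Hugging Face transformers library; the small bigscience/bloomz-560m variant stands in for the full 176B model, which needs multi-GPU hardware to run:

```python
# Minimal sketch of zero-shot instruction following with BLOOMZ via the
# transformers library. The 560m checkpoint is used so this runs on modest
# hardware; swap in "bigscience/bloomz" for the full 176B model.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "bigscience/bloomz-560m"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

# Instructions can be written in any of the >100 supported languages.
prompt = "Translate to English: Je t'aime."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```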
BigScience Research Workshop (@bigsciencew)'s Twitter Profile Photo

Big day today with two papers out! The BLOOM carbon footprint paper at arxiv.org/abs/2211.02001, and the new models BLOOMZ and mT0 at huggingface.co/bigscience/blo…

clem 🤗 (@clementdelangue)'s Twitter Profile Photo

The BLOOM paper is out. Looks like it's doing worse than the current GPT-3 API on zero-shot generation tasks in English, but better than other open-source LLMs, and better than all of them on zero-shot multilingual tasks (which was the main goal). Proud of the work from the community! arxiv.org/abs/2211.05100
Max Ryabinin (@m_ryabinin)'s Twitter Profile Photo

Petals, a system for easy decentralized inference and adaptation of 100B+ LLMs, is now online!

🌸Generate text with BLOOM-176B using Colab or a desktop GPU
🔌Fine-tune large models for your tasks
👥Help others by contributing your GPUs or hosting a new swarm
colab.research.google.com/drive/1Ervk6HP…
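For the curious, here is a rough sketch of what client-side usage looked like, based on the Petals README of this era; the class and checkpoint names are assumptions that may differ across Petals versions:

```python
# Rough sketch of generating text with BLOOM-176B over the public Petals
# swarm. Class and checkpoint names follow the Petals README from around the
# time of this announcement and may differ in later releases of the library.
from transformers import BloomTokenizerFast
from petals import DistributedBloomForCausalLM  # assumed class name for this era

model_name = "bigscience/bloom-petals"  # assumed Petals-flavored checkpoint id
tokenizer = BloomTokenizerFast.from_pretrained(model_name)
model = DistributedBloomForCausalLM.from_pretrained(model_name)

# Only a small client runs locally; the transformer blocks are executed
# remotely by volunteers' GPUs in the swarm.
inputs = tokenizer("A cat sat on", return_tensors="pt")["input_ids"]
outputs = model.generate(inputs, max_new_tokens=5)
print(tokenizer.decode(outputs[0]))
```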
Yong Zheng-Xin (Yong) (@yong_zhengxin)'s Twitter Profile Photo

(Repost for corrected Arxiv)
🧐What’s the best way to quickly adapt large multilingual language models to new languages? 

We present our new paper from BigScience Research Workshop 🌸:
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

📜 arxiv.org/abs/2212.09535

[1/9]
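As a loose illustration of the adapter route the paper evaluates (not its exact recipe), here is a sketch of parameter-efficient adaptation with the peft library; the base checkpoint, the choice of LoRA as the adapter method, and the hyperparameters are all placeholders:

```python
# Loose illustration of adapter-style language adaptation of a small BLOOM
# checkpoint using the peft library. This is NOT the paper's exact setup
# (it compares several adapter methods and continued pretraining); values
# here are hypothetical placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "bigscience/bloom-560m"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# Train only small adapter weights on monolingual text in the new language,
# leaving the frozen base model intact.
config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["query_key_value"],  # BLOOM's fused attention projection
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # a fraction of a percent of the full model
```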
Anna Rogers (@annargrs)'s Twitter Profile Photo

Worried about benchmark data contamination? Studying LLM memorization or attribution? BigScience Research Workshop's BLOOM 🌸 now has exact & fuzzy search over its full training data! With Ola Piktus, Christopher Akiki, Paulo Villegas, Hugo Laurençon, @ggdupont, Sasha Luccioni, and Yacine Jernite. arxiv.org/abs/2302.14035 /1

Aran Komatsuzaki (@arankomatsuzaki)'s Twitter Profile Photo

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Documents the data creation and curation efforts behind the ROOTS corpus, the 1.6TB dataset used to train BLOOM

Releases a large initial subset of the corpus

data: huggingface.co/bigscience-data
abs: arxiv.org/abs/2303.03915
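Here is a minimal sketch of loading one released component with the Hugging Face datasets library; the dataset id below is hypothetical, so browse huggingface.co/bigscience-data for real component names (many are gated and require accepting access terms on the Hub first):

```python
# Minimal sketch of loading one component of the released ROOTS subset with
# the datasets library. The dataset id below is illustrative, not a confirmed
# name; see huggingface.co/bigscience-data for the actual components, many of
# which are gated behind an access agreement.
from datasets import load_dataset

ds = load_dataset("bigscience-data/roots_en_wikipedia", split="train")  # hypothetical id
print(ds[0]["text"][:200])  # field names may also differ by component
```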
Giada Pistilli (@giadapistilli)'s Twitter Profile Photo

As you already know, I am very proud of the collective work that enabled the development of BigScience Research Workshop's ethical charter. Today I am even more proud to announce that it's part of OECD Innovation's catalog to promote Trustworthy AI: such a milestone! oecd.ai/en/catalogue/t…

BigCode (@bigcodeproject)'s Twitter Profile Photo

Join us tomorrow, Wednesday the 22nd (6:30 PM - 8:00 PM CET), at the Mozilla Festival Science Fair to learn more about our work on the open and responsible development of large language models (LLMs) for code. schedule.mozillafestival.org/session/TJRU3L… #Mozfest

BigCode (@bigcodeproject)'s Twitter Profile Photo

Introducing: 💫StarCoder

StarCoder is a 15B LLM for code with an 8k context window, trained only on permissively licensed data in 80+ programming languages. It can be prompted to reach 40% pass@1 on HumanEval and to act as a Tech Assistant.

Try it here: shorturl.at/cYZ06r

Release thread🧵
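Here is a minimal sketch of generating code with StarCoder through transformers; note the bigcode/starcoder checkpoint is gated on the Hugging Face Hub, so you need to accept its license and log in first:

```python
# Minimal sketch of code generation with StarCoder via transformers. The
# bigcode/starcoder checkpoint is gated: accept its license on the Hub and
# log in (huggingface-cli login) before running. device_map="auto" requires
# the accelerate package.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "bigcode/starcoder"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto")

prompt = "def fibonacci(n):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=48)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```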
MMitchell (@mmitchell_ai)'s Twitter Profile Photo

If you wanted to see the fun panel/Q&A we did with Londoners on AI, you can check out the recording here! My presentation at the start is also on Open Science, representing Hugging Face & BigScience Research Workshop.

Sasha Luccioni, PhD 🦋🌎✨🤗 (@sashamtl)'s Twitter Profile Photo

Never thought I'd see the day I'd have a publication in JMLR 🥹 
So happy that the BLOOM carbon footprint paper has finally found a home at such an incredible venue!
Thank you Shakir Mohamed for being such a great editor, it warms my heart to see your name on this paper 💚
Yacine Jernite (@yjernite)'s Twitter Profile Photo

I respect the caution, but also need to stress that efforts that pursue transparency as an operational value in service of actual inclusion and accountability do exist - see for example the writing on this very topic by BigScience Research Workshop, including its ethical charter. 1/3

Stas Bekman (@stasbekman)'s Twitter Profile Photo

The Universal Checkpointing paper is out! arxiv.org/abs/2406.18820 If you remember the BigScience Research Workshop BLOOM-176B training, Tunji Ruwase and I co-invented this technology for Megatron-DeepSpeed to make it possible to quickly scale the node topology up and down while continuing training.
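Roughly, the workflow is: save a regular distributed checkpoint, convert it to a topology-agnostic universal format, then resume on a different GPU count. Here is a sketch under the assumption that the script and config key names match the DeepSpeed docs of the time; verify them against your DeepSpeed version before relying on this:

```python
# Rough sketch of the Universal Checkpointing workflow as exposed by
# DeepSpeed. Script and config key names below are from memory of the
# DeepSpeed docs and may vary by version; treat as illustrative only.
#
# 1. Train on, say, 512 GPUs and save a regular ZeRO/Megatron checkpoint.
# 2. Convert it to the topology-agnostic universal format (assumed script):
#      python -m deepspeed.checkpoint.ds_to_universal \
#          --input_folder ckpt/global_step1000 \
#          --output_folder ckpt/global_step1000_universal
# 3. Resume on a different number of GPUs by telling DeepSpeed to load the
#    universal checkpoint, e.g. via the engine config:
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "zero_optimization": {"stage": 1},
    "checkpoint": {"load_universal": True},  # assumed key; check your DeepSpeed version
}
```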

Oxford Internet Institute (@oiioxford)'s Twitter Profile Photo

DPhil candidate @cailean_osborne shares reflections on the Open Source Initiative's co-design process to define #opensourceAI and recommends next steps, including improving model safety and supporting more grassroots initiatives like BigScience Research Workshop.