Hanlin Tang (@hanlintang) 's Twitter Profile
Hanlin Tang

@hanlintang

cto for neural networks @Databricks. previously: cto/co-founder of @MosaicML, director of @intel AI lab, @NervanaSystems

ID: 61466630

linkhttps://github.com/mosaicml/composer calendar_today30-07-2009 11:48:59

262 Tweet

1,1K Followers

448 Following

Yam Peleg (@yampeleg) 's Twitter Profile Photo

Absolutely not. The rules of the game are: If released the model: - You can to compare yourself to other open models as much as you want. If you don’t release the model: - Compare yourself also to GPT-4 before claiming who you outperform or not. Comparing only down: Not fair.

MatthewBerman (@matthewberman) 's Twitter Profile Photo

DBRX by Databricks ...it's REALLY good!! The New MoE 132b parameter model is open-source and costs $10 m to train. Thank you, Databricks, for your contribution to OS. Check out the full explanation and testing: πŸŽ₯πŸ‘‡

Bill Yuchen Lin (@billyuchenlin) 's Twitter Profile Photo

πŸ†• Check out the recent update of π•Žπ•šπ•π••π”Ήπ•–π•Ÿπ•”π•™! We have included a few more models including DBRX-Instruct Databricks and StarlingLM-beta (7B) Nexusflow which are both super powerful! DBRX-Instruct is indeed the best open LLM; Starling-LM 7B outperforms a lot of even

πŸ†• Check out the recent update of π•Žπ•šπ•π••π”Ήπ•–π•Ÿπ•”π•™! We have included a few more models including DBRX-Instruct <a href="/databricks/">Databricks</a> and StarlingLM-beta (7B) <a href="/NexusflowX/">Nexusflow</a> which are both super powerful! DBRX-Instruct is indeed the best open LLM; Starling-LM 7B outperforms a lot of even
jasmine collins (@jazco) 's Twitter Profile Photo

we all know how important LLM evaluation is.. πŸ€” i’m excited to FINALLY announce that we are starting a new πŸ“’ recipe-based evals team!!! πŸ“’ for our first study, we compared 5 LLM-generated chili recipes with the prompt: β€œGive me a chili recipe with an interesting twist” (1/n)

we all know how important LLM evaluation is.. πŸ€”

i’m excited to FINALLY announce that we are starting a new πŸ“’ recipe-based evals team!!! πŸ“’

for our first study, we compared 5 LLM-generated chili recipes with the prompt: β€œGive me a chili recipe with an interesting twist” (1/n)
virat (@virattt) 's Twitter Profile Photo

Friday is LLM battle day. I added DBRX to the financial metrics challenge. Overall, very impressed with DBRX. Main takeaways: β€’ correctly calculated metrics β€’ ranked top 4 fastest models β€’ competitive pricing DBRX was +50% cheaper and +100% faster than models in its tier.

Friday is LLM battle day.

I added DBRX to the financial metrics challenge.

Overall, very impressed with DBRX.

Main takeaways:
β€’ correctly calculated metrics 
β€’ ranked top 4 fastest models
β€’ competitive pricing

DBRX was +50% cheaper and +100% faster than models in its tier.
Tessa Barton (@tessybarton) 's Twitter Profile Photo

LLM evals are a mess! They are noisy, inconsistent, and contradictory. Scaling laws on the other hand have consistently held up to increasing scrutiny. Can we use the reliability of scaling laws to predict the quality of our eval benchmarks?

LLM evals are a mess! They are noisy, inconsistent, and contradictory. Scaling laws on the other hand have consistently held up to increasing scrutiny. Can we use the reliability of scaling laws to predict the quality of our eval benchmarks?
Ali Ghodsi (@alighodsi) 's Twitter Profile Photo

Databricks to acquire Tabular (now part of Databricks), a data platform from the original creators of Apache Iceberg. Together, we will bring format compatibility to the lakehouse for Delta Lake and Apache Iceberg databricks.com/blog/databrick…

Matei Zaharia (@matei_zaharia) 's Twitter Profile Photo

Super excited about the new Agent Framework, Tool Catalog, Vector Search, Evaluation and Training capabilities we launched today in Mosaic AI. We see more companies building compound AI systems, and we have created an end-to-end environment to do this. databricks.com/blog/mosaic-ai…

Super excited about the new Agent Framework, Tool Catalog, Vector Search, Evaluation and Training capabilities we launched today in Mosaic AI. We see more companies building compound AI systems, and we have created an end-to-end environment to do this. databricks.com/blog/mosaic-ai…
Patrick Wendell (@pwendell) 's Twitter Profile Photo

Meta AI's Llama release today is really important, likely the most important open source AI announcement ever. Many people don't understand why: 1. The quality gap between the best proprietary and open models has effectively vanished. No one really knew if this gap would get

Michael Bendersky (@bemikelive) 's Twitter Profile Photo

This is a good opportunity to announce that I recently joined the research team at Databricks where I will be working alongside Jonathan Frankle Rishabh Singh Matei Zaharia Erich Elsen, and many others on the hardest problems at the intersection of information retrieval and AI.

Freddie Vargus (@freddie_v4) 's Twitter Profile Photo

today we're releasing a new small model (0.5B) for detecting problems with tool usage in agents, trained on 50M tokens from publicly available MCP server tools it's great at picking up on tool accuracy issues and outperforms larger models