Jose Javier Gonzalez
@jjgort
Research Scientist at MosaicAI DataBricks. Working on LLMs
ID: 815647100100808705
http://josejg.com 01-01-2017 19:53:17
28 Tweet
369 Followers
91 Following
ML+art project with Divya Shanmugam, Jose Javier Gonzalez, and artist, Agnieszka Kurant ! Our GAN-based approach generates signatures containing features learned from a collection of MIT and Cambridge residents’ signatures. #creativeAI #MachineLearning MIT Clinical and Applied Machine Learning listart.mit.edu/agnieszka-kura…
Bilal AI at Meta LMSYS Org no. people misunderstand chinchilla. chinchilla doesn't tell you the point of convergence. it tells you the point of compute optimality. if all you care about is perplexity, for every FLOPs compute budget, how big model on how many tokens should you train? for reasons not fully
Popular #LLM scaling laws only factor in training costs, and ignore the costs of deployment. In a paper presented at ICML Conference 2024, Databricks Mosaic AI researchers Nikhil Sardana, Jacob Portes, and Sasha Doubov propose a modified scaling law that considers the cost of both
*LoRA Learns Less and Forgets Less* is now out in its definitive edition in TMLR🚀 Checkout the latest numbers fresh from the Databricks Mosaic Research oven 👨🍳
RLVR and test-time compute are a powerful combo for enterprises, so much so that Databricks now leads overall BIRD single-model leaderboard. This isn't about BIRD, though. It's an example of what our customers are accomplishing in their domains with our RL recipe in Agent Bricks