shane (@shncldwll)'s Twitter Profile
shane

@shncldwll

pentester + ml eng. building hackbots

ID: 970139534653624322

https://hackbot.dad · 04-03-2018 03:31:04

1.1K Tweets

457 Followers

405 Following

moo (@moo_hax)

Will be hanging out at the Agentic Summit this Saturday. Happy to meet up and talk agent observability, evals, and deployment for cyber security. rdi.berkeley.edu/events/agentic…

shane (@shncldwll)

there is truly no social media-ism that makes me stop reading faster than 'let that sink in'. i would rather see "it's not x, it's y" it's beyond self parody! give it up! the sentence can draw attention to itself if you write it correctly!

shane (@shncldwll)

Wrote about evals at Dreadnode. This one is for hackers getting up to speed on agents for their use cases. How do you go from PoC to prod? Don't wait for a lab to build benchmarks that measure what you care about. Do it yourself. Here's how:

shane (@shncldwll)

before posting writing online, it's important to read it out loud to yourself. that way every time you hit a difficult to read sentence you can get really mad and delete the whole piece, preventing anyone online from suffering

Alexander Doria (@dorialexander)

Recipe gets confirmed: drop pure verifiable, take the most performant judge that can fit on GPU (latency not a big issue, so long as batches per step are small) and ask for soft critique.